Gupta et al. (2026) Optimizing surveillance efficiency with deep learning-driven flood segmentation
Identification
- Journal: Natural Hazards
- Year: 2026
- Date: 2026-02-26
- Authors: Shaurya Gupta, Vinay Dubey, Rahul Katarya
- DOI: 10.1007/s11069-025-07941-6
Research Groups
- Big Data Analytics and Web Intelligence Laboratory, Delhi Technological University, New Delhi, India
- School of Engineering & Technology, Vivekananda Institute of Professional Studies-Technical Campus, New Delhi, Delhi, India
Short Summary
This paper introduces Flood-X, a novel deep learning architecture for pixel-level flood segmentation in ground-level surveillance images, utilizing an Xception-based encoder and a custom lightweight decoder. The model achieves state-of-the-art performance with a mean Intersection over Union (mIoU) of 94% on a combined augmented dataset, outperforming existing methods and demonstrating superior efficiency and accuracy.
Objective
- To develop and evaluate deep learning-based semantic segmentation architectures for accurate pixel-level flood detection in ground-level surveillance images.
- To introduce a novel framework, Flood-X, and compare its performance against existing approaches and a coarse-based network.
- To analyze the effect of different loss functions (Binary Cross-Entropy, Tversky Loss, Log-Cosh Dice Loss) on model convergence and segmentation quality for imbalanced flood datasets.
Study Configuration
- Spatial Scale: Ground-level surveillance images, focusing on local and regional scenes. Input images are resized to 256 pixels × 256 pixels.
- Temporal Scale: Focused on rapid assessment and disaster response for current flood events, enabling real-time deployment for monitoring.
Methodology and Data
- Models used:
- Flood-X: Novel architecture comprising an Xception-based encoder and a custom lightweight decoder with multi-scale concatenation connections and skip connections.
- Coarse-based network: An adapted coarse-to-fine network originally for depth estimation, repurposed for binary segmentation.
- Comparison models: U-Net, SegNet, Mask R-CNN, DeepLabv3+, FASegNet, MoSWIN, ParseNet, Gated-SCNN, Tiramisu, ContextNet, PSPNet.
- Data sources:
- Combined dataset from Lopez-Fuentes et al. (2017) and Sazara et al. (2019), consisting of 553 original ground-level images of river and flood scenes.
- Data augmentation performed using the Albumentations library (e.g., flipping, shifting, scaling, rotation, random crop, brightness adjustments, grid distortion, elastic transformation) to expand the dataset to over 2.5 thousand samples.
- Dataset split: 85% for training (2350 images) and 15% for validation (415 images).
- Final testing conducted on unseen real-world images sourced from the internet.
Main Results
- Flood-X achieved state-of-the-art performance in ground-level flood segmentation, attaining an mIoU of 94% on the combined augmented dataset.
- Flood-X demonstrated significantly faster convergence, reaching a 90% IOU score in approximately 10 epochs, with minimal overfitting.
- Tversky Loss was identified as the optimal loss function for Flood-X, yielding the highest accuracy and effectively addressing data imbalance, closely followed by Log-Cosh Dice Loss.
- Binary Cross-Entropy loss consistently underperformed, resulting in lower probability predictions near flood boundaries.
- An ablation study confirmed the critical contributions of Flood-X's architectural components: multi-scale concatenation connections (2.75% mIoU drop without them), the Xception encoder (superior efficiency and accuracy compared to standard convolutions), and decoder skip additions (~1.8% mIoU decline without them).
- The lightweight decoder design was validated, as a heavier decoder did not provide significant accuracy gains despite increased computational parameters.
Contributions
- A novel flood segmentation framework (Flood-X) leveraging an Xception-based encoder and a custom decoder optimized for ground-surveillance imagery.
- Comprehensive comparison of Flood-X with a coarse-based network and other state-of-the-art segmentation techniques, demonstrating superior performance.
- Detailed analysis of three loss functions (Binary Cross-Entropy, Tversky Loss, and Log-Cosh Dice Loss) to study their impact on convergence stability and segmentation accuracy for imbalanced flood datasets.
- Extensive quantitative and qualitative evaluations, including results on unseen real-world images, demonstrating the robustness and generalization capability of the proposed Flood-X model for local flood monitoring.
Funding
No funding available.
Citation
@article{Gupta2026Optimizing,
author = {Gupta, Shaurya and Dubey, Vinay and Katarya, Rahul},
title = {Optimizing surveillance efficiency with deep learning-driven flood segmentation},
journal = {Natural Hazards},
year = {2026},
doi = {10.1007/s11069-025-07941-6},
url = {https://doi.org/10.1007/s11069-025-07941-6}
}
Original Source: https://doi.org/10.1007/s11069-025-07941-6