Li et al. (2025) Surrogate modeling for rapid estimation of spatially-resolved flood damage: Application to a coastal region

Identification

Journal: Journal of Hydrology
Year: 2025
Date: 2025-12-09
Authors: Shicheng Li, Can Ding, James Yang
DOI: 10.1016/j.jhydrol.2025.134763

Research Groups

Department of Civil and Architectural Engineering, KTH Royal Institute of Technology, Stockholm, Sweden
Department of Built Environment, Aalto University, Espoo, Finland
R&D Hydraulic Laboratory, Vattenfall AB, Älvkarleby, Sweden

Short Summary

This study introduces BayFlood, a Bayesian-optimized machine learning surrogate model for rapid, accurate, and spatially resolved flood damage estimation using river discharge and tidal level inputs. The boosting-ensemble-driven BayFlood achieved the best performance (coefficient of determination = 0.92–0.98; root mean square error = 0.04–0.08) and reduced computational time by two orders of magnitude compared to hydraulic modeling.

Objective

To develop a machine learning (ML)-based surrogate model for flood damage assessment (FDA) that provides rapid, accurate, and spatially resolved damage estimation, addressing the computational cost limitations of conventional hydraulic model-based FDA approaches and the lack of spatially resolved quantification in existing ML-based methods.

Study Configuration

Spatial Scale: A coastal region of the Jianjiang River, Guangdong Province, China, extending approximately 15 km upstream from the estuary, with a computational domain of approximately 76 km².
Temporal Scale: Focus on static peak flood damage assessment for 60 distinct flood events (30 river flow-dominant, 30 storm surge-dominant, and 4 compound flood scenarios), disregarding temporal flood evolution.

Methodology and Data

Models used:
- BayFlood: A Bayesian-optimized machine learning surrogate model.
- ML Engines: Regression Tree (RT), Boosting Ensemble (BE), Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN).
- Optimization: Bayesian Optimization (BO) using a Gaussian Process (GP) surrogate model and Expected Improvement (EI) acquisition function.
- Hydraulic Model: TELEMAC (open-source 2D hydraulic simulation program) used to generate training/validation data.
- Damage Function: Unit Damage Functions (UDFs) developed by Su et al. (2022) for Guangdong Province, quantifying damage rate based on inundation depth and land use.
Data sources:
- Flood Event Data: 60 flood events (river flow-dominant, storm surge-dominant, and compound flood scenarios) generated from 2D hydraulic simulations using TELEMAC.
- Topographic Data: River channel topography measured in 2020 (1:5000 scale) from the Guangdong Provincial Design Institute of Water Conservancy and Electric Power.
- Land Use Data: 2019 annual land cover dataset.
- Hydrological Inputs: River discharge (Q) and tidal level (SL).
- Hydraulic Model Validation Data: Water level measurements from September and October 2023 flood events for 1D model calibration/validation.

Main Results

The Boosting Ensemble (BE)-driven BayFlood model consistently outperformed other ML engines (RT, LSTM, CNN) in accuracy and efficiency across all flood scenarios.
For river flow-dominant (RFD) events, the BE-driven BayFlood achieved a coefficient of determination (CD) of 0.92 and a root mean square error (RMSE) of 0.08 for damage rate (DR) forecasting. 93% of predictions had errors below 0.10, and 97% below 0.20. For flood depth (FD) forecasting, it achieved CD = 0.95, RMSE = 0.53 m, and mean absolute error (MAE) = 0.25 m.
For storm surge-dominant (SSD) events, the BE-driven BayFlood achieved a CD of 0.98 and an RMSE of 0.04 for DR predictions. 97% of forecasts showed errors below 0.10, and 99% below 0.20. For FD estimates, it achieved CD = 0.99, RMSE = 0.24 m, and MAE = 0.10 m.
Under super-extreme compound flood (CF) conditions, the BE-driven BayFlood maintained stable predictive performance with a mean CD of 0.93 ± 0.01 and RMSE fluctuating by only 0.02.
Monte Carlo uncertainty analysis (1000 runs, 5% noise level) revealed a mean damage rate uncertainty of 0.18 (18%) and a mean flood depth uncertainty of 1.3 m, confirming model robustness.
BayFlood significantly reduced computational time by two orders of magnitude (factor of 66–330) compared to the hydraulic model (which took approximately 1 hour per event), generating damage results within minutes (< 5 minutes per event after training).

Contributions

Development of the first ML-based surrogate model (BayFlood) for rapid and spatially resolved flood damage estimation, bridging the gap between high-accuracy hydraulic simulations and the need for real-time, high-resolution FDA.
Achieves significant computational efficiency gains (two orders of magnitude faster) over traditional hydraulic models, enabling practical application for real-time emergency response, pre-disaster planning, and post-disaster recovery.
Demonstrates high predictive accuracy and robustness across diverse flood scenarios (river flow-dominant, storm surge-dominant, and compound floods) and under input uncertainty.
Provides actionable, spatially explicit damage maps, enhancing localized decision-making and risk management strategies.

Funding

Swedish Center for Sustainable Hydropower (SVC)

Citation

@article{Li2025Surrogate,
  author = {Li, Shicheng and Ding, Can and Yang, James},
  title = {Surrogate modeling for rapid estimation of spatially-resolved flood damage: Application to a coastal region},
  journal = {Journal of Hydrology},
  year = {2025},
  doi = {10.1016/j.jhydrol.2025.134763},
  url = {https://doi.org/10.1016/j.jhydrol.2025.134763}
}

Original Source: https://doi.org/10.1016/j.jhydrol.2025.134763