Yousefi et al. (2025) A reinforcement learning approach with explainable AI for spatial flood susceptibility analysis
Identification
- Journal: Journal of Hydrology Regional Studies
- Year: 2025
- Date: 2025-12-11
- Authors: Saleh Yousefi, Sara Mardanian, Abolfazl Jaafari, Zahra Tavangar
- DOI: 10.1016/j.ejrh.2025.103035
Research Groups
- Soil Conservation and Watershed Management Research Department, Chaharmahal and Bakhtiari Agricultural and Natural Resources Research and Education Center, AREEO, Shahrekord, Iran
- Research Institute of Forests and Rangelands, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
- Shahid Beheshti University, Faculty of Architecture and Urban Planning, Landscape Architecture Department, Tehran, Iran
Short Summary
This study develops and compares reinforcement learning (RL) models, including a novel RL-Stack ensemble, for spatial flood susceptibility mapping in a semi-arid mountainous region, finding that the Proximal Updating (PU) model achieved the highest accuracy and stability, with snow depth identified as the primary hydrological control.
Objective
- To develop and evaluate a spatially explicit framework integrating reinforcement learning (RL) algorithms (Q-Learning, Deep Q-Learning, Proximal Updating, and a novel RL-Stack ensemble) with Geographic Information Systems (GIS) for flood susceptibility mapping.
- To enhance interpretability of the RL models and quantify the contribution of explanatory variables using SHapley Additive exPlanations (SHAP).
Study Configuration
- Spatial Scale: Chaharmahal and Bakhtiari Province, western Iran, covering approximately 16553 square kilometers, with elevations ranging from 778 meters to 4203 meters above sea level. All raster layers were resampled to a standardized resolution of 30 meters.
- Temporal Scale: Flood inventory data and explanatory variables were collected and selected to correspond to the period from 1983 to 2023.
Methodology and Data
- Models used: Q-Learning (QL), Proximal Policy Optimization (PPO)-based Proximal Updating (PU), Deep Q-Learning (DQL), and RL-Stack (a novel ensemble approach combining QL, PU, and DQL with a logistic regression meta-learner). SHapley Additive exPlanations (SHAP) was used for model interpretability.
- Data sources:
- Flood inventory data: 545 location points (346 flood, 199 non-flood) compiled from historical records of the regional water administrative, field surveys, and local disaster management authorities (1983–2023).
- Explanatory variables: 22 distinct factors (selected from an initial 28 after correlation analysis) encompassing environmental, topographic, climatic, and anthropogenic characteristics. Sources include a 30-meter Digital Elevation Model (DEM), Iran Meteorological Organization (IRIMO), Soil Grid, Landsat 8 and 9 operational land imager (OLI) - USGS, and regional datasets.
Main Results
- The Proximal Updating (PU) model consistently demonstrated the best performance, achieving a validation Area Under the Curve (AUC) of 0.9642 and a Kappa coefficient of 0.7523, indicating superior accuracy, sensitivity, and generalization capacity.
- The novel RL-Stack ensemble model ranked second, with a validation AUC of 0.9523 and high sensitivity (0.9468), effectively leveraging the complementary strengths of its base RL learners.
- Deep Q-Learning (DQL) showed strong performance (validation AUC = 0.9331) but was characterized by the highest modeling error (RMSE) and overestimation bias in local predictions.
- Q-Learning (QL) exhibited the weakest performance (validation AUC = 0.5871), demonstrating poor generalization and a strong bias towards positive classifications.
- SHAP analysis identified snow depth (SD) as the most influential variable (mean SHAP = 2.108), playing a dual role in intensifying or attenuating floods. Other key contributors included flow accumulation (FA), maximum 24-hour rainfall (R24HR), and distance from residential areas (DRA).
- Strong interactions were observed between short-duration intense rainfall (R24HR) and snow depth (SD) and flow accumulation (FA), highlighting the significance of rain-on-snow processes in generating large floods.
Contributions
- Expansion of Reinforcement Learning (RL) applications from flood management to the domain of spatial flood susceptibility mapping.
- Introduction of RL-Stack, a novel ensemble hybrid model specifically designed to capture the complex flood dynamics in semi-arid, mountainous environments by combining multiple RL algorithms.
- Integration of Explainable AI (SHAP) to provide transparent, instance-level, and dataset-wide insights into RL model predictions, enhancing trust and supporting informed decision-making.
- Generation of highly accurate and actionable flood susceptibility maps, offering critical guidance for policymakers, urban planners, and disaster managers in reducing flood risks and enhancing community resilience.
Funding
- The Research Institute of Forests and Rangelands (RIFR) under the National Research Project No. 0–09–09–002–000095.
Citation
@article{Yousefi2025reinforcement,
author = {Yousefi, Saleh and Mardanian, Sara and Jaafari, Abolfazl and Tavangar, Zahra},
title = {A reinforcement learning approach with explainable AI for spatial flood susceptibility analysis},
journal = {Journal of Hydrology Regional Studies},
year = {2025},
doi = {10.1016/j.ejrh.2025.103035},
url = {https://doi.org/10.1016/j.ejrh.2025.103035}
}
Original Source: https://doi.org/10.1016/j.ejrh.2025.103035