Vischer et al. (2025) Spatially resolved rainfall streamflow modeling in central Europe
Identification
- Journal: Hydrology and earth system sciences
- Year: 2025
- Date: 2025-10-16
- Authors: Marc Aurel Vischer, Noelia Otero, Jackie Ma
- DOI: 10.5194/hess-29-5233-2025
Research Groups
Fraunhofer Heinrich-Hertz Institute, Applied Machine Learning Group
Short Summary
This study develops an end-to-end, spatially resolved neural network pipeline for rainfall-streamflow modeling in central Europe, addressing the limitations of previous aggregated approaches in large and human-impacted catchments. The pipeline demonstrates improved accuracy, data efficiency, and interpretability, particularly for larger river basins.
Objective
- To develop and evaluate a spatially resolved, end-to-end neural network pipeline for rainfall-streamflow modeling that can accurately predict streamflow in large catchments, including those characterized by human activity, by processing inputs on a regular grid rather than aggregating them per catchment.
Study Configuration
- Spatial Scale: Five major river basins in central Europe (Elbe, Oder, Weser, Rhine, and the upstream Danube River up to Bratislava), covering a contiguous area of 570 581 square kilometers. Input data are on a regular grid with a spatial resolution of 0.1° × 0.1° (approximately 9 km × 9 km). The study includes 239 river gauging stations.
- Temporal Scale: Water years 1981–2011 (October 1 to September 30 annually) at a daily temporal resolution.
Methodology and Data
- Models used:
- A two-stage neural network pipeline trained end-to-end:
- Local Stage: An LSTM-based network (250 units) processes meteorological and static inputs in parallel for each grid cell, predicting local runoff. It includes a fully connected embedding layer for static inputs.
- Routing Stage: Consists of two linear layers (a fully connected layer and a 1D-convolution layer with a 9-day kernel) without nonlinearity, mapping local runoff to streamflow along the river network. River network connectivity is incorporated as an inductive bias.
- Baselines: An aggregated LSTM model (similar to Kratzert et al., 2019b) and a "naive routing" model (spatially resolved local stage but unconstrained routing stage).
- A two-stage neural network pipeline trained end-to-end:
- Data sources:
- Dynamic input data: Daily minimum, average, and maximum temperature; daily sum and standard deviation of precipitation; and average potential evaporation from ERA5-Land (0.1° × 0.1° resolution). Also, sine-cosine embeddings of the day of the week and day of the year.
- Static input data: 46 feature dimensions including hydrogeological properties, soil class, land cover, and orographic features derived from a digital elevation map.
- Target streamflow time series: Daily streamflow records from the Global Runoff Data Center (GRDC) data portal.
Main Results
- The spatially resolved pipeline with structured routing achieved a median Nash–Sutcliffe Efficiency (NSE) of 0.773 on the test dataset, outperforming the aggregated baseline (median NSE 0.691) and naive routing (median NSE 0.719).
- Spatially resolved processing demonstrated a positive correlation between performance and catchment size, showing particular benefits for larger catchments where it outperformed aggregated processing.
- The spatially resolved model exhibited greater robustness against overfitting, with a more graceful performance deterioration from training (median NSE 0.90) to test (median NSE 0.77) compared to the aggregated baseline (0.86 to 0.69).
- The approach showed increased data efficiency, yielding stronger performance gains over aggregated processing when training data were limited (e.g., 6 years of data).
- The internal states of the model's local stage were found to be hydrologically interpretable, reflecting plausible runoff generation patterns (e.g., snowmelt in spring, precipitation-driven runoff in summer).
- Human influence, particularly from large-scale brown coal mining operations, was identified as a significant challenge, leading to negative NSE values for specific affected stations.
Contributions
- Presentation of a novel end-to-end neural network pipeline for spatially resolved rainfall-streamflow modeling that integrates local runoff generation on a regular grid with routing along river networks.
- Introduction of a simple, linear routing module that enhances interpretability and offers potential for simulating human water management activities (e.g., extraction/injection).
- Development and public release of a new, spatially resolved dataset for five central European river basins, suitable for distributed hydrological modeling and representative of areas with significant human impact.
- Demonstration that this spatially resolved approach improves prediction accuracy, especially for large and heterogeneous catchments, enhances data efficiency, and reduces overfitting compared to traditional aggregated neural network models.
- Revelation of hydrologically interpretable internal model states, an emergent property that facilitates scientific discovery and potential for internal model control.
Funding
- Federal Ministry for Economic Affairs and Climate Action (BMWK) under grant DAKI-FWS (01MK21009A).
- European Union’s Horizon Europe research and innovation program (EU Horizon Europe) project MedEWSa under grant agreement no. 101121192.
Citation
@article{Vischer2025Spatially,
author = {Vischer, Marc Aurel and Otero, Noelia and Ma, Jackie},
title = {Spatially resolved rainfall streamflow modeling in central Europe},
journal = {Hydrology and earth system sciences},
year = {2025},
doi = {10.5194/hess-29-5233-2025},
url = {https://doi.org/10.5194/hess-29-5233-2025}
}
Original Source: https://doi.org/10.5194/hess-29-5233-2025