Jaiswal et al. (2025) More accurate forecasting of drought indices using a decomposition-based hybrid machine learning model
Identification
- Journal: Theoretical and Applied Climatology
- Year: 2025
- Date: 2025-11-01
- Authors: Ronit Jaiswal, Kapil Choudhary, Rajeev Ranjan Kumar, Girish Kumar Jha, Vijay Kamal Meena, N S Sudhakara, Mahesh Kumar Poonia
- DOI: 10.1007/s00704-025-05848-7
Research Groups
- ICAR- Central Institute of Temperate Horticulture, Srinagar, India
- Agriculture University, Jodhpur, Rajasthan, India
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
Short Summary
This study develops a decomposition-based hybrid machine learning framework for more accurate forecasting of precipitation-based drought indices (Effective Drought Index (EDI), Standardized Precipitation Index (SPI) at 3- and 6-month scales) in two drought-prone districts of Maharashtra, India. The Ensemble Empirical Mode Decomposition-Time Delay Neural Network (EEMD-TDNN) hybrid model emerged as the most effective, achieving a 15–30% reduction in Root Mean Squared Error (RMSE) and Mean Absolute Percentage Error (MAPE) compared to conventional and other hybrid models.
Objective
- To develop and systematically evaluate a comprehensive modelling framework for forecasting precipitation-based drought indices (EDI, SPI-3, and SPI-6) in Ahmednagar and Jalgaon districts, Maharashtra, India.
- To compare the performance of various forecasting models, including Autoregressive Integrated Moving Average (ARIMA), Time Delay Neural Network (TDNN), and Extreme Learning Machine (ELM), with their hybrid variants incorporating Seasonal-Trend Decomposition using Loess (STL) and Ensemble Empirical Mode Decomposition (EEMD).
Study Configuration
- Spatial Scale: Two drought-prone districts in Maharashtra, India: Ahmednagar and Jalgaon.
- Temporal Scale: Monthly rainfall data spanning 42 years (January 1981 to December 2022). Drought indices (EDI, SPI-3, SPI-6) were computed at daily and monthly scales.
Methodology and Data
- Models used:
- Individual models: Autoregressive Integrated Moving Average (ARIMA), Time Delay Neural Network (TDNN), Extreme Learning Machine (ELM).
- Decomposition techniques: Seasonal-Trend Decomposition using Loess (STL), Ensemble Empirical Mode Decomposition (EEMD).
- Hybrid models: STL-ARIMA, STL-ELM, STL-TDNN, EEMD-ARIMA, EEMD-ELM, EEMD-TDNN.
- Evaluation metrics: Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), Mean Absolute Error (MAE), Diebold-Mariano (DM) test for statistical significance.
- Data sources:
- Historical monthly rainfall data (January 1981 to December 2022) for Ahmednagar and Jalgaon districts, sourced from the NASA POWER database (https://power.larc.nasa.gov).
- Derived drought indices: Effective Drought Index (EDI), Standardized Precipitation Index at 3-month scale (SPI-3), and Standardized Precipitation Index at 6-month scale (SPI-6).
Main Results
- The EEMD-TDNN hybrid model consistently demonstrated superior predictive capability across all three drought indices (EDI, SPI-3, SPI-6) and both study districts (Ahmednagar and Jalgaon).
- EEMD-TDNN achieved a quantifiable improvement of 15–30% reduction in RMSE and MAPE across all three drought indices when compared to conventional (ARIMA, ELM, TDNN) and other hybrid models (STL-based, EEMD-ELM).
- Specifically, for Ahmednagar, EEMD-TDNN reduced RMSE by up to 51.92% (D1), 41.89% (D2), and 6.35% (D3) compared to standalone TDNN. For Jalgaon, RMSE reductions were 70.21% (D4), 31.11% (D5), and 33.33% (D6).
- EEMD-based hybrid models significantly outperformed STL-based hybrid models, with EEMD-TDNN showing RMSE reductions of up to 52.83% (D1), 39.44% (D2), and 13.24% (D3) over stlTDNN for Ahmednagar, and 61.64% (D4), 30.34% (D5), and 4.76% (D6) for Jalgaon.
- The Diebold-Mariano test confirmed the statistical significance of EEMD-TDNN's improvement over other models in most cases, particularly for D2, D5, and D6.
- The superior performance is attributed to the synergy between EEMD’s ability to decompose complex, non-stationary time series into intrinsic mode functions and TDNN’s strength in capturing temporal dependencies.
Contributions
- Proposes and systematically evaluates two distinct classes of hybrid forecasting frameworks (STL-based and EEMD-based) integrated with ARIMA, ELM, and TDNN, enabling a comparative analysis of component-based versus frequency-based decomposition techniques.
- Forecasts three widely accepted precipitation-based drought indices (EDI, SPI-3, and SPI-6) to capture short-, medium-, and dynamically evolving drought conditions, providing a holistic view of drought severity.
- Applies hybrid models to district-level datasets from two drought-prone regions in Maharashtra, India (Ahmednagar and Jalgaon), enhancing the operational relevance for climate-vulnerable agroecological zones.
- Identifies the EEMD-TDNN model as consistently demonstrating superior predictive capability, particularly in capturing nonlinear and non-stationary behavior of drought index time series.
- Demonstrates improved model generalizability and forecasting accuracy by effectively addressing high variability and nonlinear patterns in precipitation data through decomposition techniques.
- Enhances decision-support capabilities for agricultural planning, water resource allocation, and drought early warning systems (DEWS) by accurately forecasting short-term, medium-term, and dynamic drought conditions.
Funding
The author(s) received no financial support for this article's research, authorship and/or publication.
Citation
@article{Jaiswal2025More,
author = {Jaiswal, Ronit and Choudhary, Kapil and Kumar, Rajeev Ranjan and Jha, Girish Kumar and Meena, Vijay Kamal and Sudhakara, N S and Poonia, Mahesh Kumar},
title = {More accurate forecasting of drought indices using a decomposition-based hybrid machine learning model},
journal = {Theoretical and Applied Climatology},
year = {2025},
doi = {10.1007/s00704-025-05848-7},
url = {https://doi.org/10.1007/s00704-025-05848-7}
}
Original Source: https://doi.org/10.1007/s00704-025-05848-7