Villegas-Vega et al. (2025) Optimization of LSTM networks through neuroevolution for drought forecasting in Mexico
Identification
- Journal: Theoretical and Applied Climatology
- Year: 2025
- Date: 2025-10-09
- Authors: Ramiro Villegas-Vega, Aldo Márquez-Grajales, Efrén Mezura‐Montes, Fernando Salas-Martínez, Manuel Alejandro Ojeda-Misses, Claudia Romo-Gómez
- DOI: 10.1007/s00704-025-05818-z
Research Groups
- Artificial Intelligence Research Institute, University of Veracruz
- Área Académica de Computación y Electrónica, Instituto de Ciencias Básicas e Ingeniería, Universidad Autónoma del Estado de Hidalgo
- Área Académica de Química, Instituto de Ciencias Básicas e Ingeniería, Universidad Autónoma del Estado de Hidalgo
Short Summary
This study proposes DeepGA-LSTM, a neuroevolution-based method using genetic algorithms to optimize Long Short-Term Memory (LSTM) networks for drought forecasting in Mexico. The DeepGA-LSTM consistently outperformed baseline LSTM and CNN-LSTM models in two Mexican regions (Chihuahua and Zacatecas) using SPEI and SPI indices, demonstrating its effectiveness in finding optimal network architectures.
Objective
- To evaluate the performance of neuroevolution in optimizing the structure and hyperparameters of LSTM networks for drought forecasting, specifically analyzing if the proposed DeepGA-LSTM achieves lower forecasting errors using the Standardized Precipitation-Evapotranspiration Index (SPEI) and the Standardized Precipitation Index (SPI) in two regions of Mexico (Chihuahua and Zacatecas).
Study Configuration
- Spatial Scale: Two Mexican states: Chihuahua and Zacatecas. Zacatecas is further divided into four subregions: Semi-arid, Canyons, High Plain, and Mountains.
- Temporal Scale:
- Chihuahua: Monthly SPEI-12 and SPEI-24 data from 1901 to 2020, with a one-step forward forecasting scheme.
- Zacatecas: Monthly SPI data from 1964 to 2020, with a 24-month forecasting horizon.
Methodology and Data
- Models used:
- DeepGA-LSTM: Long Short-Term Memory (LSTM) networks optimized by a genetic algorithm (neuroevolution).
- Baseline LSTM: Predefined LSTM architectures from previous studies or empirical tuning.
- CNN-LSTM: Hybrid model combining Convolutional Neural Network and LSTM layers.
- Data sources:
- Chihuahua: SPEI monthly data (12 and 24 months) from 1901 to 2020, sourced from the Global SPEI Database of the Consejo Superior de Investigaciones Científicas de España.
- Zacatecas: 31 precipitation time series from 1964 to 2020, obtained from the Servicio Meteorológico Nacional (SMN) meteorological stations network, used to compute the SPI.
Main Results
- DeepGA-LSTM consistently outperformed baseline LSTM and CNN-LSTM models in both Chihuahua and Zacatecas.
- Chihuahua (SPEI, one-step forward forecasting):
- For SPEI-12, DeepGA-LSTM achieved RMSE of 0.0644, MAE of 0.0470, R² of 0.8833, and a correlation coefficient (r) of 0.9535.
- For SPEI-24, DeepGA-LSTM achieved RMSE of 0.0355, MAE of 0.0260, R² of 0.9414, and r of 0.9710.
- Forecasting performance was notably higher for SPEI-24 due to lower fluctuations and greater smoothness of long-term drought time series.
- Zacatecas (SPI, 24-month forecasting horizon):
- DeepGA-LSTM's performance across the four subregions ranged from RMSE 0.2951 to 0.4304, MAE 0.2260 to 0.3321, R² 0.1187 to 0.6294, and r 0.3731 to 0.8053.
- The baseline LSTM model yielded negative R² values in all regions, indicating poor predictive capability compared to a simple mean-based predictor.
- DeepGA-LSTM showed strong performance in the Semi-arid (R² = 0.6294, r = 0.8053) and High Plain (R² = 0.4672, r = 0.7172) regions.
Contributions
- First study to utilize neuroevolution for optimizing LSTM networks for drought forecasting in Mexico, addressing a gap in the literature for this region.
- Introduction of a novel hybrid encoding for genetic algorithms, combining a block sequence (for network layers, neuron count, and dropout rate) with a real-valued vector (for learning rate, context size, and batch size). This allows for more flexible and comprehensive optimization of LSTM architectures and hyperparameters compared to previous rigid encoding methods.
- Demonstrated the efficacy of neuroevolution in automatically discovering suitable network structures and hyperparameters, reducing reliance on subjective expert knowledge or limited grid search approaches.
Funding
- Secretaría de Ciencia, Humanidades, Tecnología e Innovación (SECIHTI) for a scholarship grant (CVU number 1311162) to the first author.
Citation
@article{VillegasVega2025Optimization,
author = {Villegas-Vega, Ramiro and Márquez-Grajales, Aldo and Mezura‐Montes, Efrén and Salas-Martínez, Fernando and Ojeda-Misses, Manuel Alejandro and Romo-Gómez, Claudia},
title = {Optimization of LSTM networks through neuroevolution for drought forecasting in Mexico},
journal = {Theoretical and Applied Climatology},
year = {2025},
doi = {10.1007/s00704-025-05818-z},
url = {https://doi.org/10.1007/s00704-025-05818-z}
}
Original Source: https://doi.org/10.1007/s00704-025-05818-z