Mohanty et al. (2025) Machine Learning Insights for Fire Impacts on Snow Disappearance Predictability in Northern California

Identification

Journal: Mendeley Data
Year: 2025
Date: 2025-11-14
Authors: Mohanty, Yashnil, Abolafia-Rosenzweig, Ronnie, He, Cenlin
DOI: 10.17632/4xrk8g6458

Research Groups

Not explicitly stated in the provided text.

Short Summary

This study investigates the impact of wildfires on snow disappearance predictability in Northern California using machine learning models. It finds that wildfires significantly alter snow dynamics, and including post-fire landscapes in model training data is crucial for improving snow disappearance predictions in burned areas.

Objective

To examine the influence of fires on snow disappearance predictability using a suite of machine learning experiments.

Study Configuration

Spatial Scale: Regional (Northern California), down to pixel level (e.g., pixels with >75% burn fraction).
Temporal Scale: Seasonal (day of snow disappearance) and multi-year (lasting perturbations from wildfires).

Methodology and Data

Models used: Random Forest, Multiple Linear Regression, Long Short-Term Memory (LSTM), Multilayer Perceptron (MLP), Extreme Gradient Boosting (XGBoost).
Data sources: Not explicitly detailed in the provided text, but includes hydrometeorological predictors, burn fraction, and vegetation greenness.

Main Results

Machine learning models trained solely on no/low-burn areas perform significantly worse when applied to moderate-to-high burn areas.
For random forest models, the bias standard deviation for snow disappearance day (DSD) is 12.0 days and R² is 0.86 in no/low-burn areas, compared to 14.0–15.3 days and R² of 0.72–0.79 in heavily burned categories.
A systematic overestimation of DSD in very high burn areas suggests that post-fire-enhanced snow ablation is not captured by models trained on unburned data.
Including samples from burned areas in model training improves DSD predictions in burned landscapes without degrading performance in unburned regions.
The largest accuracy improvements occur in pixels with >75% burn fraction, where DSD overestimation is reduced by over 50% (mean bias from 4.0 days to 1.6 days).
Model performance in moderate-to-high burn areas remains lower than in no/low-burn regions (R² lower by 0.05–0.06; bias standard deviation higher by 1.8–2.3 days).
Adding explicit fire-related predictors (burn fraction and vegetation greenness) does not yield further accuracy gains.

Contributions

Demonstrates that wildfires significantly impact snowpack dynamics and that machine learning models need to account for these changes.
Establishes that including post-fire landscapes in training data is essential for improving machine learning-based snow predictions in burned areas.
Shows that implicitly incorporating fire effects through training data is more effective than explicitly adding fire-related predictors for the models tested.

Funding

Not explicitly stated in the provided text.

Citation

@article{Mohanty2025Machine,
  author = {Mohanty, Yashnil and Abolafia-Rosenzweig, Ronnie and He, Cenlin},
  title = {Machine Learning Insights for Fire Impacts on Snow Disappearance Predictability in Northern California},
  journal = {Mendeley Data},
  year = {2025},
  doi = {10.17632/4xrk8g6458},
  url = {https://doi.org/10.17632/4xrk8g6458}
}

Original Source: https://doi.org/10.17632/4xrk8g6458