Hydrology and Climate Change Article Summaries

Donohue et al. (2025) Structured dataset of reported cloud seeding activities in the United States (2000–2025) using an LLM

Identification

Research Groups

Short Summary

This study presents a structured dataset of reported cloud seeding activities in the United States from 2000 to 2025, extracted from 832 historical NOAA reports using a multi-stage PDF-to-text pipeline combined with an LLM, achieving an estimated 98.38% accuracy. The dataset addresses a critical data gap and demonstrates a scalable framework for unlocking historical environmental data using large language models.

Objective

Study Configuration

Methodology and Data

Main Results

Contributions

Funding

Citation

@article{Donohue2025Structured,
  author = {Donohue, Jared Joseph and Lamb, Kara D.},
  title = {Structured dataset of reported cloud seeding activities in the United States (2000–2025) using an LLM},
  journal = {Scientific Data},
  year = {2025},
  doi = {10.1038/s41597-025-06273-1},
  url = {https://doi.org/10.1038/s41597-025-06273-1}
}

Original Source: https://doi.org/10.1038/s41597-025-06273-1