2025 et al. (2025) CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting
Identification
- Journal: Open MIND
- Year: 2025
- Date: 2025-10-10
- Authors: Association for Computational Linguistics 2025, Lau, Alexis Kai Hon, Li, Haobo, Qu, Huamin, Wang, Zhaowei, Wang, Jiachen, WANG, Yueya
- DOI: 10.48448/f85m-y402
Research Groups
Not explicitly available in the provided text, but mentions "Jiaxin Bai and 6 other authors".
Short Summary
This paper introduces Weather and Climate Event Forecasting (WCEF), a new task that leverages numerical meteorological raster data and textual event data to predict weather and climate events. To facilitate this, the authors present CLLMate, the first multimodal dataset for WCEF, and benchmark 32 existing models, revealing their advantages and limitations for this task.
Objective
- To propose Weather and Climate Event Forecasting (WCEF) as a novel task aimed at predicting weather and climate events by translating numerical meteorological variables into actionable textual narratives.
- To address the lack of supervised datasets and multimodal data alignment challenges by presenting CLLMate, the first multimodal dataset for WCEF.
- To systematically benchmark existing models on CLLMate to evaluate their performance and highlight the dataset's value for training and benchmarking in WCEF.
Study Configuration
- Spatial Scale: Not explicitly defined, but implied global coverage through the use of ERA5 reanalysis data.
- Temporal Scale: Not explicitly defined for the dataset's temporal span, but comprises 26,156 environmental news articles.
Methodology and Data
- Models used: 32 existing models, including closed-source, open-source, and fine-tuned models (specific model names not provided in the abstract).
- Data sources:
- 26,156 environmental news articles (textual event data).
- ERA5 reanalysis data (numerical meteorological raster data).
Main Results
- Introduction of CLLMate, the first multimodal dataset specifically designed for Weather and Climate Event Forecasting (WCEF), which aligns 26,156 environmental news articles with ERA5 reanalysis data.
- Systematic benchmarking of 32 diverse models on the CLLMate dataset was performed.
- Experiments revealed the advantages and limitations of existing Multimodal Large Language Models (MLLMs) when applied to the WCEF task.
- The study demonstrated the significant value of the CLLMate dataset for both training and benchmarking models in the context of weather and climate event forecasting.
Contributions
- Proposal of Weather and Climate Event Forecasting (WCEF) as a new and challenging task that bridges the gap between numerical meteorological predictions and actionable textual narratives of events.
- Creation of CLLMate, the first multimodal dataset for WCEF, providing a crucial resource for research in this emerging field by aligning environmental news articles with ERA5 reanalysis data.
- A comprehensive benchmark of 32 diverse models on the new CLLMate dataset, offering insights into the current capabilities and limitations of existing Multimodal Large Language Models (MLLMs) for multimodal environmental forecasting.
Funding
Not available in the provided text.
Citation
@article{20252025CLLMate,
author = {2025, Association for Computational Linguistics and Lau, Alexis Kai Hon and Li, Haobo and Qu, Huamin and Wang, Zhaowei and Wang, Jiachen and WANG, Yueya},
title = {CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting},
journal = {Open MIND},
year = {2025},
doi = {10.48448/f85m-y402},
url = {https://doi.org/10.48448/f85m-y402}
}
Original Source: https://doi.org/10.48448/f85m-y402