Choi et al. (2025) Probabilistic Deep Learning Framework for Greenhouse Microclimate Prediction with Time-Varying Uncertainty and Covariance Analysis

Identification

Journal: Agriculture
Year: 2025
Date: 2025-11-27
Authors: Woo-Joo Choi, Myongkyoon Yang
DOI: 10.3390/agriculture15232461

Research Groups

Department of Agricultural Machinery Engineering, Jeonbuk National University, Jeonju, Republic of Korea
Department of Bioindustrial Machinery Engineering, Jeonbuk National University, Jeonju, Republic of Korea
Institute of Agricultural Machinery & ICT Convergence, Jeonbuk National University, Jeonju, Republic of Korea

Short Summary

This study developed a probabilistic deep learning framework to predict greenhouse microclimate variables with time-varying uncertainty and covariance analysis. The framework, based on a 1D CNN, demonstrated comparable predictive accuracy to deterministic models while providing explainable uncertainty interpretation and robust decision support for greenhouse operators.

Objective

To develop a probabilistic deep learning framework that quantitatively predicts greenhouse microclimate (internal temperature, relative humidity, and CO2 concentration) three hours later, while simultaneously estimating and interpreting time-varying uncertainty and correlations.

Study Configuration

Spatial Scale: A smart strawberry greenhouse located in Nonsan, Chungcheongnam-do, Republic of Korea.
Temporal Scale:
- Data collection period: 21 September 2023 to 26 April 2024.
- Data sampling interval: Hourly for microclimate and external environment, 1 minute for control data (resampled to hourly).
- Prediction horizon: 3 hours into the future.
- Input sequence length: Optimized to 24 hours.

Methodology and Data

Models used:
- Probabilistic Deep Learning Framework: One-dimensional Convolutional Neural Network (1D CNN) and Long Short-Term Memory (LSTM) models.
- Loss function for probabilistic models: Negative Log Likelihood (NLL).
- Deterministic models (for comparison): LSTM and 1D CNN with a Multi-Task Learning (MTL) structure.
- Loss function for deterministic models: Mean Absolute Error (MAE).
Data sources:
- Public datasets provided by the Korea Institute of Agriculture Technology Promotion Agency.
- Greenhouse microclimate data (hourly): Internal temperature (degrees Celsius), internal relative humidity (%), CO2 concentration (ppm).
- External environment data (hourly): External temperature (degrees Celsius), external relative humidity (%), precipitation (mm), wind speed (m/s), wind direction (degrees), solar radiation (W/m^2), snowfall (cm), ground temperature (degrees Celsius).
- Control data (1-minute interval): Window and curtain opening statuses (%).
- Data preprocessing: Handling missing values (replacement with zero for meteorological data, forward fill for control data), outlier detection (seasonal-trend decomposition with 24-hour cycle, residuals exceeding five standard deviations), linear interpolation, resampling to hourly, Min-Max scaling, and sine/cosine transforms for temporal variables.

Main Results

The 1D CNN-based probabilistic model achieved an average R2 of 0.93, demonstrating predictive performance comparable to deterministic models (no statistically significant difference in RMSE).
The model exhibited high sharpness and calibration, with an average Negative Log Likelihood (NLL) of 2.08 and a Coverage@90% of 0.901 across the three microclimate variables.
For internal temperature, relative humidity, and CO2 concentration, the model achieved R2 values of 0.95, 0.94, and 0.92, and RMSE values of 1.4 degrees Celsius, 2.73%, and 43.54 ppm, respectively.
The estimated covariance matrix successfully interpreted time-varying correlations between microclimate variables, revealing local simultaneous variability that global correlation analysis failed to capture.
Prediction uncertainty, quantified by standard deviation, was notably higher in time intervals with lower data sampling frequencies.
Average pairwise correlations were 0.23 for temperature–relative humidity, -0.05 for temperature–CO2 concentration, and -0.00 for relative humidity–CO2 concentration, with extreme values spanning the full theoretical range of -1 to 1.

Contributions

Building a probabilistic deep neural network for the temporal projection of important greenhouse microclimate variables (internal temperature, relative humidity, and CO2 concentration).
Integration of a Negative Log Likelihood (NLL) training objective enabling concurrent generation of projections and temporal covariance, thereby encoding model variability.
Deriving model variability and dependence between nonlinear variables through temporal covariance analysis using Cholesky factorization.
Introducing analytical methods to quantify the variability and reliability of network output and support the assessment of predictive reliability and resilience.

Funding

Korea Institute of Planning and Evaluation for Technology in Food, Agriculture and Forestry (IPET)
Korea Smart Farm R&D Foundation (KosFarm) through Smart Farm Innovation Technology Development Program
Ministry of Agriculture, Food and Rural Affairs (MAFRA)
Ministry of Science and ICT (MSIT)
Rural Development Administration (RDA) (Project Reference: RS-2024-00399854)

Citation

@article{Choi2025Probabilistic,
  author = {Choi, Woo-Joo and Yang, Myongkyoon},
  title = {Probabilistic Deep Learning Framework for Greenhouse Microclimate Prediction with Time-Varying Uncertainty and Covariance Analysis},
  journal = {Agriculture},
  year = {2025},
  doi = {10.3390/agriculture15232461},
  url = {https://doi.org/10.3390/agriculture15232461}
}

Original Source: https://doi.org/10.3390/agriculture15232461