Data-driven evaluation of machine learning models for climate control in operational smart greenhouses

Nowadays, human overpopulation is stressing our ecosystems in different ways, agriculture being a critical example as different predictions point towards food shortages in the near future. Accordingly, smart farming is becoming key to the optimization of natural resources so that different crops can...

Descripción completa

Detalles Bibliográficos
Autores: Morales García, Juan, Bueno Crespo, Andrés, Martínez España, Raquel, Cecilia Canales, José María
Tipo de recurso: artículo
Fecha de publicación:2023
País:España
Institución:Universidad Católica San Antonio de Murcia (UCAM)
Repositorio:RIUCAM. Repositorio Institucional de la Universidad Católica San Antonio de Murcia
OAI Identifier:oai:repositorio.ucam.edu:10952/7390
Acceso en línea:http://hdl.handle.net/10952/7390
Access Level:acceso abierto
Palabra clave:Precision Agriculture
Artificial Intelligence
Machine Learning
Temperature Forecasting
Smart Greenhouses
Descripción
Sumario:Nowadays, human overpopulation is stressing our ecosystems in different ways, agriculture being a critical example as different predictions point towards food shortages in the near future. Accordingly, smart farming is becoming key to the optimization of natural resources so that different crops can be grown efficiently, consuming as few resources as possible. In particular, greenhouses have proved to be an effective way of producing a high volume of vegetables/fruits in a reduced space and within a short time span. Hence, optimizing greenhouse functioning results in less water use and nutrient consumption, less energy use, faster growth, and better product quality. In this article, we carry out an in-depth analysis of different machine learning (ML) models to improve climate control in smart greenhouses. As part of the analysis of the techniques we also considered 3 ways of pre-processing the data, as well as 12-hour and 24-hour forecasting. We focus on forecasting the indoor air temperature of an operational smart greenhouse, i.e. assessing the data anomalies that are inherently present in these environments due to the instability of IoT infrastructures. Several ML models are adapted to time series forecasting to provide an overview of these techniques and to find out which one performs better in this particular scenario. Our results show that, after statistically validating the results, the Random Forest Regression technique gives the best overall result with a mean absolute error of less than 1 degree Celsius.