Exploring the effect of temporal aggregation on SCADA data for wind turbine prognosis using a normality model

Today, the use of SCADA data for predictive maintenance and forecasting of wind turbines in wind farms is gaining popularity due to the low cost of this solution compared to others that require the installation of additional equipment. SCADA data provides four statistical measures (mean, standard de...

Descripción completa

Detalles Bibliográficos
Autores: Martí Puig, Pere, Bennásar Sevilla, Alejandro, Blanco, Alejandro, Solé-Casals, Jordi
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2021
País:España
Institución:Universitat Oberta de Catalunya (UOC)
Repositorio:O2, repositorio institucional de la UOC
OAI Identifier:oai:openaccess.uoc.edu:10609/137149
Acceso en línea:https://hdl.handle.net/10609/137149
https://doi.org/10.3390/app11146405
Access Level:acceso abierto
Palabra clave:wind turbine
fault diagnosis
renewable energy
extreme learning machines (ELM)
feature engineering
normal behaviour models
Descripción
Sumario:Today, the use of SCADA data for predictive maintenance and forecasting of wind turbines in wind farms is gaining popularity due to the low cost of this solution compared to others that require the installation of additional equipment. SCADA data provides four statistical measures (mean, standard deviation, maximum value, and minimum value) of hundreds of wind turbine magnitudes, usually in a 5-min or 10-min interval. Several studies have analysed the loss of information associated with the reduction of information when using five minutes instead of four seconds as a sampling frequency, or when compressing a time series recorded at 5 min to 10 min, concluding that some, but not all, of these magnitudes are seriously affected. However, to our knowledge, there are no studies on increasing the time interval beyond 10 min to take these four statistical values, and how this aggregation affects prognosis models. Our work shows that, despite the irreversible loss of information that occurs in the first 5 min, increasing the time considered to take the four representative statistical values improves the performance of the predicted targets in normality models.