Early detection of gearbox faults in wind turbines using a fine-tuned transformer encoder

Gearbox failures remain one of the most critical and costly issues in wind turbine operation, often leading to extended downtime and increased maintenance expenses. This paper presents a novel methodology for the early detection of gearbox faults using Supervisory Control and Data Acquisition (SCADA...

Descripción completa

Detalles Bibliográficos
Autores: España, Sofía, Sánchez, Juan, Puruncajas Maza, Bryan, Castellani, Francesco, Tutivén Gálvez, Christian|||0000-0001-6322-4608, Vidal Seguí, Yolanda|||0000-0003-4964-6948
Tipo de recurso: artículo
Fecha de publicación:2026
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:dnet:upcommonspor::2d1b0c6d7f3e42cb0747d8c9e59b62ad
Acceso en línea:https://hdl.handle.net/2117/459964
https://dx.doi.org/10.1016/j.rineng.2026.109915
Access Level:acceso abierto
Palabra clave:Wind turbine
SCADA
Transformer
Fault detection
Descripción
Sumario:Gearbox failures remain one of the most critical and costly issues in wind turbine operation, often leading to extended downtime and increased maintenance expenses. This paper presents a novel methodology for the early detection of gearbox faults using Supervisory Control and Data Acquisition (SCADA) data and a fine-tuned Trans- former encoder model. Unlike traditional condition monitoring systems, which rely on high-frequency sensors or require extensive labeling, the proposed approach uses the existing low-frequency SCADA infrastructure and employs a semi-supervised learning strategy. A normal behavior model is first pre-trained on healthy data from a single wind turbine and subsequently fine-tuned to adapt to the operational characteristics of additional turbines in the same wind farm. Key contributions include an optimized variable selection process based on Kendall’s Tau correlation, data imputation and smoothing techniques to address missing values and noise, and a fault prognosis indicator derived from model residuals. Results obtained with real SCADA datasets demonstrate the effectiveness of the method in detecting incipient gearbox faults under realistic operating conditions.