A machine learning-based surrogate model for the identification of risk zones due to off-stream reservoir failure

Approximately 70,000 Spanish off-stream reservoirs, many of them irrigation ponds, need to be evaluated in terms of their potential hazard to comply with the new national Regulation of the Hydraulic Public Domain. This requires a great engineering effort to evaluate different scenarios with two-dime...

Descripción completa

Detalles Bibliográficos
Autores: Silva Cancino, Nathalia, Salazar González, Fernando, Sanz Ramos, Marcos|||0000-0003-2534-0039, Bladé i Castellet, Ernest|||0000-0003-1770-3960
Tipo de recurso: artículo
Fecha de publicación:2022
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:upcommons.upc.edu:2117/371776
Acceso en línea:https://hdl.handle.net/2117/371776
https://dx.doi.org/10.3390/w14152416
Access Level:acceso abierto
Palabra clave:Dams -- Risk assessment
Machine learning
Iber
Off-stream reservoirs
Dam breach
Floods
Random forest
Surrogate model
Preses -- Avaluació del risc
Àrees temàtiques de la UPC::Enginyeria civil::Enginyeria hidràulica, marítima i sanitària::Embassaments i preses
Descripción
Sumario:Approximately 70,000 Spanish off-stream reservoirs, many of them irrigation ponds, need to be evaluated in terms of their potential hazard to comply with the new national Regulation of the Hydraulic Public Domain. This requires a great engineering effort to evaluate different scenarios with two-dimensional hydraulic models, for which many owners lack the necessary resources. This work presents a simplified methodology based on machine learning to identify risk zones at any point in the vicinity of an off-stream reservoir without the need to elaborate and run full two-dimensional hydraulic models. A predictive model based on random forest was created from datasets including the results of synthetic cases computed with an automatic tool based on the two-dimensional numerical software Iber. Once fitted, the model provided an estimate on the potential hazard considering the physical characteristics of the structure, the surrounding terrain and the vulnerable locations. Two approaches were compared for balancing the dataset: the synthetic minority oversampling and the random undersampling. Results from the random forest model adjusted with the random undersampling technique showed to be useful for the estimation of risk zones. On a real application test the simplified method achieved 91% accuracy.