Density estimation of the main structuring sessile species in underwater marine caves with a deep learning approach

Monitoring marine biodiversity is a challenge in some vulnerable and difficult-to-access habitats, such as underwater caves. Underwater caves are a great focus of biodiversity, concentrating a large number of species in their environment. However, most of the sessile species that live on the rocky w...

ver descrição completa

Detalhes bibliográficos
Autores: Sierra Menéndez, Sergio, Prado Ortega, Elena, Rodríguez Cobo, Luis|||0000-0002-2068-2956, Quiles Pons, Carla, Roldán Varona, Pablo, Díaz Viñolas, David, Anuarbe Cortés, Pedro|||0000-0002-7459-0571, Cobo García, Adolfo|||0000-0003-1498-9238, Sánchez Delgado, Francisco
Tipo de documento: artigo
Data de publicação:2023
País:España
Recursos:Universidad de Cantabria (UC)
Repositório:UCrea Repositorio Abierto de la Universidad de Cantabria
Idioma:inglês
OAI Identifier:oai:repositorio.unican.es:10902/31767
Acesso em linha:https://hdl.handle.net/10902/31767
Access Level:Acceso aberto
Palavra-chave:Marine Biodiversity
Underwater Caves
Underwater Images
Deep Learning
Semantic Segmentation
Descrição
Resumo:Monitoring marine biodiversity is a challenge in some vulnerable and difficult-to-access habitats, such as underwater caves. Underwater caves are a great focus of biodiversity, concentrating a large number of species in their environment. However, most of the sessile species that live on the rocky walls are very vulnerable, and they are often threatened by different pressures. The use of these spaces as a destination for recreational divers can cause different impacts on the benthic habitat. In this work, we propose a methodology based on video recordings of cave walls and image analysis with deep learning algorithms to estimate the spatial density of structuring species in a study area. We propose a combination of automatic frame overlap detection, estimation of the actual extent of surface cover, and semantic segmentation of the main 10 species of corals and sponges to obtain species density maps. These maps can be the data source for monitoring biodiversity over time. In this paper, we analyzed the performance of three different semantic segmentation algorithms and backbones for this task and found that the Mask R-CNN model with the Xception101 backbone achieves the best accuracy, with an average segmentation accuracy of 82%.