Hyperspectral Image Analysis Using Cloud-Based Support Vector Machines

Hyperspectral image processing techniques involve time-consuming calculations due to the large volume and complexity of the data. Indeed, hyperspectral scenes contain a wealth of spatial and spectral information thanks to the hundreds of narrow and continuous bands collected across the electromagnet...

Descripción completa

Detalles Bibliográficos
Autores: Haut, Juan M., Franco Valiente, José M., Paoletti, Mercedes Eugenia, Moreno Álvarez, Sergio, Pardo-Diaz, Alfonso
Tipo de recurso: artículo
Fecha de publicación:2024
País:España
Institución:Universidad Nacional de Educación a Distancia
Repositorio:e-spacio. Repositorio Institucional de la UNED
Idioma:inglés
OAI Identifier:oai:e-spacio.uned.es:20.500.14468/24446
Acceso en línea:https://hdl.handle.net/20.500.14468/24446
Access Level:acceso abierto
Palabra clave:12 Matemáticas::1203 Ciencia de los ordenadores ::1203.17 Informática
Cloud computing (CC)
Support vector machines (SVMs)
Hyperspectral imaging (HSIs)
Machine learning (ML)
Remote sensing (RS)
Descripción
Sumario:Hyperspectral image processing techniques involve time-consuming calculations due to the large volume and complexity of the data. Indeed, hyperspectral scenes contain a wealth of spatial and spectral information thanks to the hundreds of narrow and continuous bands collected across the electromagnetic spectrum. Predictive models, particularly supervised machine learning classifiers, take advantage of this information to predict the pixel categories of images through a training set of real observations. Most notably, the Support Vector Machine (SVM) has demonstrate impressive accuracy results for image classification. Notwithstanding the performance offered by SVMs, dealing with such a large volume of data is computationally challenging. In this paper, a scalable and high-performance cloud-based approach for distributed training of SVM is proposed. The proposal address the overwhelming amount of remote sensing (RS) data information through a parallel training allocation. The implementation is performed over a memory-efficient Apache Spark distributed environment. Experiments are performed on a benchmark of real hyperspectral scenes to show the robustness of the proposal. Obtained results demonstrate efficient classification whilst optimising data processing in terms of training times.