Selection of Radiomics Features based on their Reproducibility

Bibliographic details
Authors: Ligero, Marta; Torres, Guillermo (ORCID 0000-0002-1576-6178); Sánchez Ramos, Carles (ORCID 0000-0003-3435-9882); Diaz-Chito, Katerine (ORCID 0000-0002-8860-8082); Pérez, Raquel; Gil, Debora (ORCID 0000-0002-2770-4767)
Document type: book chapter
Publication date: 2019
Country: Spain
Resources: Universitat Autònoma de Barcelona
Repository: Dipòsit Digital de Documents de la UAB
Language: English
OAI Identifier:oai:ddd.uab.cat:257861
Online access: https://ddd.uab.cat/record/257861
https://dx.doi.org/urn:doi:10.1109/EMBC.2019.8857879
Access level: Open access
Keywords: Feature Selection
Reproducibility
Radiomics
Description
Abstract: Dimensionality reduction is key to alleviating machine learning artifacts in clinical applications with Small Sample Size (SSS) unbalanced datasets. Existing methods rely on either the probabilistic distribution of the training data or the discriminant power of the reduced space, disregarding the impact of repeatability and uncertainty in features. The present study proposes using the reproducibility of radiomics features as a selection criterion, keeping features with a high intraclass correlation coefficient (ICC). Reproducibility accounts for the variability introduced in image acquisition, such as scan acquisition parameters and convolution kernels, which affects intensity-based features, and for the variability in tumor annotations made by physicians, which influences morphological descriptors of the lesion. To assess the reproducibility of radiomics features, three studies were conducted on cases collected at Vall Hebron Oncology Institute (VHIO) on responders to oncology treatment. The studies focused on the variability due to the convolution kernel, the image acquisition parameters, and inter-observer lesion identification. The features selected were those with an ICC higher than 0.7 in all three studies. The selected features were then evaluated for lesion malignancy classification on a different database. Results show better performance than several state-of-the-art methods, including Principal Component Analysis (PCA), Kernel Discriminant Analysis via QR decomposition (KDAQR), LASSO, and a custom-built Convolutional Neural Network.
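
The selection rule described in the abstract (keep a feature only if its ICC exceeds 0.7 in every reproducibility study) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not state which ICC form was used, so the two-way random, absolute-agreement, single-measurement form ICC(2,1) is an assumption here, as are the function names `icc_2_1` and `select_reproducible`, the array layout (lesions × repeated conditions × features), and the toy data.

```python
import numpy as np


def icc_2_1(x):
    """ICC(2,1) for an array shaped (n_subjects, k_conditions).

    Assumption: two-way random effects, absolute agreement, single
    measurement. The source abstract does not specify the ICC variant.
    """
    x = np.asarray(x, dtype=float)
    n, k = x.shape
    grand = x.mean()
    # Sums of squares for subjects (rows), conditions (columns), residual.
    ss_rows = k * np.sum((x.mean(axis=1) - grand) ** 2)
    ss_cols = n * np.sum((x.mean(axis=0) - grand) ** 2)
    ss_err = np.sum((x - grand) ** 2) - ss_rows - ss_cols
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def select_reproducible(studies, threshold=0.7):
    """Return indices of features whose ICC exceeds `threshold` in all studies.

    `studies` is a list of arrays shaped (n_lesions, n_conditions, n_features),
    one array per reproducibility study (hypothetical layout).
    """
    n_features = studies[0].shape[2]
    keep = np.ones(n_features, dtype=bool)
    for study in studies:
        iccs = np.array([icc_2_1(study[:, :, f]) for f in range(n_features)])
        keep &= iccs > threshold
    return np.flatnonzero(keep)


# Toy data: 3 lesions measured under 2 conditions.
stable = np.array([[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])    # identical repeats
unstable = np.array([[1.0, 3.0], [2.0, 1.0], [3.0, 2.0]])  # lesion order changes
study = np.stack([stable, unstable], axis=2)               # shape (3, 2, 2)

# Three studies, standing in for kernel, acquisition, and inter-observer.
studies = [study, study, study]
selected = select_reproducible(studies)
print(selected)
```

In this toy case only the first feature is kept, because its repeated measurements agree exactly while the second feature's do not. With real data, each study would contribute its own array of repeated measurements, and a feature failing the threshold in any one study would be discarded.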