Classification between odorant and gustatory receptors: Supervised learning applied to arthropod proteins

Functional characterisation of proteins is limited and difficult to achieve using automatic systems based on sequence alignments which have not found evidence of the presence of odorant receptors in other arthropods than insects. In this project, a supervised learning based system is developed in or...

Descripción completa

Detalles Bibliográficos
Autor: Enríquez Romero, Félix Francisco
Tipo de recurso: tesis de maestría
Fecha de publicación:2022
País:España
Institución:Universitat Oberta de Catalunya (UOC)
Repositorio:O2, repositorio institucional de la UOC
OAI Identifier:oai:openaccess.uoc.edu:10609/138687
Acceso en línea:http://hdl.handle.net/10609/138687
Access Level:acceso abierto
Palabra clave:odorant receptor
gustatory receptor
supervised learning
receptor de olores
receptor gustativo
aprendizaje supervisado
receptor d'olors
receptor gustatiu
aprenentatge supervisat
Bioinformatics -- TFM
Bioinformàtica -- TFM
Bioinformática -- TFM
Descripción
Sumario:Functional characterisation of proteins is limited and difficult to achieve using automatic systems based on sequence alignments which have not found evidence of the presence of odorant receptors in other arthropods than insects. In this project, a supervised learning based system is developed in order to first classify insect protein sequences functionality as odorant or gustatory receptors and second, identify in non-insect arthropod sequences (annotated as gustatory receptors by alignment based systems) potential sequences that suffered a similar functional divergence as it happened in insects around 440 million years ago. Dataset sequences were obtained from UniProtKB, three different encoding methods were used to train three artificial neural networks. The results obtained confirm the efficiency of the developed model with AUC, precision and F1 score of 0.911, 0.971 and 0.926, respectively. A list of candidate sequences to have a functionality similar to that of odorant receptors in non-insect arthropods was obtained. Downstream analysis regarding functionality of these sequences should be done to corroborate the model predictions. Findings regarding the presence of proteins with similar functionality to odorant receptors in other arthropods would indicate an extraordinary evolutionary convergence.