Classification between odorant and gustatory receptors: Supervised learning applied to arthropod proteins
Functional characterisation of proteins is limited and difficult to achieve using automatic systems based on sequence alignments which have not found evidence of the presence of odorant receptors in other arthropods than insects. In this project, a supervised learning based system is developed in or...
| Author: | |
|---|---|
| Format: | master thesis |
| Publication Date: | 2022 |
| Country: | España |
| Institution: | Universitat Oberta de Catalunya (UOC) |
| Repository: | O2, repositorio institucional de la UOC |
| OAI Identifier: | oai:openaccess.uoc.edu:10609/138687 |
| Online Access: | http://hdl.handle.net/10609/138687 |
| Access Level: | Open access |
| Keyword: | odorant receptor gustatory receptor supervised learning receptor de olores receptor gustativo aprendizaje supervisado receptor d'olors receptor gustatiu aprenentatge supervisat Bioinformatics -- TFM Bioinformàtica -- TFM Bioinformática -- TFM |
| Summary: | Functional characterisation of proteins is limited and difficult to achieve using automatic systems based on sequence alignments which have not found evidence of the presence of odorant receptors in other arthropods than insects. In this project, a supervised learning based system is developed in order to first classify insect protein sequences functionality as odorant or gustatory receptors and second, identify in non-insect arthropod sequences (annotated as gustatory receptors by alignment based systems) potential sequences that suffered a similar functional divergence as it happened in insects around 440 million years ago. Dataset sequences were obtained from UniProtKB, three different encoding methods were used to train three artificial neural networks. The results obtained confirm the efficiency of the developed model with AUC, precision and F1 score of 0.911, 0.971 and 0.926, respectively. A list of candidate sequences to have a functionality similar to that of odorant receptors in non-insect arthropods was obtained. Downstream analysis regarding functionality of these sequences should be done to corroborate the model predictions. Findings regarding the presence of proteins with similar functionality to odorant receptors in other arthropods would indicate an extraordinary evolutionary convergence. |
|---|