ProteoMutaMetrics

Author(s)
Jiahui Huang, Tanja Osthushenrich, Aidan MacNamara, Anders Mälarstig, Silvia Brocchetti, Samuel Bradberry, Lia Scarabottolo, Evandro Ferrada, Sergey Sosnin, Daniela Digles, Giulio Superti-Furga, Gerhard F. Ecker
Abstract

The solute carrier transporter family 6 (SLC6) is of key interest for their critical role in the transport of small amino acids or amino acid-like molecules. Their dysfunction is strongly associated with human diseases such as including schizophrenia, depression, and Parkinson's disease. Linking single point mutations to disease may support insights into the structure-function relationship of these transporters. This work aimed to develop a computational model for predicting the potential pathogenic effect of single point mutations in the SLC6 family. Missense mutation data was retrieved from UniProt, LitVar, and ClinVar, covering multiple protein-coding transcripts. As encoding approach, amino acid descriptors were used to calculate the average sequence properties for both original and mutated sequences. In addition to the full-sequence calculation, the sequences were cut into twelve domains. The domains are defined according to the transmembrane domains of the SLC6 transporters to analyse the regions' contributions to the pathogenicity prediction. Subsequently, several classification models, namely Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) with the hyperparameters optimized through grid search were built. For estimation of model performance, repeated stratified k-fold cross-validation was used. The accuracy values of the generated models are in the range of 0.72 to 0.80. Analysis of feature importance indicates that mutations in distinct regions of SLC6 transporters are associated with an increased risk for pathogenicity. When applying the model on an independent validation set, the performance in accuracy dropped to averagely 0.6 with high precision but low sensitivity scores.

Organisation(s)
Department of Pharmaceutical Sciences
External organisation(s)
Bayer BioScience nv, Emerging Science & Innovation, Wyeth, LLC, Axxam SpA, Österreichische Akademie der Wissenschaften (ÖAW)
Journal
Rsc advances
Volume
14
Pages
13083-13094
No. of pages
12
ISSN
2046-2069
DOI
https://doi.org/10.1039/d4ra00748d
Publication date
04-2024
Peer reviewed
Yes
Austrian Fields of Science 2012
104027 Computational chemistry, 301207 Pharmaceutical chemistry
ASJC Scopus subject areas
Chemistry(all), Chemical Engineering(all)
Sustainable Development Goals
SDG 3 - Good Health and Well-being
Portal url
https://ucrisportal.univie.ac.at/en/publications/proteomutametrics(cb5a2f5a-efb1-4fe7-bf47-1589deb26323).html