Incorporating post translational modification information for enhancing the predictive performance of membrane transport proteins

Transporters involved in the cellular entry and exit of ions or molecules throughout the membrane proteins and thereby play an essential role in recognizing the immune system and energy transducers. According to their relevance in proteomics, numerous studies have been conducted to analyze the transporters; especially the discrimination of their classes and subfamilies. We realized that post translational modification information had a critical role in the process of transport proteins. Therefore, in this study, we aim to incorporate post translational information with radial basis function networks to improve the predictive performance of transport proteins in major classes (channels/pores, electrochemical transporters, and active transporters) and six different families (α-type channels, β-barrel porins, pore-forming toxins, porters, PP bond hydrolysis-driven transporters, and oxidoreduction-driven transporters). The experiment results by using PSSM profiles combined with PTM information could classify the transporters into three classes and six families with five-fold cross-validation accuracy of 87.6% and 92.5%, respectively. For the independent dataset of 444 proteins, the performance with post translational modification attained the accuracy of 82.13% and 89.34% for classifying three classes and six families, respectively. Compared with the other methods and previous works, our result shows that the predictive performance is better with the accuracy improvement by 12%. We suggest that our study could become a robust model for biologists to discriminate transport proteins with high performance and understand better the function of transport proteins. Further, the contributions of this study could be fundamental for further research that can use PTM information to enhance numerous computational biology problems.

[1]  G. Heijne Membrane-protein topology , 2006, Nature Reviews Molecular Cell Biology.

[2]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[3]  Shandar Ahmad,et al.  PSSM-based prediction of DNA binding sites in proteins , 2005, BMC Bioinformatics.

[4]  V. Gomord,et al.  Posttranslational modification of therapeutic proteins in plants. , 2004, Current opinion in plant biology.

[5]  Hwee Tong Tan,et al.  Membrane proteins and membrane proteomics , 2008, Proteomics.

[6]  C. A. Andersen,et al.  Prediction of human protein function from post-translational modifications and localization features. , 2002, Journal of molecular biology.

[7]  Yu-Yen Ou,et al.  Incorporating efficient radial basis function networks and significant amino acid pairs for predicting GTP binding sites in transport proteins , 2016, BMC Bioinformatics.

[8]  Milton H. Saier,et al.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information , 2005, Nucleic Acids Res..

[9]  Kuo-Chen Chou,et al.  MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. , 2007, Biochemical and biophysical research communications.

[10]  M. Ishii,et al.  Phosphoserine Phosphatase Is Required for Serine and One-Carbon Unit Synthesis in Hydrogenobacter thermophilus , 2017, Journal of bacteriology.

[11]  M. Michael Gromiha,et al.  Functional discrimination of membrane proteins using machine learning techniques , 2008, BMC Bioinformatics.

[12]  R. Stoltenburg,et al.  Post‐translational modifications of the AFET3 gene product—a component of the iron transport system in budding cells and mycelia of the yeast Arxula adeninivorans , 2002, Yeast.

[13]  Yu-Yen Ou,et al.  Identifying the molecular functions of electron transport proteins using radial basis function networks and biochemical properties. , 2017, Journal of molecular graphics & modelling.

[14]  Yingbo Li,et al.  Detection and tracking of overlapping cell nuclei for large scale mitosis analyses , 2016, BMC Bioinformatics.

[15]  K Nishikawa,et al.  Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies. , 1994, Journal of molecular biology.

[16]  Fanfan Zhou,et al.  The Role of N-Linked Glycosylation in Protein Folding, Membrane Targeting, and Substrate Binding of Human Organic Anion Transporter hOAT4 , 2005, Molecular Pharmacology.

[17]  C. Giménez,et al.  The Role of N-Glycosylation in Transport to the Plasma Membrane and Sorting of the Neuronal Glycine Transporter GLYT2* , 2001, The Journal of Biological Chemistry.

[18]  Minoru Kanehisa,et al.  Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs , 2003, Bioinform..

[19]  C. Indiveri,et al.  Post-translational modification by acetylation regulates the mitochondrial carnitine/acylcarnitine transport protein , 2016, Molecular and Cellular Biochemistry.

[20]  Karen H. Almeida,et al.  A unified view of base excision repair: lesion-dependent protein complexes regulated by post-translational modification. , 2007, DNA repair.

[21]  Yu-Yen Ou,et al.  Incorporating deep learning with convolutional neural networks and position specific scoring matrices for identifying electron transport proteins , 2017, J. Comput. Chem..

[22]  Yuchen Liu,et al.  Expanded Cellular Amino Acid Pools Containing Phosphoserine, Phosphothreonine, and Phosphotyrosine , 2014, ACS chemical biology.

[23]  Patrick X. Zhao,et al.  Prediction of Membrane Transport Proteins and Their Substrate Specificities Using Primary Sequence Information , 2014, PloS one.

[24]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[25]  M. Dembo,et al.  Cell movement is guided by the rigidity of the substrate. , 2000, Biophysical journal.

[26]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[27]  Yu-Yen Ou,et al.  Prediction of FAD binding sites in electron transport proteins according to efficient radial basis function networks and significant amino acid pairs , 2016, BMC Bioinformatics.

[28]  M. Gromiha,et al.  Classification of transporters using efficient radial basis function networks with position‐specific scoring matrices and biochemical properties , 2010, Proteins.

[29]  Kuo-Chen Chou,et al.  Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes , 2005, Bioinform..

[30]  Sun-Yuan Kung,et al.  Transductive Learning for Multi-Label Protein Subchloroplast Localization Prediction , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[31]  Yu-Yen Ou,et al.  Protein disorder prediction by condensed PSSM considering propensity for order or disorder , 2006, BMC Bioinformatics.

[32]  I. Goldman,et al.  N-linked glycosylation and its impact on the electrophoretic mobility and function of the human proton-coupled folate transporter (HsPCFT) , 2008, Biochimica et Biophysica Acta (BBA) - Biomembranes.

[33]  Yu-Yen Ou,et al.  Prediction of transporter targets using efficient RBF networks with PSSM profiles and biochemical properties , 2011, Bioinform..

[34]  I. Coe,et al.  N-linked glycosylation of human SLC1A5 (ASCT2) transporter is critical for trafficking to membrane. , 2015, Biochimica et biophysica acta.

[35]  Sun-Yuan Kung,et al.  Ensemble Linear Neighborhood Propagation for Predicting Subchloroplast Localization of Multi-Location Proteins. , 2016, Journal of proteome research.

[36]  Yu-Yen Ou,et al.  Identification of efflux proteins using efficient radial basis function networks with position‐specific scoring matrices and biochemical properties , 2013, Proteins.

[37]  Lei Yang,et al.  Discrimination of membrane transporter protein types using K-nearest neighbor method derived from the similarity distance of total diversity measure. , 2015, Molecular bioSystems.

[38]  Shinn-Ying Ho,et al.  SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides , 2015, BMC Genomics.

[39]  Yen-Jen Oyang,et al.  A novel radial basis function network classifier with centers set by hierarchical clustering , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[40]  Yu-Yen Ou,et al.  Bioinformatics approaches for functional annotation of membrane proteins , 2014, Briefings Bioinform..