MemBrain: An Easy-to-Use Online Webserver for Transmembrane Protein Structure Prediction

AbstractMembrane proteins are an important kind of proteins embedded in the membranes of cells and play crucial roles in living organisms, such as ion channels, transporters, receptors. Because it is difficult to determinate the membrane protein’s structure by wet-lab experiments, accurate and fast amino acid sequence-based computational methods are highly desired. In this paper, we report an online prediction tool called MemBrain, whose input is the amino acid sequence. MemBrain consists of specialized modules for predicting transmembrane helices, residue–residue contacts and relative accessible surface area of α-helical membrane proteins. MemBrain achieves a prediction accuracy of 97.9% of ATMH, 87.1% of AP, 3.2 ± 3.0 of N-score, 3.1 ± 2.8 of C-score. MemBrain-Contact obtains 62%/64.1% prediction accuracy on training and independent dataset on top L/5 contact prediction, respectively. And MemBrain-Rasa achieves Pearson correlation coefficient of 0.733 and its mean absolute error of 13.593. These prediction results provide valuable hints for revealing the structure and function of membrane proteins. MemBrain web server is free for academic use and available at www.csbio.sjtu.edu.cn/bioinf/MemBrain/.

[1]  Kuo-Chen Chou,et al.  MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. , 2007, Biochemical and biophysical research communications.

[2]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[3]  Jing Yang,et al.  R2C: improving ab initio residue contact map prediction using dynamic fusion strategy and Gaussian noise filter , 2016, Bioinform..

[4]  Kuo-Chen Chou,et al.  Fuzzy KNN for predicting membrane protein types from pseudo-amino acid composition. , 2006, Journal of theoretical biology.

[5]  Martin S. Taylor,et al.  Architectural Organization of the Metabolic Regulatory Enzyme Ghrelin O-Acyltransferase* , 2013, The Journal of Biological Chemistry.

[6]  Andrei L. Lomize,et al.  OPM: Orientations of Proteins in Membranes database , 2006, Bioinform..

[7]  Hong-Bin Shen,et al.  Prediction Enhancement of Residue Real-Value Relative Accessible Surface Area in Transmembrane Helical Proteins by Solving the Output Preference Problem of Machine Learning-Based Predictors , 2015, J. Chem. Inf. Model..

[8]  D. Frishman,et al.  Prediction of helix–helix contacts and interacting helices in polytopic membrane proteins using neural networks , 2009, Proteins.

[9]  David T. Jones,et al.  Predicting Transmembrane Helix Packing Arrangements using Residue Contacts and a Force-Directed Algorithm , 2010, PLoS Comput. Biol..

[10]  Lijun Liu,et al.  Evolution of the α-Subunit of Na/K-ATPase from Paramecium to Homo sapiens: Invariance of Transmembrane Helix Topology , 2016, Journal of Molecular Evolution.

[11]  Zsuzsanna Dosztányi,et al.  PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank , 2004, Nucleic Acids Res..

[12]  A Elofsson,et al.  Prediction of transmembrane alpha-helices in prokaryotic membrane proteins: the dense alignment surface method. , 1997, Protein engineering.

[13]  A. Hopkins,et al.  The druggable genome , 2002, Nature Reviews Drug Discovery.

[14]  Zhen Li,et al.  Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model , 2016, bioRxiv.

[15]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[16]  Jens Meiler,et al.  Solvent accessible surface area approximations for rapid and accurate protein structure prediction , 2009, Journal of molecular modeling.

[17]  Pierre Baldi,et al.  Deep architectures for protein contact map prediction , 2012, Bioinform..

[18]  Zheng Yuan,et al.  SVMtm: Support vector machines to predict transmembrane segments , 2004, J. Comput. Chem..

[19]  E. Butelman,et al.  Pharmacotherapy of addictions , 2002, Nature Reviews Drug Discovery.

[20]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[21]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[22]  Hongbin Shen,et al.  MemBrain: Improving the Accuracy of Predicting Transmembrane Helices , 2008, PloS one.

[23]  S. Gebhard,et al.  Identification of Regions Important for Resistance and Signalling within the Antimicrobial Peptide Transporter BceAB of Bacillus subtilis , 2013, Journal of bacteriology.

[24]  Seung-Yeon Kim,et al.  Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method , 2005, Bioinform..

[25]  Massimiliano Pontil,et al.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments , 2012, Bioinform..

[26]  István Simon,et al.  TOPDB: topology data bank of transmembrane proteins , 2007, Nucleic Acids Res..

[27]  Hong-Bin Shen,et al.  Enhancing the Prediction of Transmembrane β-Barrel Segments with Chain Learning and Feature Sparse Representation , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Yang Zhang,et al.  Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins , 2015, Bioinform..

[29]  Yang Zhang,et al.  High-accuracy prediction of transmembrane inter-helix contacts and application to GPCR 3D structure modeling , 2013, Bioinform..