Transmembrane Protein Alignment and Fold Recognition Based on Predicted Topology

Background Although Transmembrane Proteins (TMPs) are highly important in various biological processes and pharmaceutical developments, general prediction of TMP structures is still far from satisfactory. Because TMPs have significantly different physicochemical properties from soluble proteins, current protein structure prediction tools for soluble proteins may not work well for TMPs. With the increasing number of experimental TMP structures available, template-based methods have the potential to become broadly applicable for TMP structure prediction. However, the current fold recognition methods for TMPs are not as well developed as they are for soluble proteins. Methodology We developed a novel TMP Fold Recognition method, TMFR, to recognize TMP folds based on sequence-to-structure pairwise alignment. The method utilizes topology-based features in alignment together with sequence profile and solvent accessibility. It also incorporates a gap penalty that depends on predicted topology structure segments. Given the difference between α-helical transmembrane protein (αTMP) and β-strands transmembrane protein (βTMP), parameters of scoring functions are trained respectively for these two protein categories using 58 αTMPs and 17 βTMPs in a non-redundant training dataset. Results We compared our method with HHalign, a leading alignment tool using a non-redundant testing dataset including 72 αTMPs and 30 βTMPs. Our method achieved 10% and 9% better accuracies than HHalign in αTMPs and βTMPs, respectively. The raw score generated by TMFR is negatively correlated with the structure similarity between the target and the template, which indicates its effectiveness for fold recognition. The result demonstrates TMFR provides an effective TMP-specific fold recognition and alignment method.

[1]  A. Krogh,et al.  A combined transmembrane topology and signal peptide prediction method. , 2004, Journal of molecular biology.

[2]  Markus Porto,et al.  High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABERTOOTH , 2010, BMC Bioinformatics.

[3]  Maya Topf,et al.  PREDICT modeling and in‐silico screening for G‐protein coupled receptors , 2004, Proteins.

[4]  K. Diederichs,et al.  Crystal structure of Omp32, the anion-selective porin from Comamonas acidovorans, in complex with a periplasmic peptide at 2.1 A resolution. , 2000, Structure.

[5]  Andrei N Lupas,et al.  Gene duplication of the eight-stranded beta-barrel OmpX produces a functional pore: a scenario for the evolution of transmembrane beta-barrels. , 2007, Journal of molecular biology.

[6]  Burkhard Rost,et al.  Refining Neural Network Predictions for Helical Transmembrane Proteins by Dynamic Programming , 1996, ISMB.

[7]  Thomas A. Hopf,et al.  Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing , 2012, Cell.

[8]  Jeffrey Skolnick,et al.  Improving threading algorithms for remote homology modeling by combining fragment and template comparisons , 2010, Proteins.

[9]  Masami Ikeda,et al.  TMPDB: a database of experimentally-characterized transmembrane topologies , 2003, Nucleic Acids Res..

[10]  Shay Bar-Haim,et al.  G protein-coupled receptors: in silico drug discovery in 3D. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  C Venclovas,et al.  Processing and analysis of CASP3 protein structure predictions , 1999, Proteins.

[12]  Sebastian Kelm,et al.  MEDELLER: homology-based coordinate generation for membrane proteins , 2010, Bioinform..

[13]  Charlotte M. Deane,et al.  MP-T: improving membrane protein alignment for structure prediction , 2013, Bioinform..

[14]  M. Gromiha,et al.  Real value prediction of solvent accessibility from amino acid sequence , 2003, Proteins.

[15]  Peter Clote,et al.  transFold: a web server for predicting the structure and residue contacts of transmembrane beta-barrels , 2006, Nucleic Acids Res..

[16]  A. Krogh,et al.  Reliability measures for membrane protein topology prediction algorithms. , 2003, Journal of molecular biology.

[17]  Hongbin Shen,et al.  MemBrain: Improving the Accuracy of Predicting Transmembrane Helices , 2008, PloS one.

[18]  S H White,et al.  MPtopo: A database of membrane protein topology , 2001, Protein science : a publication of the Protein Society.

[19]  H. Bayley,et al.  Altered Antibiotic Transport in OmpC Mutants Isolated from a Series of Clinical Strains of Multi-Drug Resistant E. coli , 2011, PloS one.

[20]  Zsuzsanna Dosztányi,et al.  Transmembrane proteins in the Protein Data Bank: identification and classification , 2004, Bioinform..

[21]  Aiping Wu,et al.  Incorporation of Local Structural Preference Potential Improves Fold Recognition , 2011, PloS one.

[22]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[23]  Zsuzsanna Dosztányi,et al.  PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank , 2004, Nucleic Acids Res..

[24]  Krzysztof Fidelis,et al.  Processing and evaluation of predictions in CASP4 , 2001, Proteins.

[25]  Yaoqi Zhou,et al.  Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-Markov-model-based method , 2003 .

[26]  T. Bhat,et al.  The Protein Data Bank and the challenge of structural genomics , 2000, Nature Structural Biology.

[27]  L. Tamm,et al.  Folding and assembly of β-barrel membrane proteins , 2004 .

[28]  Han Wang,et al.  Improving transmembrane protein consensus topology prediction using inter-helical interaction. , 2012, Biochimica et biophysica acta.

[29]  G. Schulz,et al.  Asymmetric conductivity of engineered porins. , 2002, Protein engineering.

[30]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[31]  Pierre Baldi,et al.  A machine learning information retrieval approach to protein fold recognition. , 2006, Bioinformatics.

[32]  Dong Xu,et al.  PROSPECT II: protein structure prediction program for genome-scale applications. , 2003, Protein engineering.

[33]  Nagarajan Vaidehi,et al.  First principles predictions of the structure and function of g-protein-coupled receptors: validation for bovine rhodopsin. , 2004, Biophysical journal.

[34]  Hongyi Zhou,et al.  Fold recognition by combining sequence profiles derived from evolution and from depth‐dependent structural alignment of fragments , 2004, Proteins.

[35]  L. Tamm,et al.  Folding and assembly of beta-barrel membrane proteins. , 2004, Biochimica et biophysica acta.

[36]  A. Kernytsky,et al.  Transmembrane helix predictions revisited , 2002, Protein science : a publication of the Protein Society.

[37]  Yaoqi Zhou,et al.  Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates , 2011, Bioinform..

[38]  Peter L. Freddolino,et al.  The predicted 3D structure of the human D2 dopamine receptor and the binding site and binding affinities for agonists and antagonists. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[39]  S. Iwata,et al.  Architecture of Succinate Dehydrogenase and Reactive Oxygen Species Generation , 2003, Science.

[40]  A G Murzin,et al.  CASP2 knowledge‐based approach to distant homology recognition and fold prediction in CASP4 , 2001, Proteins.

[41]  Birgit Eisenhaber,et al.  TM or not TM: transmembrane protein prediction with low false positive rate using DAS-TMfilter , 2004, Bioinform..

[42]  Timothy Nugent,et al.  Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis , 2012, Proceedings of the National Academy of Sciences.

[43]  Derek P. Ng,et al.  Membrane protein misassembly in disease. , 2012, Biochimica et biophysica acta.

[44]  B. Honig,et al.  On the accuracy of homology modeling and sequence alignment methods applied to membrane proteins. , 2006, Biophysical journal.

[45]  Guang R. Gao,et al.  An improved hidden Markov model for transmembrane protein detection and topology prediction and its applications to complete genomes , 2005, Bioinform..

[46]  Sitao Wu,et al.  MUSTER: Improving protein sequence profile–profile alignments by using multiple sources of structure information , 2008, Proteins.

[47]  A. Godzik The structural alignment between two proteins: Is there a unique answer? , 1996, Protein science : a publication of the Protein Society.

[48]  Shigeki Mitaku,et al.  SOSUI: classification and secondary structure prediction system for membrane proteins , 1998, Bioinform..

[49]  D. Baker,et al.  Multipass membrane protein structure prediction using Rosetta , 2005, Proteins.

[50]  Aleksey A. Porollo,et al.  Combining prediction of secondary structure and solvent accessibility in proteins , 2005, Proteins.

[51]  Gerhard Hessler,et al.  Drug Design Strategies for Targeting G‐Protein‐Coupled Receptors , 2002, Chembiochem : a European journal of chemical biology.

[52]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[53]  R. Stevens,et al.  FoldGPCR: Structure prediction protocol for the transmembrane domain of G protein‐coupled receptors from class A , 2010, Proteins.

[54]  G. Heijne The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans‐membrane topology , 1986, The EMBO journal.

[55]  Mukta Phatak,et al.  Solvent and lipid accessibility prediction as a basis for model quality assessment in soluble and membrane proteins. , 2011, Current protein & peptide science.

[56]  David T. Jones,et al.  Transmembrane protein topology prediction using support vector machines , 2009, BMC Bioinformatics.

[57]  Douglas C. Rees,et al.  Crystallographic Studies of the Escherichia coliQuinol-Fumarate Reductase with Inhibitors Bound to the Quinol-binding Site* , 2002, The Journal of Biological Chemistry.

[58]  David T. Jones,et al.  Improving the accuracy of transmembrane protein topology prediction using evolutionary information , 2007, Bioinform..

[59]  Gerhard Hessler,et al.  Drug Design Strategies for Targeting G-Protein-Coupled Receptors , 2002 .

[60]  Shandar Ahmad,et al.  TMBETA-NET: discrimination and prediction of membrane spanning β-strands in outer membrane proteins , 2005, Nucleic Acids Res..

[61]  Jie Liang,et al.  Computational studies of membrane proteins: models and predictions for biological understanding. , 2012, Biochimica et biophysica acta.

[62]  Mikael Bodén,et al.  Predicting the solvent accessibility of transmembrane residues from protein sequence. , 2006, Journal of proteome research.

[63]  Srinivas Devadas,et al.  Modeling ensembles of transmembrane beta-barrel proteins. , 2008, Proteins.

[64]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[65]  E. Berry,et al.  3-Nitropropionic Acid Is a Suicide Inhibitor of Mitochondrial Respiration That, upon Oxidation by Complex II, Forms a Covalent Adduct with a Catalytic Base Arginine in the Active Site of the Enzyme* , 2005, Journal of Biological Chemistry.

[66]  Erik L. L. Sonnhammer,et al.  A Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences , 1998, ISMB.

[67]  Marco Punta,et al.  Membrane protein prediction methods. , 2007, Methods.

[68]  Manuel G. Claros,et al.  TopPred II: an improved software for membrane protein structure predictions , 1994, Comput. Appl. Biosci..

[69]  Yu-Yen Ou,et al.  Prediction of membrane spanning segments and topology in β‐barrel membrane proteins at better accuracy , 2010, J. Comput. Chem..

[70]  Song Liu,et al.  Fold recognition by concurrent use of solvent accessibility and residue depth , 2007, Proteins.

[71]  Jooyoung Lee,et al.  De novo protein structure prediction by dynamic fragment assembly and conformational space annealing , 2011, Proteins.

[72]  Arne Elofsson,et al.  Improved detection of homologous membrane proteins by inclusion of information from topology predictions , 2002, Protein science : a publication of the Protein Society.

[73]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[74]  Wen-Lian Hsu,et al.  Enhanced membrane protein topology prediction using a hierarchical classification method and a new scoring function. , 2008, Journal of proteome research.

[75]  Torsten Schwede,et al.  BIOINFORMATICS Bioinformatics Advance Access published November 12, 2005 The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling , 2022 .

[76]  Lukas Käll,et al.  Reliability of transmembrane predictions in whole‐genome data , 2002, FEBS letters.

[77]  Y Xu,et al.  Protein threading using PROSPECT: Design and evaluation , 2000, Proteins.

[78]  J. Skolnick,et al.  Ab initio modeling of small proteins by iterative TASSER simulations , 2007, BMC Biology.

[79]  Arne Elofsson,et al.  MPRAP: An accessibility predictor for a-helical transmem-brane proteins that performs well inside and outside the membrane , 2010, BMC Bioinformatics.

[80]  A. Elofsson,et al.  Best α‐helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information , 2004 .

[81]  Yang Zhang,et al.  How significant is a protein structure similarity with TM-score = 0.5? , 2010, Bioinform..

[82]  Robert Giegerich,et al.  A systematic approach to dynamic programming in bioinformatics , 2000, Bioinform..

[83]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[84]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[85]  Pierre Baldi,et al.  TMBpro: secondary structure, beta-contact and tertiary structure prediction of transmembrane beta-barrel proteins , 2008, Bioinform..

[86]  Andrei N. Lupas,et al.  Gene Duplication of the Eight-stranded β-Barrel OmpX Produces a Functional Pore: A Scenario for the Evolution of Transmembrane β-Barrels , 2007 .

[87]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[88]  Stavros J. Hamodrakas,et al.  PRED-TMBB: a web server for predicting the topology of ?barrel outer membrane proteins , 2004, Nucleic Acids Res..

[89]  J. Bowie,et al.  Membrane channel structure of Helicobacter pylori vacuolating toxin: role of multiple GXXXG motifs in cylindrical channels. , 2004, Proceedings of the National Academy of Sciences of the United States of America.