MEDELLER: homology-based coordinate generation for membrane proteins

Motivation: Membrane proteins (MPs) are important drug targets but knowledge of their exact structure is limited to relatively few examples. Existing homology-based structure prediction methods are designed for globular, water-soluble proteins. However, we are now beginning to have enough MP structures to justify the development of a homology-based approach specifically for them. Results: We present a MP-specific homology-based coordinate generation method, MEDELLER, which is optimized to build highly reliable core models. The method outperforms the popular structure prediction programme Modeller on MPs. The comparison of the two methods was performed on 616 target–template pairs of MPs, which were classified into four test sets by their sequence identity. Across all targets, MEDELLER gave an average backbone root mean square deviation (RMSD) of 2.62 Å versus 3.16 Å for Modeller. On our ‘easy’ test set, MEDELLER achieves an average accuracy of 0.93 Å backbone RMSD versus 1.56 Å for Modeller. Availability and Implementation: http://medeller.info; Implemented in Python, Bash and Perl CGI for use on Linux systems; Supplementary data are available at http://www.stats.ox.ac.uk/proteins/resources. Contact: kelm@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[2]  Yoonjoo Choi,et al.  FREAD revisited: Accurate loop structure prediction using a database search algorithm , 2010, Proteins.

[3]  Zsuzsanna Dosztányi,et al.  PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank , 2004, Nucleic Acids Res..

[4]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..

[5]  T L Blundell,et al.  FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. , 2001, Journal of molecular biology.

[6]  Charlotte M. Deane,et al.  JOY: protein sequence-structure representation and analysis , 1998, Bioinform..

[7]  Shandar Ahmad,et al.  Application of residue distribution along the sequence for discriminating outer membrane proteins , 2005, Comput. Biol. Chem..

[8]  C Venclovas,et al.  Processing and analysis of CASP3 protein structure predictions , 1999, Proteins.

[9]  Manuel C. Peitsch,et al.  SWISS-MODEL: an automated protein homology-modeling server , 2003, Nucleic Acids Res..

[10]  John P. Overington,et al.  Derivation of rules for comparative protein modeling from a database of protein structure alignments , 1994, Protein science : a publication of the Protein Society.

[11]  Victor Guallar,et al.  Role of TolC in Klebsiella oxytoca resistance to antibiotics. , 2009, The Journal of antimicrobial chemotherapy.

[12]  M J Sternberg,et al.  Enhancement of protein modeling by human intervention in applying the automatic programs 3D‐JIGSAW and 3D‐PSSM , 2001, Proteins.

[13]  G. Schulz The structure of bacterial outer membrane proteins. , 2002, Biochimica et biophysica acta.

[14]  G. Heijne,et al.  Genome‐wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms , 1998, Protein science : a publication of the Protein Society.

[15]  Lei Xie,et al.  Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling , 2003, Proteins.

[16]  Charlotte M. Deane,et al.  SCORE: predicting the core of protein models , 2001, Bioinform..

[17]  B. Rost,et al.  Critical assessment of methods of protein structure prediction—Round VIII , 2009, Proteins.

[18]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[19]  B. Honig,et al.  On the accuracy of homology modeling and sequence alignment methods applied to membrane proteins. , 2006, Biophysical journal.

[20]  G. Madhavi Sastry,et al.  Homology modeling of membrane proteins: A critical assessment , 2006, Comput. Biol. Chem..

[21]  K. Mizuguchi,et al.  An iterative structure‐assisted approach to sequence alignment and comparative modeling , 1999, Proteins.

[22]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[23]  G. von Heijne,et al.  Membrane protein structure: prediction versus reality. , 2007, Annual review of biochemistry.

[24]  Andrei L. Lomize,et al.  OPM: Orientations of Proteins in Membranes database , 2006, Bioinform..

[25]  P. Koehl,et al.  A self consistent mean field approach to simultaneous gap closure and side-chain positioning in homology modelling , 1995, Nature Structural Biology.

[26]  M. Sansom,et al.  Amino acid distributions in integral membrane protein structures. , 2001, Biochimica et biophysica acta.

[27]  Marco Punta,et al.  Membrane protein prediction methods. , 2007, Methods.

[28]  D. Baker,et al.  Multipass membrane protein structure prediction using Rosetta , 2005, Proteins.

[29]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[30]  Krzysztof Palczewski,et al.  Vertebrate Membrane Proteins: Structure, Function, and Insights from Biophysical Approaches , 2008, Pharmacological Reviews.

[31]  Arne Elofsson,et al.  All are not equal: A benchmark of different homology modeling programs , 2005, Protein science : a publication of the Protein Society.

[32]  Tim J. P. Hubbard,et al.  SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..

[33]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[34]  Janet M Thornton,et al.  Computational analysis of alpha-helical membrane protein structure: implications for the prediction of 3D structural models. , 2004, Protein engineering, design & selection : PEDS.

[35]  D Baker,et al.  Prediction of membrane protein structures with complex topologies using limited constraints , 2009, Proceedings of the National Academy of Sciences.

[36]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[37]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[38]  Tom L. Blundell,et al.  Ulla: a program for calculating environment-specific amino acid substitution tables , 2009, Bioinform..

[39]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[40]  R Sánchez,et al.  Advances in comparative protein-structure modelling. , 1997, Current opinion in structural biology.

[41]  T. Stevens,et al.  Are membrane proteins “inside‐out” proteins? , 1999, Proteins.

[42]  Lupei Du,et al.  Modeling the binding modes of Kv1.5 potassium channel and blockers. , 2008, Journal of molecular graphics & modelling.

[43]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[44]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[45]  Syma Khalid,et al.  Coarse-grained MD simulations of membrane protein-bilayer self-assembly. , 2008, Structure.

[46]  C. Deane,et al.  iMembrane: homology-based membrane-insertion of proteins , 2009, Bioinform..

[47]  Charles L. Brooks,et al.  Community-wide assessment of GPCR structure modelling and ligand docking: GPCR Dock 2008 , 2009, Nature Reviews Drug Discovery.

[48]  A K Saxena,et al.  Internet resources in GPCR modelling , 2008, SAR and QSAR in environmental research.