BeEP Server: using evolutionary information for quality assessment of protein structure models

The BeEP Server (http://www.embnet.qb.fcen.uba.ar/embnet/beep.php) is an online resource aimed to help in the endgame of protein structure prediction. It is able to rank submitted structural models of a protein through an explicit use of evolutionary information, a criterion differing from structural or energetic considerations commonly used in other assessment programs. The idea behind BeEP (Best Evolutionary Pattern) is to benefit from the substitution pattern derived from structural constraints present in a set of homologous proteins adopting a given protein conformation. The BeEP method uses a model of protein evolution that takes into account the structure of a protein to build site-specific substitution matrices. The suitability of these substitution matrices is assessed through maximum likelihood calculations from which position-specific and global scores can be derived. These scores estimate how well the structural constraints derived from each structural model are represented in a sequence alignment of homologous proteins. Our assessment on a subset of proteins from the Critical Assessment of techniques for protein Structure Prediction (CASP) experiment has shown that BeEP is capable of discriminating the models and selecting one or more native-like structures. Moreover, BeEP is not explicitly parameterized to find structural similarities between models and given targets, potentially helping to explore the conformational ensemble of the native state.

[1]  Gustavo D. Parisi,et al.  PCDB: a database of protein conformational diversity , 2010, Nucleic Acids Res..

[2]  A. Godzik,et al.  Global distribution of conformational states derived from redundant models in the PDB points to non-uniqueness of the protein structure , 2009, Proceedings of the National Academy of Sciences.

[3]  J. Echave,et al.  Generality of the structurally constrained protein evolution model: assessment on representatives of the four main fold classes. , 2005, Gene.

[4]  H. Akaike A new look at the statistical model identification , 1974 .

[5]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[6]  Nir Ben-Tal,et al.  Quality assessment of protein model-structures using evolutionary conservation , 2010, Bioinform..

[7]  María Silvina Fornasari,et al.  Site-specific amino acid replacement matrices from structurally constrained protein evolution simulations. , 2002, Molecular biology and evolution.

[8]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[9]  Hervé Philippe,et al.  Statistical potentials for improved structurally constrained evolutionary models. , 2010, Molecular biology and evolution.

[10]  M. Kimmel,et al.  Conflict of interest statement. None declared. , 2010 .

[11]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[12]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[13]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[14]  Michele Vendruscolo,et al.  Neutral evolution of model proteins: diffusion in sequence space and overdispersion. , 1998, Journal of theoretical biology.

[15]  Mark A. Wilson,et al.  Intrinsic motions along an enzymatic reaction trajectory , 2007, Nature.

[16]  Ezequiel I. Juritz,et al.  Protein conformational diversity modulates sequence divergence. , 2013, Molecular biology and evolution.

[17]  Daisuke Kihara,et al.  Quality assessment of protein structure models. , 2009, Current protein & peptide science.

[18]  Silvio C. E. Tosatto,et al.  The Victor/FRST Function for Model Quality Estimation , 2005, J. Comput. Biol..

[19]  Dan S. Tawfik,et al.  Protein Dynamism and Evolvability , 2009, Science.

[20]  Burkhard Rost,et al.  Evaluation of template‐based models in CASP8 with standard measures , 2009, Proteins.

[21]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[22]  Frances M. G. Pearl,et al.  The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution , 2006, Nucleic Acids Res..

[23]  B. Rost,et al.  Critical assessment of methods of protein structure prediction—Round VIII , 2009, Proteins.

[24]  Jean-Christophe Nebel,et al.  Quality assessment of protein model-structures based on structural and functional similarities , 2012, BMC Bioinformatics.

[25]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[26]  David T. Jones,et al.  Protein evolution with dependence among codons due to tertiary structure. , 2003, Molecular biology and evolution.

[27]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.

[28]  J. Echave,et al.  The structurally constrained protein evolution model accounts for sequence patterns of the LβH superfamily , 2004, BMC Evolutionary Biology.

[29]  L. Kavraki,et al.  An accurate, sensitive, and scalable method to identify functional sites in protein structures. , 2003, Journal of molecular biology.

[30]  R. Nussinov,et al.  Folding and binding cascades: Dynamic landscapes and population shifts , 2008, Protein science : a publication of the Protein Society.

[31]  Ceslovas Venclovas,et al.  Progress over the first decade of CASP experiments , 2005, Proteins.

[32]  R. Nussinov,et al.  Folding funnels, binding funnels, and protein function , 1999, Protein science : a publication of the Protein Society.

[33]  Usha K Muppirala,et al.  A simple approach for protein structure discrimination based on the network pattern of conserved hydrophobic residues. , 2006, Protein engineering, design & selection : PEDS.

[34]  Ozlem Keskin,et al.  Restricted mobility of conserved residues in protein-protein interfaces in molecular simulations. , 2008, Biophysical journal.

[35]  J. Echave,et al.  Structural constraints and emergence of sequence patterns in protein evolution. , 2001, Molecular biology and evolution.

[36]  Anna Tramontano,et al.  Assessment of predictions in the model quality assessment category , 2007, Proteins.

[37]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[38]  Diego J. Zea,et al.  Protein conformational diversity correlates with evolutionary rate. , 2013, Molecular biology and evolution.

[39]  John P. Overington,et al.  Tertiary structural constraints on protein evolutionary diversity: templates, key residues and structure prediction , 1990, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[40]  Leszek Rychlewski,et al.  FFAS03: a server for profile–profile sequence alignments , 2005, Nucleic Acids Res..

[41]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[42]  M. Karplus,et al.  Molecular dynamics and protein function. , 2005, Proceedings of the National Academy of Sciences of the United States of America.