Quality assessment of protein model-structures using evolutionary conservation

Motivation: Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. Results: We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. Availability: A perl implementation of the method, as well as the various perl and R scripts used for the analysis are available at http://bental.tau.ac.il/ConQuass/. Contact: nirb@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[2]  Geoffrey Chang,et al.  Flexibility in the ABC transporter MsbA: Alternating access with a twist , 2007, Proceedings of the National Academy of Sciences.

[3]  Joshua D. Knowles,et al.  Artefacts and biases affecting the evaluation of scoring functions on decoy sets for protein structure prediction , 2009, Bioinform..

[4]  C. Branden,et al.  Introduction to protein structure , 1991 .

[5]  R Samudrala,et al.  Decoys ‘R’ Us: A database of incorrect conformations to improve protein structure prediction , 2000, Protein science : a publication of the Protein Society.

[6]  Geoffrey Chang,et al.  X-ray structure of EmrE supports dual topology model , 2007, Proceedings of the National Academy of Sciences.

[7]  Jianlin Cheng,et al.  Prediction of global and local quality of CASP8 models by MULTICOM series , 2009, Proteins.

[8]  J. Skolnick,et al.  Automated structure prediction of weakly homologous proteins on a genomic scale. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Gerard J. Kleywegt,et al.  On vital aid: the why, what and how of validation , 2009, Acta crystallographica. Section D, Biological crystallography.

[10]  Nir Ben-Tal,et al.  Model Structure of the Na+/H+ Exchanger 1 (NHE1) , 2007, Journal of Biological Chemistry.

[11]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[12]  Anna Tramontano,et al.  Assessment of predictions in the model quality assessment category , 2007, Proteins.

[13]  D. Eisenberg,et al.  VERIFY3D: assessment of protein models with three-dimensional profiles. , 1997, Methods in enzymology.

[15]  Anna Tramontano,et al.  Evaluation of CASP8 model quality predictions , 2009, Proteins.

[16]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[17]  Genki Terashi,et al.  Fams‐ace: A combined method to select the best model after remodeling all server models , 2007, Proteins.

[18]  Keehyoung Joo,et al.  Quality Assessment of Protein Models , 2011 .

[19]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[20]  K. Henrick,et al.  Inference of macromolecular assemblies from crystalline state. , 2007, Journal of molecular biology.

[21]  Barry Honig,et al.  Local quality assessment in homology models using statistical potentials and support vector machines , 2007, Protein science : a publication of the Protein Society.

[22]  Manfred J. Sippl,et al.  Thirty years of environmental health research--and growing. , 1996, Nucleic Acids Res..

[23]  O. Lichtarge,et al.  A family of evolution-entropy hybrid methods for ranking protein residues by importance. , 2004, Journal of molecular biology.

[24]  K. Ginalski Comparative modeling for protein structure prediction. , 2006, Current opinion in structural biology.

[25]  David S. Eisenberg,et al.  Using inferred residue contacts to distinguish between correct and incorrect protein models , 2008, Bioinform..

[26]  A. Sali,et al.  Modeller: generation and refinement of homology-based protein structure models. , 2003, Methods in enzymology.

[27]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[28]  Arne Elofsson,et al.  Quality Assessment of Protein Models , 2008 .

[29]  Sarel J. Fleishman,et al.  A Cα model for the transmembrane α helices of gap junction intercellular channels , 2004 .

[30]  Janusz M. Bujnicki,et al.  Prediction of protein structures, functions, and interactions , 2008 .

[31]  Dan Halperin,et al.  Quasi-symmetry in the cryo-EM structure of EmrE provides the key to modeling its transmembrane domain. , 2006, Journal of molecular biology.

[32]  So Nakagawa,et al.  Structure of the connexin 26 gap junction channel at 3.5 Å resolution , 2009, Nature.

[33]  Tal Pupko,et al.  ConSurf: Identification of Functional Regions in Proteins by Surface-Mapping of Phylogenetic Information , 2003, Bioinform..

[34]  Liam J. McGuffin Prediction of global and local model quality in CASP8 using the ModFOLD server , 2009, Proteins.

[35]  Hideki Tachibana,et al.  Comprehensive secondary‐structure analysis of disulfide variants of lysozyme by synchrotron‐radiation vacuum‐ultraviolet circular dichroism , 2009, Proteins.

[36]  Nir Ben-Tal,et al.  Detection of functionally important regions in "hypothetical proteins" of known structure. , 2008, Structure.

[37]  B. Rost,et al.  Critical assessment of methods of protein structure prediction—Round VIII , 2009, Proteins.

[38]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[39]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[40]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[41]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[42]  Arne Elofsson,et al.  Assessment of global and local model quality in CASP8 using Pcons and ProQ , 2009, Proteins.

[43]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[44]  Gianluca Pollastri,et al.  Beyond the Twilight Zone: Automated prediction of structural properties of proteins by recursive neural networks and remote homology information , 2009, Proteins.

[45]  Richard A Friesner,et al.  An Automatic Method for Predicting Transmembrane Protein Structures Using Cryo-em and Evolutionary Data , 2004 .

[46]  Silvio C. E. Tosatto,et al.  Global and local model quality estimation at CASP8 using the scoring functions QMEAN and QMEANclust , 2009, Proteins.

[47]  O. Schueler‐Furman,et al.  Conserved residue clustering and protein structure prediction , 2003, Proteins.

[48]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[49]  O. Lichtarge,et al.  Combining inference from evolution and geometric probability in protein structure evaluation. , 2003, Journal of molecular biology.

[50]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[51]  Nir Ben-Tal,et al.  The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures , 2008, Nucleic Acids Res..

[52]  B. Rost,et al.  Effective use of sequence correlation and conservation in fold recognition. , 1999, Journal of molecular biology.

[53]  Sarel J Fleishman,et al.  A Calpha model for the transmembrane alpha helices of gap junction intercellular channels. , 2004, Molecular cell.

[54]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[55]  Dmitri K Klimov,et al.  Interpeptide interactions induce helix to strand structural transition in Aβ peptides , 2009, Proteins.

[56]  Kevin Karplus,et al.  Applying Undertaker to quality assessment , 2009, Proteins.

[57]  Clifford A Goudey,et al.  Aquaculture in Offshore Zones , 2006, Science.

[58]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[59]  Usha K Muppirala,et al.  A simple approach for protein structure discrimination based on the network pattern of conserved hydrophobic residues. , 2006, Protein engineering, design & selection : PEDS.

[60]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..