Contributions of Structure Comparison Methods to the Protein Structure Prediction Field

Since their development, structure comparison methods have contributed to advance our understanding of protein structure and evolution (Greene et al, 2007; Hasegawa & Holm, 2009), to help the development of structural genomics projects (Pearl et al, 2005), to improve protein function annotations (D. A. Lee et al), etc, thus becoming an essential tool in structural bioinformatics. In recent years, their application range has grown to include the protein structure prediction field, were they are used to evaluate overall prediction quality (Jauch et al, 2007; Venclovas et al, 2001; Vincent et al, 2005; G. Wang et al, 2005), to identify a protein’s fold from low-resolution models (Bonneau et al, 2002; de la Cruz et al, 2002), etc. In this chapter, after briefly reviewing some of these applications, we show how structure comparison methods can also be used for local quality assessment of low-resolution models and how this information can help refine/improve them. Quality assessment is becoming an important research topic in structural bioinformatics because model quality determines the applicability of structure predictions (Cozzetto et al, 2007). Also, because prediction technology is now easily available and potential end-users of prediction methods, from template-based (comparative modeling and threading) to de novo methods, are no longer specialized structural bioinformaticians. Quality assessment methods have been routinely used for many years in structural biology in the evaluation of experimental models. These methods focus on several features of the protein structure (see (Laskowski et al, 1998) and (Kleywegt, 2000) and references therein). Because a number of quality issues are common to both experimental and predicted models, the use of these methods has been naturally extended to the evaluation of structure predictions. For example, in the case of homology modeling, a widely used structure prediction technique, evaluation of models with PROCHECK (Laskowski et al, 1993), WHAT-CHECK (Hooft et al, 1997), PROSA (Sippl, 1993), and others (see (Marti-Renom et al, 2000) and references therein) is part of the standard prediction protocol; WHATIF (Vriend, 1990) and PROSA (Sippl, 1993) have also been used in the CASP experiment to assess comparative models (Venclovas, 2001; Williams et al, 2001); etc. Some quality assessment problems are unique to the structure prediction field, given the specific characteristics of computational models, and have led to the development of

[1]  Roland L Dunbrack,et al.  Assessment of fold recognition predictions in CASP6 , 2005, Proteins.

[2]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[3]  Jianlin Cheng,et al.  Evaluating the absolute quality of a single protein model using structural features and support vector machines , 2009, Proteins.

[4]  Arne Elofsson,et al.  Assessment of global and local model quality in CASP8 using Pcons and ProQ , 2009, Proteins.

[5]  A. Elofsson,et al.  Can correct protein models be identified? , 2003, Protein science : a publication of the Protein Society.

[6]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[7]  C Venclovas,et al.  Comparative modeling of CASP4 target proteins: Combining results of sequence search with three‐dimensional structure assessment , 2001, Proteins.

[8]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[9]  B. Davletov,et al.  Sculpting a domain by splicing , 2004, Nature Structural &Molecular Biology.

[10]  Richard Bonneau,et al.  De novo prediction of three-dimensional structures for major protein families. , 2002, Journal of molecular biology.

[11]  Sarah A. Teichmann,et al.  The (In)dependence of Alternative Splicing and Gene Duplication , 2007, PLoS Comput. Biol..

[12]  Arne Elofsson,et al.  Pcons5: combining consensus, structural evaluation and fold recognition scores , 2005, Bioinform..

[13]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[14]  Arne Elofsson,et al.  Identification of correct regions in protein models using structural, alignment, and consensus information , 2006, Protein science : a publication of the Protein Society.

[15]  T L Blundell,et al.  Sequence‐structure homology recognition by iterative alignment refinement and comparative modeling , 2001, Proteins.

[16]  Frances M. G. Pearl,et al.  The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis , 2004, Nucleic Acids Res..

[17]  J Lundström,et al.  Pcons: A neural‐network–based consensus predictor that improves fold recognition , 2001, Protein science : a publication of the Protein Society.

[18]  C. Orengo,et al.  A rapid method of protein structure alignment. , 1990, Journal of theoretical biology.

[19]  J M Thornton,et al.  Validation of protein models derived from experiment. , 1998, Current opinion in structural biology.

[20]  Krzysztof Fidelis,et al.  CASP8 results in context of previous experiments , 2009, Proteins.

[21]  Qi Wang,et al.  Bioinformatics analysis of alternative splicing , 2005, Briefings Bioinform..

[22]  Anna Tramontano,et al.  Assessment of predictions in the model quality assessment category , 2007, Proteins.

[23]  David C. Jones,et al.  Potential energy functions for threading. , 1996, Current opinion in structural biology.

[24]  Arne Elofsson,et al.  Prediction of global and local model quality in CASP7 using Pcons and ProQ , 2007, Proteins.

[25]  H. Scheraga,et al.  Role of interchain interactions in the stabilization of the right-handed twist of beta-sheets. , 1983, Journal of molecular biology.

[26]  Russell L. Marsden,et al.  Progress of structural genomics initiatives: an analysis of solved target structures. , 2005, Journal of molecular biology.

[27]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[28]  Silvio C. E. Tosatto,et al.  Global and local model quality estimation at CASP8 using the scoring functions QMEAN and QMEANclust , 2009, Proteins.

[29]  Elena Papaleo,et al.  Validation of protein models by a neural network approach , 2008, BMC Bioinformatics.

[30]  Chris Sander,et al.  Objectively judging the quality of a protein structure from a Ramachandran plot , 1997, Comput. Appl. Biosci..

[31]  David A. Lee,et al.  GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains , 2009, Nucleic acids research.

[32]  C Venclovas,et al.  Comparison of performance in successive CASP experiments , 2001, Proteins.

[33]  Osvaldo Olmea,et al.  MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparison , 2002, Protein science : a publication of the Protein Society.

[34]  Jeffrey Skolnick,et al.  Protein model quality assessment prediction by combining fragment comparisons and a consensus Cα contact potential , 2008, Proteins.

[35]  Kevin Karplus,et al.  Applying Undertaker to quality assessment , 2009, Proteins.

[36]  Prasanna R Kolatkar,et al.  Assessment of CASP7 structure predictions for template free targets , 2007, Proteins.

[37]  A. Sali,et al.  Comparative protein structure modeling of genes and genomes. , 2000, Annual review of biophysics and biomolecular structure.

[38]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[39]  G J Kleywegt,et al.  Validation of protein crystal structures. , 2000, Acta crystallographica. Section D, Biological crystallography.

[40]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[41]  Anna Tramontano,et al.  Evaluation of CASP8 model quality predictions , 2009, Proteins.

[42]  Chin-Hsien Tai,et al.  Assessment of CASP6 predictions for new and nearly new fold targets , 2005, Proteins.

[43]  Ian Sillitoe,et al.  Use of structure comparison methods for the refinement of protein structure predictions. I. Identifying the structural family of a protein from low‐resolution models , 2002, Proteins.

[44]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[45]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[46]  Jianlin Cheng,et al.  Prediction of global and local quality of CASP8 models by MULTICOM series , 2009, Proteins.

[47]  Liam J. McGuffin Prediction of global and local model quality in CASP8 using the ModFOLD server , 2009, Proteins.

[48]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[49]  Frances M. G. Pearl,et al.  The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution , 2006, Nucleic Acids Res..

[50]  Oliver F. Lange,et al.  Structure prediction for CASP8 with all‐atom refinement using Rosetta , 2009, Proteins.

[51]  Liisa Holm,et al.  Advances and pitfalls of protein structural alignment. , 2009, Current opinion in structural biology.