CASP9 assessment of free modeling target predictions

The goal of the biennial CASP assessment of protein structure prediction is to identify and evaluate the current state of the art methods in the field. The template free modeling (FM) category generally aims to assess ab-initio methods that predict 3D structures from a given protein sequence without the explicit use of template structures available in the Protein Data Bank. The most significant development of methodology for de novo structure prediction from sequence was introduced over a decade ago in CASP3, with the assembly of tertiary structures from selected fragments.1 Fragment-based structure assembly methods have since been adopted and developed by a number of groups, whose de novo protein structure predictions tend to outperform in CASP evaluations of the FM category.2–5 Despite this relative success, the ‘‘protein-folding problem’’ has remained unsolved, with results of the previous CASP8 FM category evaluation suggesting much room for improvement in de novo structure prediction methodologies.6 The CASP9 FM assessment included evaluating prediction models, quantifying these evaluations in a meaningful way, and using these measurements to produce group rankings that were subject to various tests of significance. Newly developed and previously applied automated methods played a crucial role in completing the assessment. Our report outlines the resulting evaluation procedure, the logic behind its development, and the results of its application to CASP9 FM target predictions. Based on these results, we highlight the progress and pitfalls of both the top performing prediction servers and the fold prediction community as a whole. The assessment of the CASP9 FM category encompassed evaluations of 30 domains, which included 4 ‘‘server only’’ domains and 26 ‘‘human/ server’’ domains. For the FM category alone, 16,971 predictions had to be evaluated, making manual judgment of all predictions impossible

[1]  Kevin Karplus,et al.  SAM-T08, HMM-based protein structure prediction , 2009, Nucleic Acids Res..

[2]  Jimin Pei,et al.  Analysis of CASP8 targets, predictions and assessment methods , 2009, Database J. Biol. Databases Curation.

[3]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[4]  S. Brunak,et al.  Improved prediction of signal peptides: SignalP 3.0. , 2004, Journal of molecular biology.

[5]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[6]  Osvaldo Olmea,et al.  MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparison , 2002, Protein science : a publication of the Protein Society.

[7]  András Fiser,et al.  Molecular Biophysics , 2022 .

[8]  J. M. Sauder,et al.  Large‐scale comparison of protein sequence alignment algorithms with structure alignments , 2000, Proteins.

[9]  Liisa Holm,et al.  DaliLite workbench for protein structure comparison , 2000, Bioinform..

[10]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[11]  B. Rost,et al.  A modified definition of Sov, a segment‐based measure for protein secondary structure prediction assessment , 1999, Proteins.

[12]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[13]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[14]  N. Grishin,et al.  CASP9 target classification , 2011, Proteins.

[15]  Jaime Prilusky,et al.  Assessment of CASP8 structure predictions for template free targets , 2009, Proteins.

[16]  Yang Zhang,et al.  I‐TASSER: Fully automated protein structure prediction in CASP8 , 2009, Proteins.

[17]  Oliver F. Lange,et al.  Structure prediction for CASP8 with all‐atom refinement using Rosetta , 2009, Proteins.

[18]  K. Karplus SAM-T 08 , HMM-based Protein Structure Prediction , 2009 .

[19]  Yang Zhang,et al.  Template‐based modeling and free modeling by I‐TASSER in CASP7 , 2007, Proteins.

[20]  Lars Malmström,et al.  Structure prediction for CASP7 targets using extensive all‐atom refinement with Rosetta@home , 2007, Proteins.

[21]  Lisa N Kinch,et al.  CASP5 target classification , 2003, Proteins.

[22]  Lisa N Kinch,et al.  CASP5 assessment of fold recognition target predictions , 2003, Proteins.

[23]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[24]  C. Orengo,et al.  Analysis and assessment of ab initio three‐dimensional prediction, secondary structure, and contacts prediction , 1999, Proteins.

[25]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[26]  C Venclovas,et al.  Processing and analysis of CASP3 protein structure predictions , 1999, Proteins.