MULTICOM: a multi-level combination approach to protein structure prediction and its assessments in CASP8

Motivation: Protein structure prediction is one of the most important problems in structural bioinformatics. Here we describe MULTICOM, a multi-level combination approach to improve the various steps in protein structure prediction. In contrast to those methods which look for the best templates, alignments and models, our approach tries to combine complementary and alternative templates, alignments and models to achieve on average better accuracy. Results: The multi-level combination approach was implemented via five automated protein structure prediction servers and one human predictor which participated in the eighth Critical Assessment of Techniques for Protein Structure Prediction (CASP8), 2008. The MULTICOM servers and human predictor were consistently ranked among the top predictors on the CASP8 benchmark. The methods can predict moderate- to high-resolution models for most template-based targets and low-resolution models for some template-free targets. The results show that the multi-level combination of complementary templates, alternative alignments and similar models aided by model quality assessment can systematically improve both template-based and template-free protein modeling. Availability: The MULTICOM server is freely available at http://casp.rnet.missouri.edu/multicom_3d.html Contact: chengji@missouri.edu

[1]  T. Hubbard,et al.  Critical assessment of methods of protein structure prediction (CASP): Round III , 1999, Proteins.

[2]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[3]  Krzysztof Fidelis,et al.  Progress from CASP6 to CASP7 , 2007, Proteins.

[4]  Gianluca Pollastri,et al.  Beyond the Twilight Zone: Automated prediction of structural properties of proteins by recursive neural networks and remote homology information , 2009, Proteins.

[5]  Arne Elofsson,et al.  MaxSub: an automated measure for the assessment of protein structure prediction quality , 2000, Bioinform..

[6]  Janet M. Thornton,et al.  Prediction of protein structure from amino acid sequence , 1978, Nature.

[7]  R. Service,et al.  Structural Genomics, Round 2 , 2005, Science.

[8]  Jaime Prilusky,et al.  Assessment of CASP8 structure predictions for template free targets , 2009, Proteins.

[9]  M J Sternberg,et al.  Prediction of protein structure from amino acid sequence. , 1978, Nature.

[10]  Burkhard Rost,et al.  Evaluation of template‐based models in CASP8 with standard measures , 2009, Proteins.

[11]  Kimmen Sjölander,et al.  SATCHMO: Sequence Alignment and Tree Construction Using Hidden Markov Models , 2003, Bioinform..

[12]  J. Jung,et al.  Protein structure prediction. , 2001, Current opinion in chemical biology.

[13]  Ceslovas Venclovas,et al.  Progress over the first decade of CASP experiments , 2005, Proteins.

[14]  Jianlin Cheng A multi-template combination algorithm for protein comparative modeling , 2008, BMC Structural Biology.

[15]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[16]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[17]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[18]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[19]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[20]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[21]  Arne Elofsson,et al.  Profile–profile methods provide improved fold‐recognition: A study of different profile–profile alignment methods , 2004, Proteins.

[22]  Jason J. Corso,et al.  On-line hierarchy of general linear models for selecting and ranking the best predicted protein structures , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[23]  Christopher J. Williams,et al.  The other 90% of the protein: Assessment beyond the Cαs for CASP8 template‐based and high‐accuracy models , 2009, Proteins.

[24]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[25]  C Venclovas,et al.  Processing and analysis of CASP3 protein structure predictions , 1999, Proteins.

[26]  Yang Zhang,et al.  I‐TASSER: Fully automated protein structure prediction in CASP8 , 2009, Proteins.

[27]  Vladislav Yu Orekhov,et al.  Removal of a time barrier for high-resolution multidimensional NMR spectroscopy , 2006, Nature Methods.

[28]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[29]  Jianlin Cheng,et al.  Evaluating the absolute quality of a single protein model using structural features and support vector machines , 2009, Proteins.

[30]  Johannes Söding,et al.  Fast and accurate automatic structure prediction with HHpred , 2009, Proteins.

[31]  Jeffrey Skolnick,et al.  Protein structure prediction by pro-Sp3-TASSER. , 2009, Biophysical journal.

[32]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[33]  Jinbo Xu,et al.  Template‐based and free modeling by RAPTOR++ in CASP8 , 2009, Proteins.

[34]  Česlovas Venclovas,et al.  The use of automatic tools and human expertise in template‐based modeling of CASP8 target proteins , 2009, Proteins.

[35]  Pierre Baldi,et al.  SCRATCH: a protein structure and structural feature prediction server , 2005, Nucleic Acids Res..

[36]  Jeffrey Skolnick,et al.  Performance of the Pro‐sp3‐TASSER server in CASP8 , 2009, Proteins.

[37]  Charles Vuylsteke Methods and implementation , 1988 .

[38]  A. Sali,et al.  Modeller: generation and refinement of homology-based protein structure models. , 2003, Methods in enzymology.

[39]  Huan-Xiang Zhou,et al.  Nonadditive effects of mixed crowding on protein stability , 2009, Proteins.

[40]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[41]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[42]  Yaoqi Zhou,et al.  SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures. , 2005, Bioinformatics.

[43]  R. Schulz,et al.  Protein Structure Prediction , 2020, Methods in Molecular Biology.

[44]  Jianlin Cheng,et al.  Prediction of global and local quality of CASP8 models by MULTICOM series , 2009, Proteins.

[45]  Pierre Baldi,et al.  A machine learning information retrieval approach to protein fold recognition. , 2006, Bioinformatics.

[46]  Krzysztof Fidelis,et al.  Protein structure prediction center in CASP8 , 2009, Proteins.

[47]  Yang Zhang,et al.  SPICKER: A clustering approach to identify near‐native protein folds , 2004, J. Comput. Chem..

[48]  N. Grishin,et al.  COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. , 2003, Journal of molecular biology.

[49]  E. Lattman,et al.  The state of the Protein Structure Initiative , 2004, Proteins.

[50]  Johannes Söding,et al.  The HHpred interactive server for protein homology detection and structure prediction , 2005, Nucleic Acids Res..

[51]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Krzysztof Fidelis,et al.  CASP8 results in context of previous experiments , 2009, Proteins.