GalaxyTBM: template-based modeling by building a reliable core and refining unreliable local regions

BackgroundProtein structures can be reliably predicted by template-based modeling (TBM) when experimental structures of homologous proteins are available. However, it is challenging to obtain structures more accurate than the single best templates by either combining information from multiple templates or by modeling regions that vary among templates or are not covered by any templates.ResultsWe introduce GalaxyTBM, a new TBM method in which the more reliable core region is modeled first from multiple templates and less reliable, variable local regions, such as loops or termini, are then detected and re-modeled by an ab initio method. This TBM method is based on “Seok-server,” which was tested in CASP9 and assessed to be amongst the top TBM servers. The accuracy of the initial core modeling is enhanced by focusing on more conserved regions in the multiple-template selection and multiple sequence alignment stages. Additional improvement is achieved by ab initio modeling of up to 3 unreliable local regions in the fixed framework of the core structure. Overall, GalaxyTBM reproduced the performance of Seok-server, with GalaxyTBM and Seok-server resulting in average GDT-TS of 68.1 and 68.4, respectively, when tested on 68 single-domain CASP9 TBM targets. For application to multi-domain proteins, GalaxyTBM must be combined with domain-splitting methods.ConclusionApplication of GalaxyTBM to CASP9 targets demonstrates that accurate protein structure prediction is possible by use of a multiple-template-based approach, and ab initio modeling of variable regions can further enhance the model quality.

[1]  Yang Zhang,et al.  I-TASSER server for protein 3D structure prediction , 2008, BMC Bioinformatics.

[2]  Liisa Holm,et al.  DaliLite workbench for protein structure comparison , 2000, Bioinform..

[3]  Č. Venclovas,et al.  Comparative modeling in CASP6 using consensus approach to template selection, sequence‐structure alignment, and structure assessment , 2005, Proteins.

[4]  Xin Deng,et al.  Recursive protein modeling: A divide and conquer strategy for protein structure prediction and its case study in CASP9 , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[5]  Charlotte M. Deane,et al.  Exploring the potential of template-based modelling , 2010, Bioinform..

[6]  Keehyoung Joo,et al.  All‐atom chain‐building by optimizing MODELLER energy function using conformational space annealing , 2009, Proteins.

[7]  Jooyoung Lee,et al.  New optimization method for conformational energy calculations on polypeptides: Conformational space annealing , 1997, J. Comput. Chem..

[8]  S. Rackovsky,et al.  Conformational analysis of the 20-residue membrane-bound portion of melittin by conformational space annealing. , 1998, Biopolymers.

[9]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[10]  C Venclovas,et al.  Processing and analysis of CASP3 protein structure predictions , 1999, Proteins.

[11]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[12]  N. Grishin,et al.  PROMALS3D: a tool for multiple protein sequence and structure alignments , 2008, Nucleic acids research.

[13]  Yaoqi Zhou,et al.  Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all‐atom statistical energy functions , 2008, Protein science : a publication of the Protein Society.

[14]  Jianlin Cheng A multi-template combination algorithm for protein comparative modeling , 2008, BMC Structural Biology.

[15]  K. Dill,et al.  Resultants and Loop Closure , 2006 .

[16]  Chaok Seok,et al.  Refinement of unreliable local regions in template‐based protein models , 2012, Proteins.

[17]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[18]  Keehyoung Joo,et al.  Refinement of protein termini in template‐based modeling using conformational space annealing , 2011, Proteins.

[19]  Arne Elofsson,et al.  Using multiple templates to improve quality of homology models in automated homology modeling , 2008, Protein science : a publication of the Protein Society.

[20]  Johannes Söding,et al.  Fast and accurate automatic structure prediction with HHpred , 2009, Proteins.

[21]  A. Sali,et al.  Comparative protein structure modeling of genes and genomes. , 2000, Annual review of biophysics and biomolecular structure.

[22]  Chaok Seok,et al.  Protein loop modeling by using fragment assembly and analytical loop closure , 2010, Proteins.

[23]  Jian Peng,et al.  Boosting Protein Threading Accuracy , 2009, RECOMB.

[24]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..

[25]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[26]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[27]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[28]  Jooyoung Lee,et al.  LigDockCSA: Protein–ligand docking using conformational space annealing , 2011, J. Comput. Chem..

[29]  A. Liwo,et al.  Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Torsten Schwede,et al.  Assessment of template based protein structure predictions in CASP9 , 2011, Proteins.

[31]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[32]  András Fiser,et al.  Comparative protein structure modeling by combining multiple templates and optimizing sequence-to-structure alignments , 2007, Bioinform..

[33]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[34]  Yaoqi Zhou,et al.  Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates , 2011, Bioinform..

[35]  Yang Zhang,et al.  How significant is a protein structure similarity with TM-score = 0.5? , 2010, Bioinform..