Protein structure modeling for CASP10 by multiple layers of global optimization

In the template‐based modeling (TBM) category of CASP10 experiment, we introduced a new protocol called protein modeling system (PMS) to generate accurate protein structures in terms of side‐chains as well as backbone trace. In the new protocol, a global optimization algorithm, called conformational space annealing (CSA), is applied to the three layers of TBM procedure: multiple sequence‐structure alignment, 3D chain building, and side‐chain re‐modeling. For 3D chain building, we developed a new energy function which includes new distance restraint terms of Lorentzian type (derived from multiple templates), and new energy terms that combine (physical) energy terms such as dynamic fragment assembly (DFA) energy, DFIRE statistical potential energy, hydrogen bonding term, etc. These physical energy terms are expected to guide the structure modeling especially for loop regions where no template structures are available. In addition, we developed a new quality assessment method based on random forest machine learning algorithm to screen templates, multiple alignments, and final models. For TBM targets of CASP10, we find that, due to the combination of three stages of CSA global optimizations and quality assessment, the modeling accuracy of PMS improves at each additional stage of the protocol. It is especially noteworthy that the side‐chains of the final PMS models are far more accurate than the models in the intermediate steps. Proteins 2014; 82(Suppl 2):188–195. © 2013 Wiley Periodicals, Inc.

[1]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[2]  Steven P Gross,et al.  Modularity optimization by conformational space annealing. , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Ruben Abagyan,et al.  Homology Modeling: Methods and Protocols , 2023, Methods in Molecular Biology.

[4]  Keehyoung Joo,et al.  All‐atom chain‐building by optimizing MODELLER energy function using conformational space annealing , 2009, Proteins.

[5]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[6]  Jooyoung Lee,et al.  New optimization method for conformational energy calculations on polypeptides: Conformational space annealing , 1997, J. Comput. Chem..

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Keehyoung Joo,et al.  High accuracy template based modeling by global optimization , 2007, Proteins.

[9]  Keehyoung Joo,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS SANN: Solvent accessibility prediction of proteins , 2022 .

[10]  Jooyoung Lee,et al.  De novo protein structure prediction by dynamic fragment assembly and conformational space annealing , 2011, Proteins.

[11]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[12]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[13]  Keehyoung Joo,et al.  Multiple sequence alignment by conformational space annealing. , 2008, Biophysical journal.

[14]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[15]  Liisa Holm,et al.  COFFEE: an objective function for multiple sequence alignments , 1998, Bioinform..

[16]  Adrian A Canutescu,et al.  A graph‐theory algorithm for rapid protein side‐chain prediction , 2003, Protein science : a publication of the Protein Society.

[17]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[18]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[19]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.