Contact‐assisted protein structure modeling by global optimization in CASP11

We have applied the conformational space annealing method to the contact‐assisted protein structure modeling in CASP11. For Tp targets, where predicted residue–residue contact information was provided, the contact energy term in the form of the Lorentzian function was implemented together with the physical energy terms used in our template‐free modeling of proteins. Although we observed some structural improvement of Tp models over the models predicted without the Tp information, the improvement was not substantial on average. This is partly due to the inaccuracy of the provided contact information, where only about 18% of it was correct. For Ts targets, where the information of ambiguous NOE (Nuclear Overhauser Effect) restraints was provided, we formulated the modeling in terms of the two‐tier optimization problem, which covers: (1) the assignment of NOE peaks and (2) the three‐dimensional (3D) model generation based on the assigned NOEs. Although solving the problem in a direct manner appears to be intractable at first glance, we demonstrate through CASP11 that remarkably accurate protein 3D modeling is possible by brute force optimization of a relevant energy function. For 19 Ts targets of the average size of 224 residues, generated protein models were of about 3.6 Å Cα atom accuracy. Even greater structural improvement was observed when additional Tc contact information was provided. For 20 out of the total 24 Tc targets, we were able to generate protein structures which were better than the best model from the rest of the CASP11 groups in terms of GDT‐TS. Proteins 2016; 84(Suppl 1):189–199. © 2015 Wiley Periodicals, Inc.

[1]  Ben M. Webb,et al.  Putting the Pieces Together: Integrative Modeling Platform Software for Structure Determination of Macromolecular Assemblies , 2012, PLoS biology.

[2]  Keehyoung Joo,et al.  High accuracy template based modeling by global optimization , 2007, Proteins.

[3]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[4]  Jooyoung Lee,et al.  Improved network community structure improves function prediction , 2013, Scientific Reports.

[5]  David Baker,et al.  De novo protein structure determination from near-atomic resolution cryo-EM maps , 2015, Nature Methods.

[6]  Björn Wallner,et al.  DockQ: A Quality Measure for Protein-Protein Docking Models , 2016, PloS one.

[7]  Oliver F. Lange,et al.  NMR Structure Determination for Larger Proteins Using Backbone-Only Data , 2010, Science.

[8]  Alexander D. MacKerell,et al.  Extending the treatment of backbone energetics in protein force fields: Limitations of gas‐phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations , 2004, J. Comput. Chem..

[9]  Jooyoung Lee,et al.  De novo protein structure prediction by dynamic fragment assembly and conformational space annealing , 2011, Proteins.

[10]  D. Svergun,et al.  Structural characterization of proteins and complexes using small-angle X-ray solution scattering. , 2010, Journal of structural biology.

[11]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[12]  Protein structure determination by conformational space annealing using NMR geometric restraints , 2015, Proteins.

[13]  Yang Zhang,et al.  Application of sparse NMR restraints to large-scale protein structure prediction. , 2004, Biophysical journal.

[14]  Debora S. Marks,et al.  Protein structure determination by combining sparse NMR data with evolutionary couplings , 2015, Nature Methods.

[15]  Keehyoung Joo,et al.  Protein structure modeling for CASP10 by multiple layers of global optimization , 2014, Proteins.

[16]  Oliver F. Lange,et al.  Determination of solution structures of proteins up to 40 kDa using CS-Rosetta with sparse NMR data from deuterated samples , 2012, Proceedings of the National Academy of Sciences.

[17]  David E. Kim,et al.  One contact for every twelve residues allows robust and accurate topology‐level protein structure modeling , 2014, Proteins.

[18]  Andrej Sali,et al.  FoXS, FoXSDock and MultiFoXS: Single-state and multi-state structural modeling of proteins and their complexes based on SAXS profiles , 2016, Nucleic Acids Res..

[19]  Keehyoung Joo,et al.  All‐atom chain‐building by optimizing MODELLER energy function using conformational space annealing , 2009, Proteins.

[20]  Keehyoung Joo,et al.  Template based protein structure modeling by global optimization in CASP11 , 2016, Proteins.

[21]  S. Rackovsky,et al.  Conformational analysis of the 20-residue membrane-bound portion of melittin by conformational space annealing. , 1998, Biopolymers.

[22]  David Baker,et al.  Accurate protein structure modeling using sparse NMR data and homologous structure information , 2012, Proceedings of the National Academy of Sciences.

[23]  Guillermo Sapiro,et al.  Protein secondary structure determination by constrained single-particle cryo-electron tomography. , 2012, Structure.

[24]  Jooyoung Lee,et al.  SVMQA: support‐vector‐machine‐based protein single‐model quality assessment , 2017, Bioinform..

[25]  Oliver F. Lange,et al.  Consistent blind protein structure generation from NMR chemical shift data , 2008, Proceedings of the National Academy of Sciences.

[26]  Kyoungrim Lee,et al.  Study of protein–protein interaction using conformational space annealing , 2005, Proteins.

[27]  Andrej Sali,et al.  Uncertainty in integrative structural modeling. , 2014, Current opinion in structural biology.

[28]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[29]  John A Tainer,et al.  Bridging the solution divide: comprehensive structural analyses of dynamic RNA, DNA, and protein assemblies by small-angle X-ray scattering. , 2010, Current opinion in structural biology.

[30]  Keehyoung Joo,et al.  Template‐free modeling by LEE and LEER in CASP11 , 2016, Proteins.

[31]  John A Tainer,et al.  Accurate SAXS profile computation and its assessment by contrast variation experiments. , 2013, Biophysical journal.

[32]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[33]  J. Skolnick,et al.  TOUCHSTONEX: Protein structure prediction with sparse NMR data , 2003, Proteins.

[34]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[35]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[36]  Julian Lee,et al.  Unbiased global optimization of Lennard-Jones clusters for N < or =201 using the conformational space annealing method. , 2003, Physical review letters.

[37]  R. Aebersold,et al.  Probing Native Protein Structures by Chemical Cross-linking, Mass Spectrometry, and Bioinformatics , 2010, Molecular & Cellular Proteomics.

[38]  Jooyoung Lee,et al.  LigDockCSA: Protein–ligand docking using conformational space annealing , 2011, J. Comput. Chem..

[39]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[40]  David T. Jones,et al.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins , 2014, Bioinform..

[41]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[42]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[43]  Yang Zhang,et al.  Improving protein template recognition by using small-angle x-ray scattering profiles. , 2011, Biophysical journal.

[44]  A. Liwo,et al.  Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Michele Vendruscolo,et al.  Protein structure determination from NMR chemical shifts , 2007, Proceedings of the National Academy of Sciences.

[46]  Keehyoung Joo,et al.  Improving physical realism, stereochemistry, and side‐chain accuracy in homology modeling: Four approaches that performed well in CASP8 , 2009, Proteins.

[47]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[48]  John A. Tainer,et al.  Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS) , 2009, Nature Methods.

[49]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[50]  Jooyoung Lee,et al.  New optimization method for conformational energy calculations on polypeptides: Conformational space annealing , 1997, J. Comput. Chem..

[51]  Marco Biasini,et al.  lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests , 2013, Bioinform..

[52]  Friedrich Förster,et al.  Integration of small-angle X-ray scattering data into structural modeling of proteins and their assemblies. , 2008, Journal of molecular biology.

[53]  Z. Weng,et al.  ZDOCK: An initial‐stage protein‐docking algorithm , 2003, Proteins.

[54]  Chris G de Koster,et al.  Chemical cross-linking and mass spectrometry for protein structural modeling. , 2003, Journal of molecular biology.

[55]  Thomas A. Hopf,et al.  Protein structure prediction from sequence variation , 2012, Nature Biotechnology.