Structural features that predict real‐value fluctuations of globular proteins

It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B‐factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. Proteins 2012; © 2012 Wiley Periodicals, Inc.

[1]  P. Debye,et al.  Interferenz von Röntgenstrahlen und Wärmebewegung , 1913 .

[2]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[3]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[4]  M. Karplus,et al.  Harmonic dynamics of proteins: normal modes and fluctuations in bovine pancreatic trypsin inhibitor. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[5]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[6]  A M Lesk,et al.  Interior and surface of monomeric proteins. , 1987, Journal of molecular biology.

[7]  Jean-Claude Spehner,et al.  Fast and robust computation of molecular surfaces , 1995, SCG '95.

[8]  Tirion,et al.  Large Amplitude Elastic Motions in Proteins from a Single-Parameter, Atomic Analysis. , 1996, Physical review letters.

[9]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[10]  A. Liwo,et al.  A united‐residue force field for off‐lattice protein‐structure simulations. I. Functional forms and parameters of long‐range side‐chain interaction potentials from protein crystal data , 1997 .

[11]  B. Erman,et al.  Efficient characterization of collective motions and interresidue correlations in proteins by low-resolution simulations. , 1997, Biochemistry.

[12]  I. Bahar,et al.  Gaussian Dynamics of Folded Proteins , 1997 .

[13]  Adam Liwo,et al.  A united-residue force field for off-lattice protein-structure simulations. I. Functional forms and parameters of long-range side-chain interaction potentials from protein crystal data , 1997, J. Comput. Chem..

[14]  R. Varadarajan,et al.  Residue depth: a novel parameter for the analysis of protein structure and stability. , 1999, Structure.

[15]  Michael Nilges,et al.  Molecular dynamics and accuracy of NMR structures: Effects of error bounds and data removal , 1999, Proteins.

[16]  Sebastian Doniach,et al.  Protein flexibility in solution and in crystals , 1999 .

[17]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[18]  Rieko Ishima,et al.  Protein dynamics from NMR , 2000, Nature Structural Biology.

[19]  Christopher J. Oldfield,et al.  Intrinsically disordered protein. , 2001, Journal of molecular graphics & modelling.

[20]  J. Skolnick,et al.  TOUCHSTONE: An ab initio protein structure prediction method that uses threading-based tertiary restraints , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[21]  B. Halle,et al.  Flexibility and packing in proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Daisuke Kihara,et al.  Ab initio protein structure prediction on a genomic scale: Application to the Mycoplasma genitalium genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  G. Phillips,et al.  Dynamics of proteins in crystals: comparison of experiment with simple models. , 2002, Biophysical journal.

[24]  J. Mccammon,et al.  Changes in flexibility upon binding: Application of the self-consistent pair contact probability method to protein-protein interactions , 2002 .

[25]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[26]  Zukang Feng,et al.  The Protein Data Bank and structural genomics , 2003, Nucleic Acids Res..

[27]  Daisuke Kihara,et al.  Microbial genomes have over 72% structure assignment by the threading algorithm PROSPECTOR_Q , 2004, Proteins.

[28]  R. Hilgenfeld,et al.  Utility of homology models in the drug discovery process , 2004, Drug Discovery Today.

[29]  Daisuke Takaya,et al.  Protein structure prediction in structure based drug design. , 2004, Current medicinal chemistry.

[30]  A. Maritan,et al.  Accurate and efficient description of protein vibrational dynamics: Comparing molecular dynamics and Gaussian models , 2004, Proteins.

[31]  A. Kolinski Protein modeling and structure prediction with a reduced representation. , 2004, Acta biochimica Polonica.

[32]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[33]  Yaoqi Zhou,et al.  Protein flexibility prediction by an all‐atom mean‐field statistical theory , 2005, Protein science : a publication of the Protein Society.

[34]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[35]  T. Hamelryck An amino acid has two sides: A new 2D measure provides a different view of solvent exposure , 2005, Proteins.

[36]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[37]  Russell L. Marsden,et al.  Progress of structural genomics initiatives: an analysis of solved target structures. , 2005, Journal of molecular biology.

[38]  B. Rost,et al.  Protein flexibility and rigidity predicted from sequence , 2005, Proteins.

[39]  Steven E Brenner,et al.  The Impact of Structural Genomics: Expectations and Outcomes , 2005, Science.

[40]  Marc A. Martí-Renom,et al.  MODBASE: a database of annotated comparative protein structure models and associated resources , 2005, Nucleic Acids Res..

[41]  M. Habeck,et al.  Error distribution derived NOE distance restraints , 2006, Proteins.

[42]  Philip E. Bourne,et al.  Wiggle—Predicting Functionally Flexible Regions from Primary Sequence , 2006, PLoS Comput. Biol..

[43]  G. Phillips,et al.  Optimization and evaluation of a coarse-grained model of protein motion using x-ray crystal data. , 2006, Biophysical journal.

[44]  Charles A Laughton,et al.  Essential Dynamics:  A Tool for Efficient Trajectory Compression and Management. , 2006, Journal of chemical theory and computation.

[45]  A simple way to compute protein dynamics without a mechanical model , 2007, Proteins.

[46]  A. Kolinski,et al.  Characterization of protein-folding pathways by reduced-space modeling , 2007, Proceedings of the National Academy of Sciences.

[47]  J. Skolnick,et al.  Benchmarking of TASSER in the ab initio limit , 2007, Proteins: Structure, Function, and Bioinformatics.

[48]  Hau-San Wong,et al.  Prediction of protein B-factors using multi-class bounded SVM. , 2007, Protein and peptide letters.

[49]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[50]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[51]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[52]  Shao-Wei Huang,et al.  Deriving protein dynamical properties from weighted protein contact number , 2008, Proteins.

[53]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[54]  Andrzej Kolinski,et al.  Folding pathway of the b1 domain of protein G explored by multiscale modeling. , 2007, Biophysical journal.

[55]  L. Kay,et al.  NMR spectroscopy brings invisible protein states into focus. , 2009, Nature chemical biology.

[56]  Guang Song,et al.  Generalized spring tensor models for protein fluctuation dynamics and conformation changes , 2010, 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop.

[57]  K. Teilum,et al.  Functional aspects of protein flexibility , 2009, Cellular and Molecular Life Sciences.

[58]  Tanja Kortemme,et al.  Backbone flexibility in computational protein design. , 2009, Current opinion in biotechnology.

[59]  Guang Song,et al.  Protein elastic network models and the ranges of cooperativity , 2009, Proceedings of the National Academy of Sciences.

[60]  Hiroshi Wako,et al.  Prediction of protein motions from amino acid sequence and its application to protein-protein interaction , 2010, BMC Structural Biology.

[61]  Fabrizio Chiti,et al.  Amyloid formation by globular proteins under native conditions. , 2009, Nature chemical biology.

[62]  Daisuke Kihara,et al.  Potential for Protein Surface Shape Analysis Using Spherical Harmonics and 3D Zernike Descriptors , 2009, Cell Biochemistry and Biophysics.

[63]  Andrzej Kloczkowski,et al.  Distance matrix-based approach to protein structure prediction , 2009, Journal of Structural and Functional Genomics.

[64]  Lukasz Kurgan,et al.  On the relation between residue flexibility and local solvent accessibility in proteins , 2009, Proteins.

[65]  J. K. Lassila,et al.  Conformational diversity and computational enzyme design. , 2010, Current opinion in chemical biology.

[66]  Martin Zacharias,et al.  Accounting for conformational changes during protein-protein docking. , 2010, Current opinion in structural biology.

[67]  Andrzej Kolinski,et al.  TRACER. A new approach to comparative modeling that combines threading with free-space conformational sampling. , 2010, Acta biochimica Polonica.

[68]  Modesto Orozco,et al.  MoDEL (Molecular Dynamics Extended Library): a database of atomistic molecular dynamics trajectories. , 2010, Structure.

[69]  Andrzej Kolinski,et al.  Simulation of Chaperonin Effect on Protein Folding: A Shift from Nucleation–Condensation to Framework Mechanism , 2011, Journal of the American Chemical Society.

[70]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[71]  A. Liwo,et al.  PDZ binding to the BAR domain of PICK1 is elucidated by coarse-grained molecular dynamics. , 2011, Journal of molecular biology.

[72]  S. Benkovic,et al.  Flexibility, diversity, and cooperativity: pillars of enzyme catalysis. , 2011, Biochemistry.

[73]  Catherine Etchebest,et al.  Predicting protein flexibility through the prediction of local structures , 2011, Proteins.

[74]  Daisuke Kihara,et al.  Effect of using suboptimal alignments in template‐based protein structure prediction , 2011, Proteins.

[75]  M. Lill Efficient incorporation of protein flexibility and dynamics into molecular docking simulations. , 2011, Biochemistry.

[76]  J. Bujnicki,et al.  Computational methods for prediction of protein-RNA interactions. , 2012, Journal of structural biology.

[77]  V. Uversky Intrinsically Disordered Proteins , 2014 .