Principal component and normal mode analysis of proteins; a quantitative comparison using the GroEL subunit

Principal component analysis (PCA) and normal mode analysis (NMA) have emerged as two invaluable tools for studying conformational changes in proteins. To compare these approaches for studying protein dynamics, we have used a subunit of the GroEL chaperone, whose dynamics is well characterized. We first show that both PCA on trajectories from molecular dynamics (MD) simulations and NMA reveal a general dynamical behavior in agreement with what has previously been described for GroEL. We thus compare the reproducibility of PCA on independent MD runs and subsequently investigate the influence of the length of the MD simulations. We show that there is a relatively poor one‐to‐one correspondence between eigenvectors obtained from two independent runs and conclude that caution should be taken when analyzing principal components individually. We also observe that increasing the simulation length does not improve the agreement with the experimental structural difference. In fact, relatively short MD simulations are sufficient for this purpose. We observe a rapid convergence of the eigenvectors (after ca. 6 ns). Although there is not always a clear one‐to‐one correspondence, there is a qualitatively good agreement between the movements described by the first five modes obtained with the three different approaches; PCA, all‐atoms NMA, and coarse‐grained NMA. It is particularly interesting to relate this to the computational cost of the three methods. The results we obtain on the GroEL subunit contribute to the generalization of robust and reproducible strategies for the study of protein dynamics, using either NMA or PCA of trajectories from MD simulations. Proteins 2010. © 2010 Wiley‐Liss, Inc.

[1]  M. Karplus,et al.  Collective motions in proteins: A covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations , 1991, Proteins.

[2]  D. J. Naylor,et al.  Proteome-wide Analysis of Chaperonin-Dependent Protein Folding in Escherichia coli , 2005, Cell.

[3]  K. Furtak,et al.  Folding in vivo of bacterial cytoplasmic proteins: Role of GroEL , 1993, Cell.

[4]  John Mongan,et al.  Interactive essential dynamics , 2004, J. Comput. Aided Mol. Des..

[5]  Zbyszek Otwinowski,et al.  The 2.4 Å crystal structure of the bacterial chaperonin GroEL complexed with ATPγS , 1996, Nature Structural Biology.

[6]  Lee-Wei Yang,et al.  Coarse-Grained Models Reveal Functional Dynamics - I. Elastic Network Models – Theories, Comparisons and Perspectives , 2008, Bioinformatics and biology insights.

[7]  K. Schulten,et al.  Principal Component Analysis and Long Time Protein Dynamics , 1996 .

[8]  Jianpeng Ma,et al.  Usefulness and limitations of normal mode analysis in modeling dynamics of biomolecular complexes. , 2005, Structure.

[9]  B. Tidor Molecular dynamics simulations , 1997, Current Biology.

[10]  A. Carriquiry,et al.  Close correspondence between the motions from principal component analysis of multiple HIV-1 protease structures and elastic network modes. , 2008, Structure.

[11]  David J. Osguthorpe,et al.  Low Frequency Motion in Proteins , 1999 .

[12]  B M Pettitt,et al.  A sampling problem in molecular dynamics simulations of macromolecules. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[13]  P. Chacón,et al.  Thorough validation of protein normal mode analysis: a comparative study with essential dynamics. , 2007, Structure.

[14]  David J. Osguthorpe,et al.  Low Frequency Motion in Proteins Comparison of Normal Mode and Molecular Dynamics of Streptomyces Griseus Protease A , 1999 .

[15]  Junmei Wang,et al.  How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? , 2000, J. Comput. Chem..

[16]  L. Loeb,et al.  Prokaryotic DNA polymerase I: evolution, structure, and "base flipping" mechanism for nucleotide selection. , 2001, Journal of molecular biology.

[17]  K. Hinsen,et al.  Harmonicity in slow protein dynamics , 2000 .

[18]  A. Amadei,et al.  On the convergence of the conformational coordinates basis set obtained by the essential dynamics analysis of proteins' molecular dynamics simulations , 1999, Proteins.

[19]  García,et al.  Large-amplitude nonlinear motions in proteins. , 1992, Physical review letters.

[20]  Cecilia Bartolucci,et al.  Crystal structure of wild-type chaperonin GroEL. , 2005, Journal of molecular biology.

[21]  K. Hinsen,et al.  Projection Methods for the Analysis of Complex Motions in Macromolecules , 2000 .

[22]  Qiang Cui,et al.  Interpreting correlated motions using normal mode analysis. , 2006, Structure.

[23]  R. Vale,et al.  The way things move: looking under the hood of molecular motor proteins. , 2000, Science.

[24]  Walid A Houry,et al.  In Vivo Observation of Polypeptide Flux through the Bacterial Chaperonin System , 1997, Cell.

[25]  D Perahia,et al.  Motions in hemoglobin studied by normal mode analysis and energy minimization: evidence for the existence of tertiary T-like, quaternary R-like intermediate structures. , 1996, Journal of molecular biology.

[26]  Mark A. Wilson,et al.  Intrinsic motions along an enzymatic reaction trajectory , 2007, Nature.

[27]  Konrad Hinsen,et al.  The molecular modeling toolkit: A new approach to molecular simulations , 2000, J. Comput. Chem..

[28]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[29]  Jianpeng Ma,et al.  A normal mode analysis of structural plasticity in the biomolecular motor F(1)-ATPase. , 2004, Journal of molecular biology.

[30]  C. Georgopoulos,et al.  The groES and groEL heat shock gene products of Escherichia coli are essential for bacterial growth at all temperatures , 1989, Journal of bacteriology.

[31]  Dmitrij Frishman,et al.  Identification of in vivo substrates of the chaperonin GroEL , 1999, Nature.

[32]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[33]  A. Horwich,et al.  The crystal structure of the asymmetric GroEL–GroES–(ADP)7 chaperonin complex , 1997, Nature.

[34]  M. Karplus,et al.  Locally accessible conformations of proteins: Multiple molecular dynamics simulations of crambin , 1998, Protein science : a publication of the Protein Society.

[35]  Axel T Brunger,et al.  Exploring the structural dynamics of the E.coli chaperonin GroEL using translation-libration-screw crystallographic refinement of intermediate states. , 2004, Journal of molecular biology.

[36]  R. Jernigan,et al.  Anisotropy of fluctuation dynamics of proteins with an elastic network model. , 2001, Biophysical journal.

[37]  Y. Sanejouand,et al.  A new approach for determining low‐frequency normal modes in macromolecules , 1994 .

[38]  H. Berendsen,et al.  Model‐free methods of analyzing domain motions in proteins from simulation: A comparison of normal mode analysis and molecular dynamics simulation of lysozyme , 1997, Proteins.

[39]  H. Berendsen,et al.  Domain motions in bacteriophage T4 lysozyme: A comparison between molecular dynamics and crystallographic data , 1998, Proteins.

[40]  Adam W Van Wynsberghe,et al.  Comparison of mode analyses at different resolutions applied to nucleic acid systems. , 2005, Biophysical journal.

[41]  J A McCammon,et al.  Analysis of a 10-ns molecular dynamics simulation of mouse acetylcholinesterase. , 2001, Biophysical journal.

[42]  Y. Sanejouand,et al.  Building‐block approach for determining low‐frequency normal modes of macromolecules , 2000, Proteins.

[43]  Masasuke Yoshida,et al.  Mechanically driven ATP synthesis by F1-ATPase , 2004, Nature.

[44]  David Pérahia,et al.  Computation of Low-frequency Normal Modes in Macromolecules: Improvements to the Method of Diagonalization in a Mixed Basis and Application to Hemoglobin , 1995, Comput. Chem..

[45]  Adam W Van Wynsberghe,et al.  Protein structural variation in computational models and crystallographic data. , 2006, Structure.

[46]  Neil A Ranson,et al.  The chaperonin folding machine. , 2002, Trends in biochemical sciences.

[47]  N. Go,et al.  Dynamics of a small globular protein in terms of low-frequency vibrational modes. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Andrea Amadei,et al.  A comparison of techniques for calculating protein essential dynamics , 1997, J. Comput. Chem..

[49]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[50]  Lars Skjærven,et al.  Normal mode analysis for proteins , 2009 .

[51]  A. R. Srinivasan,et al.  Quasi‐harmonic method for studying very low frequency modes in proteins , 1984, Biopolymers.

[52]  M. Karplus,et al.  Harmonic dynamics of proteins: normal modes and fluctuations in bovine pancreatic trypsin inhibitor. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[54]  N. Go,et al.  Harmonicity and anharmonicity in protein dynamics: A normal mode analysis and principal component analysis , 1995, Proteins.

[55]  M Karplus,et al.  The allosteric mechanism of the chaperonin GroEL: a dynamic analysis. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[56]  I. Bahar,et al.  Coarse-grained normal mode analysis in structural biology. , 2005, Current opinion in structural biology.

[57]  Y. Sanejouand,et al.  Hinge‐bending motion in citrate synthase arising from normal mode calculations , 1995, Proteins.

[58]  K. Hinsen Analysis of domain motions by approximate normal mode calculations , 1998, Proteins.

[59]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[60]  A. Atilgan,et al.  Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. , 1997, Folding & design.

[61]  Yong Duan,et al.  Distinguish protein decoys by Using a scoring function based on a new AMBER force field, short molecular dynamics simulations, and the generalized born solvent model , 2004, Proteins.

[62]  D. Boisvert,et al.  The 2.4 A crystal structure of the bacterial chaperonin GroEL complexed with ATP gamma S. , 1996, Nature structural biology.

[63]  H. Berendsen,et al.  A comparison of techniques for calculating protein essential dynamics , 1997 .

[64]  Fumio Hirata,et al.  The effects of solvent on the conformation and the collective motions of protein: normal mode analysis and molecular dynamics simulations of melittin in water and in vacuum , 1991 .

[65]  L. Mouawad,et al.  Diagonalization in a mixed basis: A method to compute low‐frequency normal modes for large macromolecules , 1993 .

[66]  D. Thirumalai,et al.  Allosteric transitions in the chaperonin GroEL are captured by a dominant normal mode that is most robust to sequence variations. , 2007, Biophysical journal.

[67]  Taner Z Sen,et al.  The Extent of Cooperativity of Protein Motions Observed with Elastic Network Models Is Similar for Atomic and Coarser-Grained Models. , 2006, Journal of chemical theory and computation.

[68]  Oliver F. Lange,et al.  Can principal components yield a dimension reduced description of protein dynamics on long time scales? , 2006, The journal of physical chemistry. B.

[69]  Zbyszek Otwinowski,et al.  The crystal structure of the bacterial chaperonln GroEL at 2.8 Å , 1994, Nature.

[70]  Nathan A. Baker,et al.  Molecular dynamics simulation of the Escherichia coli NikR protein: equilibrium conformational fluctuations reveal interdomain allosteric communication pathways. , 2008, Journal of molecular biology.