Analyzing large‐scale structural change in proteins: Comparison of principal component projection and sammon mapping

Effective analysis of large‐scale conformational transitions in macromolecules requires transforming them into a lower dimensional representation that captures the dominant motions. Herein, we apply and compare two different dimensionality reduction techniques, namely, principal component analysis (PCA), a linear method, and Sammon mapping, which is nonlinear. The two methods are used to analyze four different protein transition pathways of varying complexity, obtained by using either the conjugate peak refinement method or constrained molecular dynamics. For the return‐stroke in myosin, both Sammon mapping and PCA show that the conformational change is dominated by a simple rotation of a rigid body. Also, in the case of the T→R transition in hemoglobin, both methods are able to identify the two main quaternary transition events. In contrast, in the cases of the unfolding transition of staphylococcal nuclease or the signaling switch of Ras p21, which are both more complex conformational transitions, only Sammon mapping is able to identify the distinct phases of motion. Proteins 2006. © 2006 Wiley‐Liss, Inc.

[1]  J. Evanseck,et al.  Functional Significance of Hierarchical Tiers in Carbonmonoxy Myoglobin: Conformational Substates and Transitions Studied by Conformational Flooding Simulations , 2000 .

[2]  Motonori Ota,et al.  Phylogeny of protein-folding trajectories reveals a unique pathway to native structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. Karplus,et al.  Locally accessible conformations of proteins: Multiple molecular dynamics simulations of crambin , 1998, Protein science : a publication of the Protein Society.

[4]  A Kitao,et al.  Harmonic and anharmonic aspects in the dynamics of BPTI: A normal mode analysis and principal component analysis , 1994, Protein science : a publication of the Protein Society.

[5]  D. Lowy,et al.  Function and regulation of ras. , 1993, Annual review of biochemistry.

[6]  Stefan Fischer,et al.  Molecular dynamics simulation reveals a surface salt bridge forming a kinetic trap in unfolding of truncated Staphylococcal nuclease , 2003, Proteins.

[7]  M Karplus,et al.  Molecular switch in signal transduction: reaction paths of the conformational changes in ras p21. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[8]  N Go,et al.  Refinement of protein dynamic structure: normal mode refinement. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[9]  D. Shortle,et al.  Correlation between changes in nuclear magnetic resonance order parameters and conformational entropy: Molecular dynamics simulations of native and denatured staphylococcal nuclease , 2000, Proteins.

[10]  T. Metzger,et al.  Zero-strain GaAs quantum dot molecules as investigated by x-ray diffuse scattering , 2006 .

[11]  R. Abseher,et al.  Are there non-trivial dynamic cross-correlations in proteins? , 1998, Journal of molecular biology.

[12]  W R Taylor,et al.  Distance geometry based comparative modelling. , 1997, Folding & design.

[13]  M. Karplus,et al.  Conjugate peak refinement: an algorithm for finding reaction paths and accurate transition states in systems with many degrees of freedom , 1992 .

[14]  E. Taylor,et al.  Mechanism of adenosine triphosphate hydrolysis by actomyosin. , 1971, Biochemistry.

[15]  H. Berendsen,et al.  Essential dynamics of the cellular retinol-binding protein--evidence for ligand-induced conformational changes. , 1995, Protein engineering.

[16]  S. Whitten,et al.  pH dependence of stability of staphylococcal nuclease: evidence of substantial electrostatic interactions in the denatured state. , 2000, Biochemistry.

[17]  D. Shortle,et al.  A dynamic bundle of four adjacent hydrophobic segments in the denatured state of staphylococcal nuclease , 1996, Protein science : a publication of the Protein Society.

[18]  H J Berendsen,et al.  Toward an exhaustive sampling of the configurational spaces of the two forms of the peptide hormone guanylin. , 1996, Journal of biomolecular structure & dynamics.

[19]  M. Karplus,et al.  Method for estimating the configurational entropy of macromolecules , 1981 .

[20]  Ronald M. Levy,et al.  Motions of an α‐helical polypeptide: Comparison of molecular and harmonic dynamics , 1990 .

[21]  J. Michael Cherry,et al.  Visualization of expression clusters using Sammon's non-linear mapping , 2001, Bioinform..

[22]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[23]  H M Holden,et al.  X-ray structures of the myosin motor domain of Dictyostelium discoideum complexed with MgADP.BeFx and MgADP.AlF4-. , 1995, Biochemistry.

[24]  B. Zhou,et al.  Conformational features of a truncated staphylococcal nuclease R (SNR135) and their implications for catalysis. , 1998, Archives of biochemistry and biophysics.

[25]  B. Hess Convergence of sampling in protein simulations. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  H. Berendsen,et al.  The consistency of large concerted motions in proteins in molecular dynamics simulations. , 1996, Biophysical journal.

[27]  H. Berendsen,et al.  Systematic analysis of domain motions in proteins from conformational change: New results on citrate synthase and T4 lysozyme , 1998, Proteins.

[28]  D. Engelman,et al.  Truncated staphylococcal nuclease is compact but disordered. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[29]  W R Taylor,et al.  Homology modelling by distance geometry. , 1996, Folding & design.

[30]  M. Ermácora,et al.  Mapping the structure of a non-native state of staphylococcal nuclease , 1996, Nature Structural Biology.

[31]  N. Go,et al.  Effect of solvent on collective motions in globular protein. , 1993, Journal of molecular biology.

[32]  H. Berendsen,et al.  An extended sampling of the configurational space of HPr from E. coli , 1996, Proteins.

[33]  A. Wittinghofer,et al.  Ras‐effector interactions, the problem of specificity , 1995, FEBS letters.

[34]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[35]  Frank Noé,et al.  Automated computation of low‐energy pathways for complex rearrangements in proteins: Application to the conformational switch of Ras p21 , 2005, Proteins.

[36]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[37]  Grubmüller,et al.  Predicting slow structural transitions in macromolecular systems: Conformational flooding. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[38]  C. Brooks,et al.  Statistical clustering techniques for the analysis of long molecular dynamics trajectories: analysis of 2.2-ns trajectories of YPGDV. , 1993, Biochemistry.

[39]  Andrea Amadei,et al.  A comparison of techniques for calculating protein essential dynamics , 1997, J. Comput. Chem..

[40]  Steven T. Whitten and,et al.  pH dependence of stability of staphylococcal nuclease: evidence of substantial electrostatic interactions in the denatured state. , 2000 .

[41]  C Chothia,et al.  Haemoglobin: the structural changes related to ligand binding and its allosteric mechanism. , 1979, Journal of molecular biology.

[42]  M. Karplus,et al.  Collective motions in proteins: A covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations , 1991, Proteins.

[43]  D. Shortle,et al.  Backbone dynamics of a highly disordered 131 residue fragment of staphylococcal nuclease. , 1994, Journal of molecular biology.

[44]  Lennart Nilsson,et al.  Thermal unfolding simulations of a multimeric protein—Transition state and unfolding pathways , 2005, Proteins.

[45]  K C Holmes,et al.  Structural mechanism of muscle contraction. , 1999, Annual review of biochemistry.

[46]  H J Berendsen,et al.  An efficient method for sampling the essential subspace of proteins. , 1996, Journal of biomolecular structure & dynamics.

[47]  S H Kim,et al.  Molecular switch for signal transduction: structural differences between active and inactive forms of protooncogenic ras proteins. , 1992, Science.

[48]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[49]  D. Shortle,et al.  Residual structure in large fragments of staphylococcal nuclease: effects of amino acid substitutions. , 1989, Biochemistry.

[50]  A. Fersht Nucleation mechanisms in protein folding. , 1997, Current opinion in structural biology.

[51]  C A Smith,et al.  X-ray structure of the magnesium(II).ADP.vanadate complex of the Dictyostelium discoideum myosin motor domain to 1.9 A resolution. , 1996, Biochemistry.

[52]  A. Lesk,et al.  Structural mechanisms for domain movements in proteins. , 1994, Biochemistry.

[53]  García,et al.  Large-amplitude nonlinear motions in proteins. , 1992, Physical review letters.

[54]  W G Richards,et al.  A novel representation of protein structure. , 1995, Journal of molecular graphics.

[55]  Stefan Fischer,et al.  Structural mechanism of the recovery stroke in the myosin molecular motor. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[56]  D K Agrafiotis,et al.  A new method for analyzing protein sequence relationships based on Sammon maps , 1997, Protein science : a publication of the Protein Society.

[57]  H. Berendsen,et al.  Model‐free methods of analyzing domain motions in proteins from simulation: A comparison of normal mode analysis and molecular dynamics simulation of lysozyme , 1997, Proteins.

[58]  M. Barbacid ras genes. , 1987, Annual review of biochemistry.

[59]  E. Pai,et al.  The structure of Ras protein: a model for a universal molecular switch. , 1991, Trends in biochemical sciences.

[60]  Quantitative visualization of a macromolecular potential energy “funnel” , 1997 .

[61]  Wojciech Szpankowski,et al.  Indexing and mapping of proteins using a modified nonlinear Sammon projection , 1999, J. Comput. Chem..

[62]  Jeremy C. Smith,et al.  Principal components of the protein dynamical transition. , 2003, Physical review letters.