An Online Approach for Mining Collective Behaviors from Molecular Dynamics Simulations

Collective behavior involving distally separate regions in a protein is known to widely affect its function. In this article, we present an online approach to study and characterize collective behavior in proteins as molecular dynamics (MD) simulations progress. Our representation of MD simulations as a stream of continuously evolving data allows us to succinctly capture spatial and temporal dependencies that may exist and analyze them efficiently using data mining techniques. By using tensor analysis we identify (a) collective motions (i.e., dynamic couplings) and (b) time-points during the simulation where the collective motions suddenly change. We demonstrate the applicability of this method on two different protein simulations for barnase and cyclophilin A. We characterize the collective motions in these proteins using our method and analyze sudden changes in these motions. Taken together, our results indicate that tensor analysis is well suited to extracting information from MD trajectories in an online fashion.

[1]  L. Kay,et al.  Intrinsic dynamics of an enzyme underlies catalysis , 2005, Nature.

[2]  J. Leeuw,et al.  Principal component analysis of three-mode data by means of alternating least squares algorithms , 1980 .

[3]  N Pattabiraman,et al.  Use of 3D QSAR methodology for data mining the National Cancer Institute Repository of Small Molecules: application to HIV-1 reverse transcriptase inhibition. , 1998, Methods.

[4]  A. Fersht,et al.  Protein folding and stability: the pathway of folding of barnase , 1993 .

[5]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[6]  L. Serrano,et al.  Quantifying information transfer by protein domains: Analysis of the Fyn SH2 domain structure , 2008, BMC Structural Biology.

[7]  Martin Billeter,et al.  Essential domain motions in barnase revealed by MD simulations , 2002, Proteins.

[8]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.

[9]  A. Atilgan,et al.  Vibrational Dynamics of Folded Proteins: Significance of Slow and Fast Motions in Relation to Function and Stability , 1998 .

[10]  M Go,et al.  Protein anatomy: functional roles of barnase module. , 1993, The Journal of biological chemistry.

[11]  N Go,et al.  Collective variable description of native protein dynamics. , 1995, Annual review of physical chemistry.

[12]  A. Geist,et al.  Protein dynamics and enzymatic catalysis: investigating the peptidyl-prolyl cis-trans isomerization activity of cyclophilin A. , 2004, Biochemistry.

[13]  Gürol M. Süel,et al.  Evolutionarily conserved networks of residues mediate allosteric communication in proteins , 2003, Nature Structural Biology.

[14]  M. Karplus,et al.  Method for estimating the configurational entropy of macromolecules , 1981 .

[15]  Bülent Yener,et al.  Multiway modeling and analysis in stem cell systems biology , 2008, BMC Systems Biology.

[16]  H. Berendsen,et al.  Collective protein dynamics in relation to function. , 2000, Current opinion in structural biology.

[17]  Tamara G. Kolda,et al.  Efficient MATLAB Computations with Sparse and Factored Tensors , 2007, SIAM J. Sci. Comput..

[18]  Oliver F. Lange,et al.  Full correlation analysis of conformational protein dynamics , 2007, Proteins.

[19]  P. Agarwal,et al.  Network of coupled promoting motions in enzyme catalysis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Rasmus Bro,et al.  Multiway analysis of epilepsy tensors , 2007, ISMB/ECCB.

[21]  Martin Billeter,et al.  Multiway decomposition of NMR spectra with coupled evolution periods. , 2005, Journal of the American Chemical Society.

[22]  D. Jacobs,et al.  Protein flexibility predictions using graph theory , 2001, Proteins.

[23]  John L. Klepeis,et al.  Anton, a special-purpose machine for molecular dynamics simulation , 2007, ISCA '07.

[24]  Tamara G. Kolda,et al.  Categories and Subject Descriptors: G.4 [Mathematics of Computing]: Mathematical Software— , 2022 .

[25]  Valerie Daggett,et al.  The present view of the mechanism of protein folding , 2003, Nature Reviews Molecular Cell Biology.

[26]  M. Thorpe,et al.  Protein flexibility using constraints from molecular dynamics simulations , 2005, Physical biology.

[27]  P. Agarwal Cis/trans isomerization in HIV‐1 capsid protein catalyzed by cyclophilin A: Insights from computational and theoretical studies , 2004, Proteins.

[28]  Dmitry M Korzhnev,et al.  Propagation of dynamic changes in barnase upon binding of barstar: an NMR and computational study. , 2007, Journal of molecular biology.

[29]  Jianyin Shao,et al.  Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms. , 2007, Journal of chemical theory and computation.

[30]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[31]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..