Using Tensor Analysis to characterize Contact-map Dynamics of Proteins

Molecular dynamics simulations provide vast amount of information about a protein’s dynamics. To interpret a protein’s dynamics and how it may relate to its function, traditionally, two-way analysis techniques such as principal component analysis have been used. However, two-way analysis techniques are usually limited by the fact that they have to be done post-process, i.e., after the simulations have been run and also cannot provide insights into temporal behavior of a protein. To overcome these limitations, we are proposing to use multi-way analysis techniques to understand and interpret protein dynamics as and when the simulations are progressing i.e., online. We model MD simulations in terms of a collection of contact maps and then modeling them as tensors to capture multiple dependencies. Using two recently developed techniques to perform online analysis of streaming data, we illustrate the use of this technique to describe and interpret the behavior of a protein complex in real time. We provide both experimental evidence to support our claims and also discuss the potential advantages and disadvantages of using tensor analysis techniques.

[1]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[2]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[3]  Rasmus Bro,et al.  Multiway analysis of epilepsy tensors , 2007, ISMB/ECCB.

[4]  Martin Billeter,et al.  Robust and versatile interpretation of spectra with coupled evolution periods using multi‐way decomposition , 2006, Magnetic resonance in chemistry : MRC.

[5]  Laxmikant V. Kale,et al.  Algorithmic Challenges in Computational Molecular Biophysics , 1999 .

[6]  Berend Smit,et al.  Understanding molecular simulation: from algorithms to applications , 1996 .

[7]  E. Acar,et al.  Seizure Recognition on Epilepsy Feature Tensor , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[8]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[9]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[10]  Sibsankar Kundu,et al.  Automatic domain decomposition of proteins by a Gaussian Network Model , 2004, Proteins.

[11]  D. A. Bosco,et al.  Catalysis of cis/trans isomerization in native HIV-1 capsid by human cyclophilin A , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[12]  I. Jolliffe Principal Component Analysis , 2002 .

[13]  H. Berendsen,et al.  A comparison of techniques for calculating protein essential dynamics , 1997 .

[14]  D. A. Bosco,et al.  Enzyme Dynamics During Catalysis , 2002, Science.

[15]  Philip S. Yu,et al.  Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams , 2006, Sixth International Conference on Data Mining (ICDM'06).

[16]  Gordon M. Crippen,et al.  Distance Geometry and Molecular Conformation , 1988 .

[17]  M. Karplus,et al.  Method for estimating the configurational entropy of macromolecules , 1981 .

[18]  K. Dill,et al.  Origins of structure in globular proteins. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[19]  M. Karplus,et al.  Kinetics of protein folding. A lattice model study of the requirements for folding to the native state. , 1994, Journal of molecular biology.

[20]  P. Agarwal Enzymes: An integrated view of structure, dynamics and function , 2006, Microbial cell factories.

[21]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[22]  Mohammed J. Zaki,et al.  TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data , 2005, SIGMOD '05.

[23]  P. Agarwal Role of protein dynamics in reaction rate enhancement by enzymes. , 2005, Journal of the American Chemical Society.

[24]  Philip S. Yu,et al.  Tensor Analysis on Multi-aspect Streams , 2007 .

[25]  Steffen Lang Protein domain decomposition using spectral graph partitioning , 2007 .

[26]  N Go,et al.  Collective variable description of native protein dynamics. , 1995, Annual review of physical chemistry.

[27]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[28]  Jimeng Sun,et al.  Beyond streams and graphs: dynamic tensor analysis , 2006, KDD '06.

[29]  M. Karplus,et al.  Kinetics of protein folding , 1995, Nature.

[30]  W. Sundquist,et al.  Structural insights into the catalytic mechanism of cyclophilin A , 2003, Nature Structural Biology.

[31]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[32]  Jimeng Sun,et al.  Distributed Pattern Discovery in Multiple Streams , 2006, PAKDD.

[33]  M Vendruscolo,et al.  Efficient dynamics in the space of contact maps. , 1998, Folding & design.

[34]  S. Vishveshwara,et al.  A network representation of protein structures: implications for protein stability. , 2005, Biophysical journal.

[35]  Eytan Domany,et al.  Protein fold recognition and dynamics in the space of contact maps , 1996, Proteins.

[36]  Martin Billeter,et al.  Multiway decomposition of NMR spectra with coupled evolution periods. , 2005, Journal of the American Chemical Society.

[37]  Ying Xu,et al.  Protein domain decomposition using a graph-theoretic approach , 2000, Bioinform..

[38]  Rafael Brüschweiler,et al.  Identification of slow correlated motions in proteins using residual dipolar and hydrogen-bond scalar couplings. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[39]  David E. Booth,et al.  Multi-Way Analysis: Applications in the Chemical Sciences , 2005, Technometrics.

[40]  C. Chennubhotla,et al.  Insights into equilibrium dynamics of proteins from comparison of NMR and X-ray data with computational predictions. , 2007, Structure.

[41]  A. Geist,et al.  Protein dynamics and enzymatic catalysis: investigating the peptidyl-prolyl cis-trans isomerization activity of cyclophilin A. , 2004, Biochemistry.