Using Wavelet Analysis To Assist in Identification of Significant Events in Molecular Dynamics Simulations

Long time scale molecular dynamics (MD) simulations of biological systems are becoming increasingly commonplace due to the availability of both large-scale computational resources and significant advances in the underlying simulation methodologies. Therefore, it is useful to investigate and develop data mining and analysis techniques to quickly and efficiently extract the biologically relevant information from the incredible amount of generated data. Wavelet analysis (WA) is a technique that can quickly reveal significant motions during an MD simulation. Here, the application of WA on well-converged long time scale (tens of μs) simulations of a DNA helix is described. We show how WA combined with a simple clustering method can be used to identify both the physical and temporal locations of events with significant motion in MD trajectories. We also show that WA can not only distinguish and quantify the locations and time scales of significant motions, but by changing the maximum time scale of WA a more complete characterization of these motions can be obtained. This allows motions of different time scales to be identified or ignored as desired.

[1]  E. Dougherty,et al.  NONLINEAR PROBIT GENE CLASSIFICATION USING MUTUAL INFORMATION AND WAVELET-BASED FEATURE SELECTION , 2004 .

[2]  Valerie J. Gillet,et al.  Compression of Molecular Interaction Fields Using Wavelet Thumbnails: Application to Molecular Alignment , 2012, J. Chem. Inf. Model..

[3]  H. Al‐Hashimi,et al.  Extending the range of microsecond-to-millisecond chemical exchange detected in labeled and unlabeled nucleic acids by selective carbon R(1rho) NMR spectroscopy. , 2009, Journal of the American Chemical Society.

[4]  Mikhail A Iakhiaev,et al.  Mapping the intramolecular signal propagation pathways in protein using Bayesian change point analysis of atomic motions , 2013, Comput. Biol. Chem..

[5]  Daniel R. Roe,et al.  Evaluation of Enhanced Sampling Provided by Accelerated Molecular Dynamics with Hamiltonian Replica Exchange Methods , 2014, The journal of physical chemistry. B.

[6]  Herschel Rabitz,et al.  Wavelet transform for analysis of molecular dynamics , 1996 .

[7]  Gerald Schaefer,et al.  A hybrid biometric approach embedding DNA data in fingerprint images , 2014, 2014 International Conference on Informatics, Electronics & Vision (ICIEV).

[8]  C. Torrence,et al.  A Practical Guide to Wavelet Analysis. , 1998 .

[9]  George N Frantziskonis,et al.  Wavelet-based spatial and temporal multiscaling: Bridging the atomistic and continuum space and time scales , 2003 .

[10]  Guandong Xu,et al.  Tumor tissue identification based on gene expression data using DWT feature extraction and PNN classifier , 2006, Neurocomputing.

[11]  J. Moe,et al.  Kinetics and energetics of base-pair opening in 5'-d(CGCGAATTCGCG)-3' and a substituted dodecamer containing G.T mismatches. , 1992, Biochemistry.

[12]  Huajun Chen,et al.  A Wavelet Approach for the Analysis of Folding Trajectory of Protein Trp-cage , 2005, J. Bioinform. Comput. Biol..

[13]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[14]  Valerie Daggett,et al.  Dynameomics: Large‐scale assessment of native protein flexibility , 2008, Protein science : a publication of the Protein Society.

[15]  Valerie Daggett,et al.  Dynameomics: design of a computational lab workflow and scientific data repository for protein simulations. , 2008, Protein engineering, design & selection : PEDS.

[16]  Daniel R. Roe,et al.  On the absence of intra-helical DNA dynamics on the µs to ms timescale , 2014, Nature Communications.

[17]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[18]  R.S.H. Istepanian,et al.  Microarray image enhancement by denoising using stationary wavelet transform , 2003, IEEE Transactions on NanoBioscience.

[19]  Kresten Lindorff-Larsen,et al.  Interaction Networks in Protein Folding via Atomic-Resolution Experiments and Long-Time-Scale Molecular Dynamics Simulations , 2015, Journal of the American Chemical Society.

[20]  M. Karplus,et al.  Molecular dynamics and protein function. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Stefan Senger,et al.  Wavelet Approximation of GRID Fields: Application to Quantitative Structure‐Activity Relationships , 2010, Molecular informatics.

[22]  P. Vandergheynst,et al.  Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. , 2000, Journal of theoretical biology.

[23]  S. Mallat A wavelet tour of signal processing , 1998 .

[24]  A Kitao,et al.  Harmonic and anharmonic aspects in the dynamics of BPTI: A normal mode analysis and principal component analysis , 1994, Protein science : a publication of the Protein Society.

[25]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[26]  Jahan B. Ghasemi,et al.  Toward a continuous wavelet transform-based search method for feature selection for classification of spectroscopic data , 2013 .

[27]  Yibing Shan,et al.  Molecular basis for pseudokinase-dependent autoinhibition of JAK2 tyrosine kinase , 2014, Nature Structural &Molecular Biology.

[28]  Daniel R Roe,et al.  PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. , 2013, Journal of chemical theory and computation.

[29]  Huafeng Xu,et al.  Key mutations stabilize antigen-binding conformation during affinity maturation of a broadly neutralizing influenza antibody lineage , 2015, Proteins.

[30]  T. Cheatham,et al.  Convergence and reproducibility in molecular dynamics simulations of the DNA duplex d(GCACGAACGAACGAACGC). , 2015, Biochimica et biophysica acta.

[31]  Pierre Moulin,et al.  Information-theoretic analysis of interscale and intrascale dependencies between image wavelet coefficients , 2001, IEEE Trans. Image Process..

[32]  John L. Klepeis,et al.  A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[33]  Valerie Daggett,et al.  A comparison of multiscale methods for the analysis of molecular dynamics simulations. , 2012, The journal of physical chemistry. B.

[34]  Pietro Liò,et al.  Wavelets in bioinformatics and computational biology: state of art and perspectives , 2003, Bioinform..

[35]  R Dustin Schaeffer,et al.  Dynameomics: a comprehensive database of protein dynamics. , 2010, Structure.

[36]  Valerie Daggett,et al.  Dynameomics: a multi-dimensional analysis-optimized database for dynamic protein data. , 2008, Protein engineering, design & selection : PEDS.

[37]  J. Straub,et al.  Spatio-temporal hierarchy in the dynamics of a minimalist protein model. , 2013, The Journal of chemical physics.

[38]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[39]  M. Eleftheriou,et al.  Single-mutation-induced stability loss in protein lysozyme. , 2007, Biochemical Society transactions.

[40]  L. Marky,et al.  Looking for Waldo: A Potential Thermodynamic Signature to DNA Damage , 2014, Accounts of chemical research.

[41]  M. Pagliai,et al.  Wavelet Transform for Spectroscopic Analysis: Application to Diols in Water. , 2011, Journal of chemical theory and computation.

[42]  M. Takata,et al.  Analysis of motion features for molecular dynamics simulation of proteins , 2011 .

[43]  R.S.H. Istepanian,et al.  Application of wavelet modulus maxima in microarray spots recognition , 2003, IEEE Transactions on NanoBioscience.

[44]  Jiuzhou Z. Song,et al.  The Wavelet-Based Cluster Analysis for Temporal Gene Expression Data , 2007, EURASIP J. Bioinform. Syst. Biol..

[45]  Maria Dulce Quelhas,et al.  Wavelet analysis of human DNA. , 2011, Genomics.

[46]  Adrian E. Roitberg,et al.  Multidimensional Replica Exchange Molecular Dynamics Yields a Converged Ensemble of an RNA Tetranucleotide , 2013, Journal of chemical theory and computation.

[47]  Noah C. Benson,et al.  Wavelet Analysis of protein Motion , 2012, Int. J. Wavelets Multiresolution Inf. Process..

[48]  Hiromi Nakai,et al.  Wavelet transform analysis of ab initio molecular dynamics simulation: Application to core‐excitation dynamics of BF3 , 2007, J. Comput. Chem..