Statistical Measures to Quantify Similarity between Molecular Dynamics Simulation Trajectories

Molecular dynamics simulation is commonly employed to explore protein dynamics. Despite the disparate timescales between functional mechanisms and molecular dynamics (MD) trajectories, functional differences are often inferred from differences in conformational ensembles between two proteins in structure-function studies that investigate the effect of mutations. A common measure to quantify differences in dynamics is the root mean square fluctuation (RMSF) about the average position of residues defined by Cα-atoms. Using six MD trajectories describing three native/mutant pairs of beta-lactamase, we make comparisons with additional measures that include Jensen-Shannon, modifications of Kullback-Leibler divergence, and local p-values from 1-sample Kolmogorov-Smirnov tests. These additional measures require knowing a probability density function, which we estimate by using a nonparametric maximum entropy method that quantifies rare events well. The same measures are applied to distance fluctuations between Cα-atom pairs. Results from several implementations for quantitative comparison of a pair of MD trajectories are made based on fluctuations for on-residue and residue-residue local dynamics. We conclude that there is almost always a statistically significant difference between pairs of 100 ns all-atom simulations on moderate-sized proteins as evident from extraordinarily low p-values.

[1]  L. Rice,et al.  Extended-Spectrum β-Lactamases in Klebsiella pneumoniae Bloodstream Isolates from Seven Countries: Dominance and Widespread Prevalence of SHV- and CTX-M-Type β-Lactamases , 2003, Antimicrobial Agents and Chemotherapy.

[2]  D. C. Rapaport,et al.  The Art of Molecular Dynamics Simulation , 1997 .

[3]  J. B. Jones,et al.  Structure-based design guides the improved efficacy of deacylation transition state analogue inhibitors of TEM-1 beta-Lactamase(,). , 2000, Biochemistry.

[4]  C L Emery,et al.  Detection and clinical significance of extended-spectrum beta-lactamases in a tertiary-care medical center , 1997, Journal of clinical microbiology.

[5]  Daniel M Zuckerman,et al.  Ensemble-based convergence analysis of biomolecular trajectories. , 2006, Biophysical journal.

[6]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[7]  S. Nosé A molecular dynamics method for simulations in the canonical ensemble , 1984 .

[8]  Brian K Shoichet,et al.  The Structural Bases of Antibiotic Resistance in the Clinically Derived Mutant β-Lactamases TEM-30, TEM-32, and TEM-34* , 2002, The Journal of Biological Chemistry.

[9]  E. Abraham,et al.  An Enzyme from Bacteria able to Destroy Penicillin , 1940, Nature.

[10]  J. B. Jones,et al.  Structure-Based Design Guides the Improved Efficacy of Deacylation Transition State Analogue Inhibitors of TEM-1 â-Lactamase , 2022 .

[11]  Rafael Brüschweiler,et al.  Efficient RMSD measures for the comparison of two molecular ensembles , 2002, Proteins.

[12]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[13]  K. Svoboda,et al.  Fluctuation analysis of motor protein movement and single enzyme kinetics. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Donald J. Jacobs,et al.  JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories , 2017, BMC Bioinformatics.

[15]  Pál Ormos,et al.  Dynamic fluctuation of proteins watched in real time , 2008, HFSP journal.

[16]  Herbert A. David,et al.  Order Statistics , 2011, International Encyclopedia of Statistical Science.

[17]  Ivano Bertini,et al.  Experimentally exploring the conformational space sampled by domain reorientation in calmodulin. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Shan Yang,et al.  Measuring Similarity Between Dynamic Ensembles of Biomolecules , 2014, Nature Methods.

[19]  Gregory S. Chirikjian,et al.  Quantitative Comparison of Conformational Ensembles , 2012, Entropy.

[20]  J Skolnick,et al.  Universal similarity measure for comparing protein structures. , 2001, Biopolymers.

[21]  Kristine M Hujer,et al.  Extended-spectrum beta-lactamases in Klebsiella pneumoniae bloodstream isolates from seven countries: dominance and widespread prevalence of SHV- and CTX-M-type beta-lactamases. , 2003, Antimicrobial agents and chemotherapy.

[22]  D. Jacobs,et al.  High throughput nonparametric probability density estimation , 2018, PloS one.

[23]  Jeremy C. Smith,et al.  The dynamics of single protein molecules is non-equilibrium and self-similar over thirteen decades in time , 2015, Nature Physics.

[24]  Kresten Lindorff-Larsen,et al.  Similarity Measures for Protein Ensembles , 2009, PloS one.

[25]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[26]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[27]  Martin Karplus,et al.  Molecular dynamics simulations of biomolecules. , 2002, Nature structural biology.

[28]  Michele Vendruscolo,et al.  Rare fluctuations of native proteins sampled by equilibrium hydrogen exchange. , 2003, Journal of the American Chemical Society.

[29]  Tod D Romo,et al.  Block Covariance Overlap Method and Convergence in Molecular Dynamics Simulation. , 2011, Journal of chemical theory and computation.

[30]  Stefano Piana,et al.  Demonstrating an Order-of-Magnitude Sampling Enhancement in Molecular Dynamics Simulations of Complex Protein Systems. , 2016, Journal of chemical theory and computation.

[31]  Jon E. Ness,et al.  Predicting the emergence of antibiotic resistance by directed evolution and structural analysis , 2001, Nature Structural Biology.

[32]  Mitsuhiko Ikura,et al.  Calcium-induced conformational transition revealed by the solution structure of apo calmodulin , 1995, Nature Structural Biology.

[33]  Bernhard Knapp,et al.  Is an Intuitive Convergence Definition of Molecular Dynamics Simulations Solely Based on the Root Mean Square Deviation Possible? , 2011, J. Comput. Biol..

[34]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[35]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[36]  Gerhard Wolber,et al.  The impact of molecular dynamics on drug design: applications for the characterization of ligand-macromolecule complexes. , 2015, Drug discovery today.

[37]  Lorna J. Smith,et al.  Assessing equilibration and convergence in biomolecular simulations , 2002, Proteins.

[38]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[39]  M. Kamal,et al.  Antibiotic resistance and extended spectrum beta-lactamases: Types, epidemiology and treatment. , 2015, Saudi journal of biological sciences.

[40]  Donald J. Jacobs,et al.  Best Probability Density Function for Random Sampled Data , 2009, Entropy.

[41]  Lucas Sawle,et al.  Convergence of Molecular Dynamics Simulation of Protein Native States: Feasibility vs Self-Consistency Dilemma. , 2016, Journal of chemical theory and computation.

[42]  B. Hess Convergence of sampling in protein simulations. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  J. P. Grossman,et al.  Biomolecular simulation: a computational microscope for molecular biology. , 2012, Annual review of biophysics.

[44]  M. Karplus,et al.  Dynamics of folded proteins , 1977, Nature.

[45]  Aleksandr V. Smirnov,et al.  Watching a Protein as it Functions with 150-ps Time-Resolved X-ray Crystallography , 2003, Science.

[46]  Kresten Lindorff-Larsen,et al.  ENCORE: Software for Quantitative Ensemble Comparison , 2015, PLoS Comput. Biol..

[47]  Shigeyuki Yokoyama,et al.  NMR snapshots of a fluctuating protein structure: ubiquitin at 30 bar-3 kbar. , 2005, Journal of molecular biology.

[48]  T. Hitchens,et al.  Ligand-induced changes in the structure and dynamics of a human class Mu glutathione S-transferase. , 2000, Biochemistry.

[49]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[50]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[51]  Chun Wu,et al.  Convergence of replica exchange molecular dynamics. , 2005, The Journal of chemical physics.

[52]  A. Amadei,et al.  On the convergence of the conformational coordinates basis set obtained by the essential dynamics analysis of proteins' molecular dynamics simulations , 1999, Proteins.

[53]  A. Cooper,et al.  Thermodynamic fluctuations in protein molecules. , 1976, Proceedings of the National Academy of Sciences of the United States of America.

[54]  D. A. Bosco,et al.  Enzyme Dynamics During Catalysis , 2002, Science.

[55]  D E Wemmer,et al.  Two-state allosteric behavior in a single-domain signaling protein. , 2001, Science.

[56]  G. Wolber,et al.  More than a look into a crystal ball: protein structure elucidation guided by molecular dynamics simulations. , 2016, Drug discovery today.