Learning generative models of molecular dynamics

We introduce three algorithms for learning generative models of molecular structures from molecular dynamics simulations. The first algorithm learns a Bayesian-optimal undirected probabilistic model over user-specified covariates (e.g., fluctuations, distances, angles, etc). L1 reg-ularization is used to ensure sparse models and thus reduce the risk of over-fitting the data. The topology of the resulting model reveals important couplings between different parts of the protein, thus aiding in the analysis of molecular motions. The generative nature of the model makes it well-suited to making predictions about the global effects of local structural changes (e.g., the binding of an allosteric regulator). Additionally, the model can be used to sample new conformations. The second algorithm learns a time-varying graphical model where the topology and parameters change smoothly along the trajectory, revealing the conformational sub-states. The last algorithm learns a Markov Chain over undirected graphical models which can be used to study and simulate kinetics. We demonstrate our algorithms on multiple molecular dynamics trajectories.

[1]  Eric P. Xing,et al.  Free Energy Estimates of All-Atom Protein Structures Using Generalized Belief Propagation , 2007, RECOMB.

[2]  Arvind Ramanathan,et al.  On-the-Fly Identification of Conformational Substates from Molecular Dynamics Simulations. , 2011, Journal of chemical theory and computation.

[3]  Arvind Ramanathan,et al.  An Online Approach for Mining Collective Behaviors from Molecular Dynamics Simulations , 2009, RECOMB.

[4]  A. Fersht,et al.  Protein folding and unfolding in microseconds to nanoseconds by experiment and simulation. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Berg,et al.  Molecular dynamics simulations of biomolecules , 2002, Nature Structural Biology.

[6]  Stephen P. Boyd,et al.  Determinant Maximization with Linear Matrix Inequality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[7]  Hetunandan Kamisetty,et al.  The von Mises Graphical Model: Regularized Structure and Parameter Learning (CMU-CS-11-129/CMU-CB-11-101) , 2011 .

[8]  L. Kay,et al.  Intrinsic dynamics of an enzyme underlies catalysis , 2005, Nature.

[9]  Sivaraman Balakrishnan,et al.  Learning generative models for protein fold families , 2011, Proteins.

[10]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[11]  C. Petropoulos,et al.  Loss of Asparagine-Linked Glycosylation Sites in Variable Region 5 of Human Immunodeficiency Virus Type 1 Envelope Is Associated with Resistance to CD4 Antibody Ibalizumab , 2011, Journal of Virology.

[12]  D. Kuritzkes,et al.  Safety, Pharmacokinetics, and Antiretroviral Activity of Multiple Doses of Ibalizumab (formerly TNX-355), an Anti-CD4 Monoclonal Antibody, in Human Immunodeficiency Virus Type 1-Infected Adults , 2008, Antimicrobial Agents and Chemotherapy.

[13]  Alexandre d'Aspremont,et al.  Model Selection Through Sparse Max Likelihood Estimation Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data , 2022 .

[14]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[15]  H. Berendsen,et al.  Collective protein dynamics in relation to function. , 2000, Current opinion in structural biology.

[16]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[17]  Martin Zacharias,et al.  Efficient evaluation of sampling quality of molecular dynamics simulations by clustering of dihedral torsion angles and Sammon mapping , 2009, J. Comput. Chem..

[18]  G. Tell,et al.  Missense mutations of human homeoboxes: A review , 2001, Human mutation.

[19]  A. Fersht,et al.  The denatured state of Engrailed Homeodomain under denaturing and native conditions. , 2003, Journal of molecular biology.

[20]  Klaus Schulten,et al.  Accelerating Molecular Modeling Applications with GPU Computing , 2009 .

[21]  Jianyin Shao,et al.  Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms. , 2007, Journal of chemical theory and computation.

[22]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[23]  Kurt Wüthrich,et al.  Homeodomain-DNA recognition , 1994, Cell.

[24]  D. Leitner Energy flow in proteins. , 2008, Annual review of physical chemistry.

[25]  H. Frauenfelder,et al.  Conformational substates in proteins. , 1988, Annual review of biophysics and biophysical chemistry.

[26]  Hans Frauenfelder,et al.  Temperature-dependent X-ray diffraction as a probe of protein structural dynamics , 1979, Nature.

[27]  Federico D. Sacerdoti,et al.  Scalable Algorithms for Molecular Dynamics Simulations on Commodity Clusters , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[28]  D. Kern,et al.  Hidden alternate structures of proline isomerase essential for catalysis , 2010 .

[29]  Oliver F. Lange,et al.  Full correlation analysis of conformational protein dynamics , 2007, Proteins.

[30]  W. Gehring,et al.  Homeodomain proteins. , 1994, Annual review of biochemistry.

[31]  M. Karplus,et al.  Method for estimating the configurational entropy of macromolecules , 1981 .

[32]  A. R. Srinivasan,et al.  Quasi‐harmonic method for studying very low frequency modes in proteins , 1984, Biopolymers.

[33]  John L. Klepeis,et al.  Anton, a special-purpose machine for molecular dynamics simulation , 2007, ISCA '07.

[34]  D. A. Bosco,et al.  Enzyme Dynamics During Catalysis , 2002, Science.

[35]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[36]  C. Langmead,et al.  Accounting for conformational entropy in predicting binding free energies of protein‐protein interactions , 2011, Proteins.

[37]  X. Daura,et al.  Folding–unfolding thermodynamics of a β‐heptapeptide from equilibrium simulations , 1999, Proteins.