Data-driven Efficient Solvers and Predictions of Conformational Transitions for Langevin Dynamics on Manifold in High Dimensions

We work on dynamic problems with collected data $\{\mathsf{x}_i\}$ that distributed on a manifold $\mathcal{M}\subset\mathbb{R}^p$. Through the diffusion map, we first learn the reaction coordinates $\{\mathsf{y}_i\}\subset \mathcal{N}$ where $\mathcal{N}$ is a manifold isometrically embedded into an Euclidean space $\mathbb{R}^\ell$ for $\ell \ll p$. The reaction coordinates enable us to obtain an efficient approximation for the dynamics described by a Fokker-Planck equation on the manifold $\mathcal{N}$. By using the reaction coordinates, we propose an implementable, unconditionally stable, data-driven upwind scheme which automatically incorporates the manifold structure of $\mathcal{N}$. Furthermore, we provide a weighted $L^2$ convergence analysis of the upwind scheme to the Fokker-Planck equation. The proposed upwind scheme leads to a Markov chain with transition probability between the nearest neighbor points. We can benefit from such property to directly conduct manifold-related computations such as finding the optimal coarse-grained network and the minimal energy path that represents chemical reactions or conformational changes. To establish the Fokker-Planck equation, we need to acquire information about the equilibrium potential of the physical system on $\mathcal{N}$. Hence, we apply a Gaussian Process regression algorithm to generate equilibrium potential for a new physical system with new parameters. Combining with the proposed upwind scheme, we can calculate the trajectory of the Fokker-Planck equation on $\mathcal{N}$ based on the generated equilibrium potential. Finally, we develop an algorithm to pullback the trajectory to the original high dimensional space as a generative data for the new physical system.

[1]  Mark H. A. Davis,et al.  Applied Stochastic Analysis , 1991 .

[2]  A. Singer,et al.  Vector diffusion maps and the connection Laplacian , 2011, Communications on pure and applied mathematics.

[3]  S. Varadhan On the behavior of the fundamental solution of the heat equation with variable coefficients , 2010 .

[4]  M. Maggioni,et al.  Manifold parametrizations by eigenfunctions of the Laplacian and heat kernels , 2008, Proceedings of the National Academy of Sciences.

[5]  E. Vanden-Eijnden,et al.  String method for the study of rare events , 2002, cond-mat/0205527.

[6]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[7]  Jian‐Guo Liu,et al.  On the rate of convergence of empirical measure in $\infty $-Wasserstein distance for unbounded density function , 2018, Quarterly of Applied Mathematics.

[8]  Nicolás García Trillos,et al.  On the rate of convergence of empirical measures in $\infty$-transportation distance , 2014, 1407.1157.

[9]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[10]  J. Portegies Embeddings of Riemannian Manifolds with Heat Kernels and Eigenfunctions , 2013, 1311.7568.

[11]  Eric Vanden-Eijnden,et al.  Transition-path theory and path-finding algorithms for the study of rare events. , 2010, Annual review of physical chemistry.

[12]  I. Ekeland,et al.  Convex analysis and variational problems , 1976 .

[13]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[14]  Antonio Esposito,et al.  Nonlocal-Interaction Equation on Graphs: Gradient Flow Structure and Continuum Limit , 2019, Archive for Rational Mechanics and Analysis.

[15]  Y. Gliklikh Stochastic Analysis on Manifolds , 2011 .

[16]  Jonathan Bates,et al.  The embedding dimension of Laplacian eigenfunction maps , 2014, ArXiv.

[17]  D. Donoho,et al.  Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  P. Bérard,et al.  Embedding Riemannian manifolds by their heat kernel , 1994 .

[19]  Lei Li,et al.  Large time behaviors of upwind schemes by jump processes , 2018, 1807.08396.

[20]  John K. Beem,et al.  Pseudo-Riemannian manifolds with totally geodesic bisectors , 1975 .

[21]  Antonin Chambolle,et al.  A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging , 2011, Journal of Mathematical Imaging and Vision.

[22]  C. Amante,et al.  ETOPO1 arc-minute global relief model : procedures, data sources and analysis , 2009 .

[23]  Ann B. Lee,et al.  Diffusion maps and coarse-graining: a unified framework for dimensionality reduction, graph partitioning, and data set parameterization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[25]  Matthias Erbar Gradient flows of the entropy for jump processes , 2012, 1204.2190.

[26]  Z. Schuss Singular Perturbation Methods in Stochastic Differential Equations of Mathematical Physics , 1980 .

[27]  Eric Vanden-Eijnden,et al.  Transition Path Theory for Markov Jump Processes , 2009, Multiscale Model. Simul..

[28]  S. Chow,et al.  Fokker–Planck Equations for a Free Energy Functional or Markov Process on a Graph , 2011, Archive for Rational Mechanics and Analysis.

[29]  Amit Singer,et al.  Spectral Convergence of the connection Laplacian from random samples , 2013, 1306.1587.

[30]  Tiejun Li,et al.  Optimal partition and effective dynamics of complex networks , 2008, Proceedings of the National Academy of Sciences.

[31]  M. A. Peletier,et al.  On the Relation between Gradient Flows and the Large-Deviation Principle, with Applications to Markov Chains and Diffusion , 2013, 1312.7591.

[32]  Jian‐Guo Liu,et al.  A note on parametric Bayesian inference via gradient flows , 2020 .

[33]  Hau-Tieng Wu,et al.  Diffusion based Gaussian process regression via heat kernel reconstruction , 2019 .

[34]  C. Bachoc,et al.  Applied and Computational Harmonic Analysis Tight P-fusion Frames , 2022 .

[35]  W. E,et al.  Probabilistic framework for network partition. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  J. Maas Gradient flows of the entropy for finite Markov chains , 2011, 1102.5238.

[37]  Xin Zhou,et al.  Enhancing single-cell cellular state inference by incorporating molecular network features , 2019, bioRxiv.

[38]  Ronald R. Coifman,et al.  Diffusion Maps, Reduction Coordinates, and Low Dimensional Representation of Stochastic Systems , 2008, Multiscale Model. Simul..

[39]  Hau-Tieng Wu,et al.  Connecting dots: from local covariance to empirical intrinsic geometry and locally linear embedding , 2019, Pure and Applied Analysis.

[40]  Hau-Tieng Wu,et al.  Think globally, fit locally under the manifold setup: Asymptotic analysis of locally linear embedding , 2017, The Annals of Statistics.

[41]  P. Deuflhard,et al.  Robust Perron cluster analysis in conformation dynamics , 2005 .