Simulating solvation and acidity in complex mixtures with first-principles accuracy: the case of CH3SO3H and H2O2 in phenol.

We present a generally-applicable computational framework for the efficient and accurate characterization of molecular structural patterns and acid properties in explicit solvent using H2O2 and CH3SO3H in phenol as an example. In order to address the challenges posed by the complexity of the problem, we resort to a set of data-driven methods and enhanced sampling algorithms. The synergistic application of these techniques makes the first-principle estimation of the chemical properties feasible without renouncing to the use of explicit solvation, involving extensive statistical sampling. Ensembles of neural network potentials are trained on a set of configurations carefully selected out of preliminary simulations performed at a low-cost density-functional tight-binding (DFTB) level. Energy and forces of these configurations are then recomputed at the hybrid density functional theory (DFT) level and used to train the neural networks. The stability of the NN model is enhanced by using DFTB energetics as a baseline, but the efficiency of the direct NN (i.e. baseline-free) is exploited via a multiple-time step integrator. The neural network potentials are combined with enhanced sampling techniques, such as replica exchange and metadynamics, and used to characterize the relevant protonated species and dominant non-covalent interactions in the mixture, also considering nuclear quantum effects.

[1]  Geng Sun,et al.  Towards fast and reliable potential energy surfaces for metallic Pt clusters by hierarchical delta neural networks. , 2019, Journal of chemical theory and computation.

[2]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[3]  Michele Ceriotti,et al.  Fast and Accurate Uncertainty Estimation in Chemical Machine Learning. , 2018, Journal of chemical theory and computation.

[4]  Kurt R. Brorsen Reproducing global potential energy surfaces with continuous-filter convolutional neural networks. , 2019, The Journal of chemical physics.

[5]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[6]  Christoph Dellago,et al.  Parallel Multistream Training of High-Dimensional Neural Network Potentials. , 2019, Journal of chemical theory and computation.

[7]  Alessandro Laio,et al.  Dissociation mechanism of acetic acid in water. , 2006, Journal of the American Chemical Society.

[8]  Matthias Krack,et al.  Pseudopotentials for H to Kr optimized for gradient-corrected exchange-correlation functionals , 2005 .

[9]  Massimiliano Bonomi,et al.  PLUMED 2: New feathers for an old bird , 2013, Comput. Phys. Commun..

[10]  Markus Meuwly,et al.  Reactive molecular dynamics for the [Cl–CH3–Br]− reaction in the gas phase and in solution: a comparative study using empirical and neural network force fields , 2019, Electronic Structure.

[11]  Jörg Behler,et al.  Concentration-Dependent Proton Transfer Mechanisms in Aqueous NaOH Solutions: From Acceptor-Driven to Donor-Driven and Back. , 2016, The journal of physical chemistry letters.

[12]  S. Vasudevan,et al.  The multiple dissociation constants of glutathione disulfide: interpreting experimental pH-titration curves with ab initio MD simulations. , 2019, Physical chemistry chemical physics : PCCP.

[13]  Michele Parrinello,et al.  Simplifying the representation of complex free-energy landscapes using sketch-map , 2011, Proceedings of the National Academy of Sciences.

[14]  Rustam Z. Khaliullin,et al.  Microscopic origins of the anomalous melting behavior of sodium under high pressure. , 2011, Physical review letters.

[15]  Michele Ceriotti,et al.  Beyond static structures: Putting forth REMD as a tool to solve problems in computational organic chemistry , 2015, J. Comput. Chem..

[16]  Christoph Dellago,et al.  Neural networks for local structure detection in polymorphic systems. , 2013, The Journal of chemical physics.

[17]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[18]  Michele Parrinello,et al.  Microscopic description of acid–base equilibrium , 2019, Proceedings of the National Academy of Sciences.

[19]  Gábor Csányi,et al.  Gaussian approximation potentials: A brief tutorial introduction , 2015, 1502.01366.

[20]  H. Kulik,et al.  A Quantitative Uncertainty Metric Controls Error in Neural Network-Driven Chemical Discovery , 2019 .

[21]  Jörg Behler,et al.  Nuclear Quantum Effects in Sodium Hydroxide Solutions from Neural Network Molecular Dynamics Simulations. , 2018, The journal of physical chemistry. B.

[22]  M. Elstner,et al.  Validation of the density-functional based tight-binding approximation method for the calculation of reaction energies and other data. , 2005, The Journal of chemical physics.

[23]  J. Behler Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations. , 2011, Physical chemistry chemical physics : PCCP.

[24]  Joost VandeVondele,et al.  Auxiliary Density Matrix Methods for Hartree-Fock Exchange Calculations. , 2010, Journal of chemical theory and computation.

[25]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[26]  Aldo Glielmo,et al.  Efficient nonparametric n -body force fields from machine learning , 2018, 1801.04823.

[27]  Michele Ceriotti,et al.  Efficient multiple time scale molecular dynamics: Using colored noise thermostats to stabilize resonances. , 2010, The Journal of chemical physics.

[28]  Jörg Behler,et al.  Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials. , 2018, The Journal of chemical physics.

[29]  Jun Cheng,et al.  Redox Potentials and Acidity Constants from Density Functional Theory Based Molecular Dynamics , 2015 .

[30]  Germany,et al.  Neural network interatomic potential for the phase change material GeTe , 2012, 1201.2026.

[31]  Stefan Grimme,et al.  Effect of the damping function in dispersion corrected density functional theory , 2011, J. Comput. Chem..

[32]  J. Blumberger,et al.  Absolute pKa Values and Solvation Structure of Amino Acids from Density Functional Based Molecular Dynamics Simulation. , 2011, Journal of chemical theory and computation.

[33]  Min Wu,et al.  Environmental benefits of methanesulfonic acid. Comparative properties and advantages , 1999 .

[34]  Matthias Rupp,et al.  Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. , 2015, Journal of chemical theory and computation.

[35]  Michele Parrinello,et al.  Demonstrating the Transferability and the Descriptive Power of Sketch-Map. , 2013, Journal of chemical theory and computation.

[36]  M. Elstner,et al.  Parametrization and Benchmark of DFTB3 for Organic Molecules. , 2013, Journal of chemical theory and computation.

[37]  Michele Ceriotti,et al.  A new kind of atlas of zeolite building blocks. , 2019, The Journal of chemical physics.

[38]  L. Halonen,et al.  Ab Initio Molecular Dynamics Simulations of the Influence of Lithium Bromide Salt on the Deprotonation of Formic Acid in Aqueous Solution , 2019, The journal of physical chemistry. B.

[39]  Michele Parrinello,et al.  Using sketch-map coordinates to analyze and bias molecular dynamics simulations , 2012, Proceedings of the National Academy of Sciences.

[40]  S. Vasudevan,et al.  Dissociation constants of weak acids from ab initio molecular dynamics using metadynamics: influence of the inductive effect and hydrogen bonding on pKa values. , 2014, The journal of physical chemistry. B.

[41]  Michele Ceriotti,et al.  Efficient first-principles calculation of the quantum kinetic energy and momentum distribution of nuclei. , 2012, Physical review letters.

[42]  Steven Vandenbrande,et al.  i-PI 2.0: A universal force engine for advanced molecular simulations , 2018, Comput. Phys. Commun..

[43]  V. Barone,et al.  Toward reliable density functional methods without adjustable parameters: The PBE0 model , 1999 .

[44]  J. Behler,et al.  Proton-Transfer-Driven Water Exchange Mechanism in the Na+ Solvation Shell. , 2017, The journal of physical chemistry. B.

[45]  Deprotonation of solvated formic acid: Car-Parrinello and metadynamics simulations. , 2006, The journal of physical chemistry. B.

[46]  Joost VandeVondele,et al.  Gaussian basis sets for accurate calculations on molecular systems in gas and condensed phases. , 2007, The Journal of chemical physics.

[47]  Michael Gaus,et al.  DFTB3: Extension of the self-consistent-charge density-functional tight-binding method (SCC-DFTB). , 2011, Journal of chemical theory and computation.

[48]  David R. Glowacki,et al.  Training neural nets to learn reactive potential energy surfaces using interactive quantum chemistry in virtual reality , 2019, The journal of physical chemistry. A.

[49]  T. Frauenheim,et al.  DFTB+, a sparse matrix-based implementation of the DFTB method. , 2007, The journal of physical chemistry. A.

[50]  B. Ensing,et al.  Advances in enhanced sampling along adaptive paths of collective variables. , 2018, The Journal of chemical physics.

[51]  J. Řezáč Empirical Self-Consistent Correction for the Description of Hydrogen Bonds in DFTB3. , 2017, Journal of chemical theory and computation.

[52]  T. M. Rocha Filho,et al.  The use of neural networks for fitting potential energy surfaces: A comparative case study for the H+3 molecule , 2003 .

[53]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[54]  Joost VandeVondele,et al.  cp2k: atomistic simulations of condensed matter systems , 2014 .

[55]  Michele Ceriotti,et al.  Unsupervised machine learning in atomistic simulations, between predictions and understanding. , 2019, The Journal of chemical physics.

[56]  M. Ceriotti,et al.  Analyzing Fluxional Molecules Using DORI. , 2018, Journal of chemical theory and computation.

[57]  Dong H. Zhang,et al.  A neural network potential energy surface for the F + CH4 reaction including multiple channels based on coupled cluster theory. , 2018, Physical chemistry chemical physics : PCCP.

[58]  Jakub Rydzewski,et al.  Promoting transparency and reproducibility in enhanced molecular simulations , 2019, Nature Methods.

[59]  P. Popelier,et al.  Potential energy surfaces fitted by artificial neural networks. , 2010, The journal of physical chemistry. A.

[60]  Alireza Khorshidi,et al.  Addressing uncertainty in atomistic machine learning. , 2017, Physical chemistry chemical physics : PCCP.

[61]  H. Nakai,et al.  Rigorous pKa Estimation of Amine Species Using Density-Functional Tight-Binding-Based Metadynamics Simulations. , 2018, Journal of chemical theory and computation.

[62]  Michael Gaus,et al.  Parameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological Applications , 2014, Journal of chemical theory and computation.

[63]  C. Corminboeuf,et al.  Data Mining the C−C Cross‐Coupling Genome , 2019, ChemCatChem.

[64]  G. Scuseria,et al.  Assessment of the Perdew–Burke–Ernzerhof exchange-correlation functional , 1999 .

[65]  M. Sulpizi,et al.  Acidity constants from DFT-based molecular dynamics simulations , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[66]  Matteo Salvalaglio,et al.  DeepIce: A Deep Neural Network Approach To Identify Ice and Water Molecules , 2019, J. Chem. Inf. Model..

[67]  Giovanni Bussi,et al.  Colored-Noise Thermostats à la Carte , 2010, 1204.0822.

[68]  A. Gross,et al.  Descriptions of surface chemical reactions using a neural network representation of the potential-energy surface , 2006 .

[69]  S. Godtfredsen,et al.  Ullmann ' s Encyclopedia of Industrial Chemistry , 2017 .

[70]  Weitao Yang,et al.  Molecular Dynamics Simulations with Quantum Mechanics/Molecular Mechanics and Adaptive Neural Networks. , 2018, Journal of chemical theory and computation.

[71]  A. Michaelides,et al.  Quantum nature of the hydrogen bond , 2011, Proceedings of the National Academy of Sciences.

[72]  Mark E. Tuckerman,et al.  Reversible multiple time scale molecular dynamics , 1992 .

[73]  J. Behler Perspective: Machine learning potentials for atomistic simulations. , 2016, The Journal of chemical physics.

[74]  K. Rosso,et al.  Acidity Constants of the Hematite-Liquid Water Interface from Ab Initio Molecular Dynamics. , 2018, The journal of physical chemistry letters.

[75]  Jorg Behler,et al.  Automated Fitting of Neural Network Potentials at Coupled Cluster Accuracy: Protonated Water Clusters as Testing Ground , 2019, Journal of chemical theory and computation.

[76]  Michele Ceriotti,et al.  Mapping uncharted territory in ice from zeolite networks to ice structures , 2018, Nature Communications.

[77]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.