Accurate molecular polarizabilities with coupled cluster theory and machine learning

Significance The dipole polarizability of molecules and materials is central to several physical phenomena, modeling techniques, and the interpretation of many experiments. Its accurate evaluation from first principles requires quantum chemistry methods that are often too demanding for routine use. The highly accurate calculations reported herein provide a much-needed benchmark of the accuracy of hybrid density functional theory (DFT) as well as training data for a machine-learning model that can predict the polarizability tensor with an error that is about 50% smaller than DFT. This framework provides an accurate, inexpensive, and transferable strategy for estimating the polarizabilities of molecules containing dozens of atoms, and therefore removes a considerable obstacle to accurate and reliable atomistic-based modeling of matter. The molecular dipole polarizability describes the tendency of a molecule to change its dipole moment in response to an applied electric field. This quantity governs key intra- and intermolecular interactions, such as induction and dispersion; plays a vital role in determining the spectroscopic signatures of molecules; and is an essential ingredient in polarizable force fields. Compared with other ground-state properties, an accurate prediction of the molecular polarizability is considerably more difficult, as this response quantity is quite sensitive to the underlying electronic structure description. In this work, we present highly accurate quantum mechanical calculations of the static dipole polarizability tensors of 7,211 small organic molecules computed using linear response coupled cluster singles and doubles theory (LR-CCSD). Using a symmetry-adapted machine-learning approach, we demonstrate that it is possible to predict the LR-CCSD molecular polarizabilities of these small molecules with an error that is an order of magnitude smaller than that of hybrid density functional theory (DFT) at a negligible computational cost. The resultant model is robust and transferable, yielding molecular polarizabilities for a diverse set of 52 larger molecules (including challenging conjugated systems, carbohydrates, small drugs, amino acids, nucleobases, and hydrocarbon isomers) at an accuracy that exceeds that of hybrid DFT. The atom-centered decomposition implicit in our machine-learning approach offers some insight into the shortcomings of DFT in the prediction of this fundamental quantity of interest.

[1]  Andrea Grisafi,et al.  Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems. , 2017, Physical review letters.

[2]  M. Orio,et al.  Density functional theory , 2009, Photosynthesis Research.

[3]  B. Paulus,et al.  First Multireference Correlation Treatment of Bulk Metals. , 2014, Journal of chemical theory and computation.

[4]  M. Head‐Gordon,et al.  How accurate are static polarizability predictions from density functional theory? An assessment over 132 species at equilibrium geometry. , 2018, Physical chemistry chemical physics : PCCP.

[5]  J. Hammond,et al.  Coupled-cluster dynamic polarizabilities including triple excitations. , 2008, The Journal of chemical physics.

[6]  Volodymyr Babin,et al.  Development of a "First-Principles" Water Potential with Flexible Monomers. III. Liquid Phase Properties. , 2014, Journal of chemical theory and computation.

[7]  Jeff R. Hammond,et al.  Accurate dipole polarizabilities for water clusters n=2-12 at the coupled-cluster level of theory and benchmarking of various density functionals. , 2009, The Journal of chemical physics.

[8]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[9]  Tristan Bereau,et al.  Transferable Atomic Multipole Machine Learning Models for Small Organic Molecules. , 2015, Journal of chemical theory and computation.

[10]  K. Burke Perspective on density functional theory. , 2012, The Journal of chemical physics.

[11]  H. Monkhorst,et al.  Calculation of properties with the coupled-cluster method , 2009 .

[12]  J. Gauss,et al.  Frequency-dependent polarizabilities and first hyperpolarizabilities of CO and H2O from coupled cluster calculations , 1999 .

[13]  Poul Jørgensen,et al.  Response functions from Fourier component variational perturbation theory applied to a time-averaged quasienergy , 1998 .

[14]  Gábor Csányi,et al.  Comparing molecules and solids across structural and alchemical space. , 2015, Physical chemistry chemical physics : PCCP.

[15]  Stefano de Gironcoli,et al.  Reproducibility in density functional theory calculations of solids , 2016, Science.

[16]  S. Pal,et al.  Systematic comparison of DFT and CCSD dipole moments, polarizabilities and hyperpolarizabilities , 2015 .

[17]  Michiel Sprik,et al.  A polarizable model for water using distributed charge sites , 1988 .

[18]  Marcella Iannuzzi,et al.  Raman spectra from ab initio molecular dynamics and its application to liquid S-methyloxirane. , 2014, The Journal of chemical physics.

[19]  Henrik Koch,et al.  Coupled cluster response functions , 1990 .

[20]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[21]  Yaliang Li,et al.  SCI , 2021, Proceedings of the 30th ACM International Conference on Information & Knowledge Management.

[22]  David E. Woon,et al.  Gaussian basis sets for use in correlated molecular calculations. IV. Calculation of static electrical response properties , 1994 .

[23]  Alexandre Tkatchenko,et al.  Non-covalent interactions across organic and biological subsets of chemical space: Physics-based potentials parametrized from machine learning. , 2017, The Journal of chemical physics.

[24]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[25]  George E. Dahl,et al.  Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error. , 2017, Journal of chemical theory and computation.

[26]  Keith E. Laidig,et al.  PROPERTIES OF ATOMS IN MOLECULES : ATOMIC POLARIZABILITIES , 1990 .

[27]  H. Schlegel,et al.  Static and Dynamic Polarizabilities of Conjugated Molecules and Their Cations , 2004 .

[28]  Margaret E. Johnson,et al.  Current status of the AMOEBA polarizable force field. , 2010, The journal of physical chemistry. B.

[29]  A. Becke Density-functional thermochemistry. III. The role of exact exchange , 1993 .

[30]  M. Frisch,et al.  Ab Initio Calculation of Vibrational Absorption and Circular Dichroism Spectra Using Density Functional Force Fields , 1994 .

[31]  Alán Aspuru-Guzik,et al.  Advances in molecular quantum chemistry contained in the Q-Chem 4 program package , 2014, Molecular Physics.

[32]  Karol Kowalski,et al.  Coupled cluster calculations for static and dynamic polarizabilities of C60. , 2008, The Journal of chemical physics.

[33]  Denis Sh. Sabirov,et al.  Polarizability as a landmark property for fullerene chemistry and materials science , 2014 .

[34]  Jan Hermann,et al.  First-Principles Models for van der Waals Interactions in Molecules and Materials: Concepts, Theory, and Applications. , 2017, Chemical reviews.

[35]  J. R. Carl,et al.  Atom dipole interaction model for molecular polarizability. Application to polyatomic molecules and determination of atom polarizabilities , 1972 .

[36]  J. Jia,et al.  On the geometric dependence of the molecular dipole polarizability in water: A benchmark study of higher-order electron correlation, basis set incompleteness error, core electron effects, and zero-point vibrational contributions. , 2018, The Journal of chemical physics.

[37]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[38]  Matthias Rupp,et al.  Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. , 2015, Journal of chemical theory and computation.

[39]  S. Grimme Dispersion Interaction and Chemical Bonding , 2014 .

[40]  Chem. , 2020, Catalysis from A to Z.

[41]  Jörg Behler,et al.  Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials. , 2018, The Journal of chemical physics.

[42]  R. Dreizler,et al.  Density Functional Theory: An Advanced Course , 2011 .

[43]  M. Rupp,et al.  Machine learning of molecular electronic properties in chemical compound space , 2013, 1305.7074.

[44]  M. Ceriotti,et al.  Solvent fluctuations and nuclear quantum effects modulate the molecular hyperpolarizability of water , 2017, 1705.01649.

[45]  Evert Jan Baerends,et al.  Exchange potential from the common energy denominator approximation for the Kohn–Sham Green’s function: Application to (hyper)polarizabilities of molecular chains , 2002 .

[46]  J. Gauss,et al.  Polarizabilities of CO, N2, HF, Ne, BH, and CH+ from ab initio calculations: Systematic studies of electron correlation, basis set errors, and vibrational contributions , 1998 .

[47]  Aldo Glielmo,et al.  Efficient nonparametric n -body force fields from machine learning , 2018, 1801.04823.

[48]  Kerwin Hui,et al.  SCAN-based hybrid and double-hybrid density functionals from models without fitted parameters. , 2015, The Journal of chemical physics.

[49]  E. Engel,et al.  Density Functional Theory , 2011 .

[50]  Xiao Wang,et al.  Psi4 1.1: An Open-Source Electronic Structure Program Emphasizing Automation, Advanced Libraries, and Interoperability. , 2017, Journal of chemical theory and computation.

[51]  Sotiris S Xantheas,et al.  Development of transferable interaction potentials for water. V. Extension of the flexible, polarizable, Thole-type model potential (TTM3-F, v. 3.0) to describe the vibrational spectra of water clusters and liquid water. , 2008, The Journal of chemical physics.

[52]  P. Alam ‘W’ , 2021, Composites Engineering.

[53]  M. Papadopoulos,et al.  Calculation of the microscopic and macroscopic linear and nonlinear optical properties of acetonitrile: I. Accurate molecular properties in the gas phase and susceptibilities of the liquid in Onsager's reaction-field model , 2003 .

[54]  Lorenz C. Blum,et al.  970 million druglike small molecules for virtual screening in the chemical universe database GDB-13. , 2009, Journal of the American Chemical Society.

[55]  Francesco Paesani,et al.  Dissecting the Molecular Structure of the Air/Water Interface from Quantum Simulations of the Sum-Frequency Generation Spectrum. , 2016, Journal of the American Chemical Society.

[56]  Noam Bernstein,et al.  Machine learning unifies the modeling of materials and molecules , 2017, Science Advances.

[57]  M. Deleuze,et al.  Benchmark theoretical study of the electric polarizabilities of naphthalene, anthracene, and tetracene. , 2013, The Journal of chemical physics.

[58]  J. Hynes,et al.  A theoretical analysis of the sum frequency generation spectrum of the water surface , 2000 .

[59]  Y. Shen,et al.  Surface properties probed by second-harmonic and sum-frequency generation , 1989, Nature.

[60]  Anthony J. Stone,et al.  The Theory of Intermolecular Forces , 2013 .