Elucidating the Role of Hydrogen Bonding in the Optical Spectroscopy of the Solvated Green Fluorescent Protein Chromophore: Using Machine Learning to Establish the Importance of High-Level Electronic Structure

Hydrogen bonding interactions with chromophores in chemical and biological environments play a key role in determining their electronic absorption and relaxation processes, which are manifested in their linear and multidimensional optical spectra. For chromophores in the condensed phase, the large number of atoms needed to simulate the environment has traditionally prohibited the use of high-level excited-state electronic structure methods. By leveraging transfer learning, we show how to construct machine-learned models that accurately predict high-level excitation energies of a chromophore in solution from only 400 high-level calculations. We show that when the electronic excitations of the green fluorescent protein chromophore in water are treated using EOM-CCSD embedded in a DFT description of the solvent, the optical spectrum is correctly captured. This improvement arises from correctly treating the coupling of the electronic transition to electric fields, which leads to a larger response upon hydrogen bonding between the chromophore and water.
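
The transfer-learning idea mentioned above can be illustrated with a minimal sketch. It is not the authors' model or descriptors, which the abstract does not specify. As assumptions for illustration only, it uses fixed random Fourier features in place of a network's hidden layers, two hypothetical toy functions (`low_level_energy`, `high_level_energy`) in place of cheap and expensive electronic structure calculations, and ridge regression pulled toward the pretrained weights as the fine-tuning step. Only the count of 400 high-level samples is taken from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for two levels of theory (toy functions, NOT real chemistry).
def low_level_energy(x):
    return np.sin(x[:, 0]) + 0.5 * x[:, 1]

def high_level_energy(x):
    # Same overall trend as the cheap method plus a smooth systematic correction.
    return low_level_energy(x) + 0.3 * np.cos(2.0 * x[:, 0]) - 0.2

dim, n_feat = 2, 200
W = rng.normal(size=(dim, n_feat))
b = rng.uniform(0.0, 2.0 * np.pi, size=n_feat)

def features(x):
    # Fixed random Fourier features shared by both training stages.
    return np.cos(x @ W + b)

def ridge_fit(phi, y, lam, w_prior=None):
    # Minimizes ||phi w - y||^2 + lam * ||w - w_prior||^2 in closed form.
    n = phi.shape[1]
    if w_prior is None:
        w_prior = np.zeros(n)
    lhs = phi.T @ phi + lam * np.eye(n)
    rhs = phi.T @ y + lam * w_prior
    return np.linalg.solve(lhs, rhs)

def rmse(pred, ref):
    return float(np.sqrt(np.mean((pred - ref) ** 2)))

# Many cheap samples, only 400 expensive ones, and a held-out test set.
x_low = rng.uniform(-2.0, 2.0, size=(5000, dim))
x_high = rng.uniform(-2.0, 2.0, size=(400, dim))
x_test = rng.uniform(-2.0, 2.0, size=(1000, dim))

# Stage 1: pretrain on the abundant low-level data.
w_pre = ridge_fit(features(x_low), low_level_energy(x_low), lam=1e-2)

# Stage 2: fine-tune on the scarce high-level data, staying close to w_pre.
w_ft = ridge_fit(features(x_high), high_level_energy(x_high), lam=1.0, w_prior=w_pre)

y_test = high_level_energy(x_test)
rmse_pre = rmse(features(x_test) @ w_pre, y_test)
rmse_ft = rmse(features(x_test) @ w_ft, y_test)
print(f"pretrained only: {rmse_pre:.3f}   fine-tuned: {rmse_ft:.3f}")
```

In this toy setting the fine-tuned weights only have to learn the difference between the two levels of theory, which is why a few hundred expensive samples can be enough. Whether the same holds for a real chromophore in solution depends on the descriptors and model used.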
