A Bayesian approach to NMR crystal structure determination.

Nuclear magnetic resonance (NMR) spectroscopy is particularly well suited to determining the structure of molecules and materials in powdered form. Structure determination usually proceeds by finding the best match between experimentally observed NMR chemical shifts and those of candidate structures. Chemical shifts for the candidate configurations have traditionally been computed by electronic-structure methods, and more recently predicted by machine learning. However, the reliability of the determination depends on the errors in the predicted shifts. Here we propose a Bayesian framework for determining the confidence in the identification of the experimental crystal structure, based on knowledge of the typical errors in the electronic-structure methods. We demonstrate the approach on the determination of the structures of six organic molecular crystals. We critically assess the reliability of the structure determinations, aided by a visualization of the similarity between candidate configurations in terms of both their chemical shifts and their structures. We also show that the commonly used values for the errors in calculated 13C shifts are underestimated, and that more accurate, self-consistently determined uncertainties make it possible to use 13C shifts to improve the accuracy of structure determinations. Finally, we extend the recently developed ShiftML model to make it more efficient and more accurate and, most importantly, to evaluate the uncertainties in its predictions. By quantifying the confidence in structure determinations based on ShiftML predictions, we further substantiate that it provides a valid replacement for first-principles calculations in NMR crystallography.
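The idea of turning shift mismatches into a confidence can be illustrated with a minimal sketch. It is not the framework of the paper, whose details are not given in this abstract. It assumes independent Gaussian errors of known standard deviation on the predicted shifts, a known assignment of each experimental shift to an atom, and a uniform prior over candidates. The function name `candidate_posteriors` and all numbers are hypothetical.

```python
import numpy as np


def candidate_posteriors(exp_shifts, candidate_shifts, sigma, prior=None):
    """Posterior probability of each candidate structure given observed shifts.

    Illustrative assumptions (not taken from the paper): independent Gaussian
    errors with standard deviation `sigma` (scalar or one value per shift),
    and shifts already assigned to atoms in the same order for all inputs.
    """
    exp_shifts = np.asarray(exp_shifts, dtype=float)              # shape (n_shifts,)
    candidate_shifts = np.asarray(candidate_shifts, dtype=float)  # shape (n_cand, n_shifts)
    n_candidates = candidate_shifts.shape[0]
    if prior is None:
        # Uniform prior: every candidate equally likely before seeing data.
        prior = np.full(n_candidates, 1.0 / n_candidates)
    prior = np.asarray(prior, dtype=float)

    # Gaussian log-likelihood up to a constant that is the same for all
    # candidates (it cancels when the posterior is normalized).
    residuals = candidate_shifts - exp_shifts
    log_like = -0.5 * np.sum((residuals / sigma) ** 2, axis=1)

    # Work in log space and subtract the maximum for numerical stability.
    log_post = log_like + np.log(prior)
    log_post -= log_post.max()
    post = np.exp(log_post)
    return post / post.sum()


# Hypothetical example: three assigned shifts (ppm) and two candidates.
experimental = [1.0, 2.0, 3.0]
candidates = [[1.1, 2.0, 2.9],   # close to experiment
              [2.0, 3.0, 4.0]]   # systematically off by 1 ppm
posteriors = candidate_posteriors(experimental, candidates, sigma=0.5)
```

In this sketch the choice of `sigma` controls how decisive the result is: an underestimated error makes the posterior overconfident, which is why the abstract stresses that the shift uncertainties must be determined reliably.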
