A Multi-Resolution 3D-DenseNet for Chemical Shift Prediction in NMR Crystallography.

We have developed a deep learning algorithm for chemical shift prediction for atoms in molecular crystals that utilizes an atom-centered Gaussian density model for the 3D data representation of a molecule. We define multiple channels that describe different spatial resolutions for each atom type that utilizes cropping, pooling, and concatenation to create a multi-resolution 3D-DenseNet architecture (MR-3D-DenseNet). Because the training and testing time scale linearly with the number of samples, the MR-3D-DenseNet can exploit data augmentation that takes into account the property of rotational invariance of the chemical shifts, thereby also increasing the size of the training dataset by an order of magnitude without additional cost. We obtain very good agreement for 13C, 15N, and 17O chemical shifts when compared to ab initio quantum chemistry methods, with the highest accuracy found for 1H chemical shifts that is comparable to the error between the ab initio results and experimental measurements. Principal Component analysis (PCA) is used to both understand these greatly improved predictions for 1H , as well as indicating that chemical shift prediction for 13C, 15N, and 17O, which have far fewer training environments than the 1H atom type, will improve once more unique training samples are made available to exploit the deep network architecture.

[1]  Michele Ceriotti,et al.  Chemical shifts in molecular solids by machine learning , 2018, Nature Communications.

[2]  Gaël Varoquaux,et al.  Mayavi: 3D Visualization of Scientific Data , 2010, Computing in Science & Engineering.

[3]  Mark Asta,et al.  NMR Crystallography: Evaluation of Hydrogen Positions in Hydromagnesite by 13 C{1 H} REDOR Solid-State NMR and Density Functional Theory Calculation of Chemical Shielding Tensors. , 2019, Angewandte Chemie.

[4]  Simon W. Ginzinger,et al.  SHIFTX2: significantly improved protein chemical shift prediction , 2011, Journal of biomolecular NMR.

[5]  Kai J. Kohlhoff,et al.  Fast and accurate predictions of protein NMR chemical shifts from interatomic distances. , 2009, Journal of the American Chemical Society.

[6]  I. Bruno,et al.  Cambridge Structural Database , 2002 .

[7]  M. Pyda,et al.  Characterization of Two Distinct Amorphous Forms of Valsartan by Solid-State NMR. , 2016, Molecular pharmaceutics.

[8]  Russ B. Altman,et al.  3D deep convolutional neural networks for amino acid environment similarity analysis , 2017, BMC Bioinformatics.

[9]  Francesco Mauri,et al.  All-electron magnetic response with pseudopotentials: NMR chemical shifts , 2001 .

[10]  A. Bax,et al.  SPARTA+: a modest improvement in empirical NMR chemical shift prediction by means of an artificial neural network , 2010, Journal of biomolecular NMR.

[11]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[12]  D. Apperley,et al.  Characterising the role of water in sildenafil citrate by NMR crystallography , 2016 .

[13]  C. Martineau NMR crystallography: Applications to inorganic materials. , 2014, Solid state nuclear magnetic resonance.

[14]  D. Wishart,et al.  Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts , 2003, Journal of Biomolecular NMR.

[15]  G. Beran,et al.  Improved Electrostatic Embedding for Fragment-Based Chemical Shift Calculations in Molecular Crystals. , 2017, Journal of chemical theory and computation.

[16]  Chris J Pickard,et al.  Ab Initio Quality NMR Parameters in Solid-State Materials Using a High-Dimensional Neural-Network Representation. , 2016, Journal of chemical theory and computation.

[17]  Rim Shayakhmetov,et al.  3D Molecular Representations Based on the Wave Transform for Convolutional Neural Networks. , 2018, Molecular pharmaceutics.

[18]  Nikos Paragios,et al.  EnzyNet: enzyme classification using 3D convolutional neural networks on spatial representation , 2017, PeerJ.

[19]  G. Day,et al.  Benchmark fragment-based (1)H, (13)C, (15)N and (17)O chemical shift predictions in molecular crystals. , 2016, Physical chemistry chemical physics : PCCP.

[20]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[21]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  E. Salager,et al.  Powder crystallography of pharmaceutical materials by combined crystal structure prediction and solid-state 1H NMR spectroscopy. , 2013, Physical chemistry chemical physics : PCCP.

[23]  G. Buntkowsky,et al.  NMR crystallography of amides, peptides and protein–ligand complexes , 2013 .

[24]  D. L. Bryce NMR crystallography: structure and properties of materials from solid-state nuclear magnetic resonance observables , 2017, IUCrJ.

[25]  D. Wishart,et al.  Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts , 2003, Journal of biomolecular NMR.

[26]  F. Fotiadu,et al.  Structure elucidation of a complex CO2-based organic framework material by NMR crystallography , 2016, Chemical science.

[27]  A. Mehta,et al.  NMR Crystallography: Evaluation of Hydrogen Positions in Hydromagnesite by 13 C{ 1 H} REDOR Solid‐State NMR and Density Functional Theory Calculation of Chemical Shielding Tensors , 2019, Angewandte Chemie.

[28]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[29]  A. Bax,et al.  Protein backbone chemical shifts predicted from searching a database for torsion angle and sequence homology , 2007, Journal of biomolecular NMR.

[30]  Michael Habeck,et al.  Membrane-protein structure determination by solid-state NMR spectroscopy of microcrystals , 2012, Nature Methods.

[31]  Isaac Tamblyn,et al.  Convolutional neural networks for atomistic systems , 2017, Computational Materials Science.