Capturing the Crystal: Prediction of Enthalpy of Sublimation, Crystal Lattice Energy, and Melting Points of Organic Compounds

Accurate computational prediction of melting points and aqueous solubilities of organic compounds would be very useful but is notoriously difficult. Predicting the lattice energies of compounds is key to understanding and predicting their melting behavior and ultimately their solubility behavior. We report robust, predictive, quantitative structure-property relationship (QSPR) models for enthalpies of sublimation, crystal lattice energies, and melting points for a very large and structurally diverse set of small organic compounds. Sparse Bayesian feature selection and machine learning methods were employed to select the most relevant molecular descriptors for the model and to generate parsimonious quantitative models. The final enthalpy of sublimation model is a four-parameter multilinear equation that has an r(2) value of 0.96 and an average absolute error of 7.9 ± 0.3 kJ.mol(-1). The melting point model can predict this property with a standard error of 45° ± 1 K and r(2) value of 0.79. Given the size and diversity of the training data, these conceptually transparent and accurate models can be used to predict sublimation enthalpy, lattice energy, and melting points of organic compounds in general.

[1]  J. A. Lazzús Neural Network Based on Quantum Chemistry for Predicting Melting Point of Organic Compounds , 2009 .

[2]  R. Gopal Relation between lattice energy and melting points of some crystalline substances. II. Alkali Metals , 1955 .

[3]  A. Bauer-Brandl,et al.  Towards an understanding of the molecular mechanism of solvation of drug molecules: a thermodynamic approach by crystal lattice energy, sublimation, and solubility exemplified by paracetamol, acetanilide, and phenacetin. , 2006, Journal of pharmaceutical sciences.

[4]  A. Bauer-Brandl,et al.  Thermodynamics of solubility, sublimation and solvation processes of parabens. , 2005, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[5]  G Beck,et al.  Evaluation of human intestinal absorption data and subsequent derivation of a quantitative structure-activity relationship (QSAR) with the Abraham descriptors. , 2001, Journal of pharmaceutical sciences.

[6]  A. Guendouzi,et al.  Prediction of the melting points of fatty acids from computed molecular descriptors: a quantitative structure-property relationship study. , 2012, Chemistry and physics of lipids.

[7]  Frank R Burden,et al.  Broad-based quantitative structure-activity relationship modeling of potency and selectivity of farnesyltransferase inhibitors using a Bayesian regularized neural network. , 2004, Journal of medicinal chemistry.

[8]  D. Mathieu Simple Alternative to Neural Networks for Predicting Sublimation Enthalpies from Fragment Contributions , 2012 .

[9]  Ingo Krossing,et al.  Is universal, simple melting point prediction possible? , 2011, Chemphyschem : a European journal of chemical physics and physical chemistry.

[10]  Andreas Bender,et al.  Melting Point Prediction Employing k-Nearest Neighbor Algorithms and Genetic Parameter Optimization , 2006, J. Chem. Inf. Model..

[11]  Frank R. Burden,et al.  Optimal Sparse Descriptor Selection for QSAR Using Bayesian Methods , 2009 .

[12]  S Dimitrov,et al.  Charged partial surface area (CPSA) descriptors QSAR applications , 2002, SAR and QSAR in environmental research.

[13]  Michael G. Hutchings,et al.  Quantitative structure–sublimation enthalpy relationship studied by neural networks, theoretical crystal packing calculations and multilinear regression analysis , 1995 .

[14]  Oleg A. Raevsky,et al.  Sublimation of Molecular Crystals: Prediction of Sublimation Functions on the Basis of HYBOT Physicochemical Descriptors and Structural Clusterization , 2010 .

[15]  D. Mathieu,et al.  MODEL EQUATIONS FOR ESTIMATING SUBLIMATION ENTHALPIES OF ORGANIC COMPOUNDS , 1999 .

[16]  Muthukumarasamy Karthikeyan,et al.  General Melting Point Prediction Based on a Diverse Compound Data Set and Artificial Neural Networks , 2005, J. Chem. Inf. Model..

[17]  A. Holder,et al.  A quantum mechanical quantitative structure-property relationship study of the melting point of a variety of organosilicons. , 2011, Journal of molecular graphics & modelling.

[18]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[19]  Takashi Nakamura,et al.  Measurement of temperature dependence for the vapor pressures of twenty‐six polychlorinated biphenyl congeners in commercial Kanechlor mixtures by the Knudsen effusion method , 2006, Environmental toxicology and chemistry.

[20]  F. Burden,et al.  Robust QSAR models using Bayesian regularized neural networks. , 1999, Journal of medicinal chemistry.

[21]  A comparison of sublimation enthalpies with lattice energies calculated using force fields , 1999 .

[22]  William J. Welsh,et al.  Enthalpies of sublimation and formation of polycyclic aromatic hydrocarbons (PAHs) derived from comparative molecular field analysis (CoMFA): Application of moment of inertia for molecular alignment , 1997 .

[23]  Igor V. Tetko,et al.  CADASTER QSPR Models for Predictions of Melting and Boiling Points of Perfluorinated Chemicals , 2011, Molecular informatics.

[24]  William Jones,et al.  An Assessment of Lattice Energy Minimization for the Prediction of Molecular Organic Crystal Structures , 2004 .

[25]  A. Habibi-Yangjeh,et al.  Prediction of Melting Point for Drug-like Compounds Using Principal Component-Genetic Algorithm-Artificial Neural Network , 2008 .

[26]  David A Winkler,et al.  Modelling Inhalational Anaesthetics Using Bayesian Feature Selection and QSAR Modelling Methods , 2010, ChemMedChem.

[27]  Johann Gasteiger,et al.  Robust modelling of solubility in supercritical carbon dioxide using Bayesian methods. , 2010, Journal of molecular graphics & modelling.

[28]  John B. O. Mitchell,et al.  Can we predict lattice energy from molecular structure? , 2003, Acta Crystallographica Section B Structural Science.

[29]  R. Ruoff Prediction of enthalpies of sublimation of fullerenes from first-order molecular connectivity theory , 1993 .

[31]  M Pastor,et al.  VolSurf: a new tool for the pharmacokinetic optimization of lead compounds. , 2000, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[32]  S. Price,et al.  Computed crystal energy landscapes for understanding and predicting organic crystal structures and polymorphism. , 2009, Accounts of chemical research.

[33]  J. Dearden,et al.  The QSAR prediction of melting point, a property of environmental relevance. , 1991, The Science of the total environment.

[34]  John C. Dearden,et al.  QSPR Correlation of Melting Point for Drug Compounds Based on Different Sources of Molecular Descriptors , 2006, J. Chem. Inf. Model..

[35]  Alfonso R. Gennaro,et al.  Remington:the science and practice of pharmacy , 1995 .

[36]  Frank R. Burden,et al.  An Optimal Self‐Pruning Neural Network and Nonlinear Descriptor Selection in QSAR , 2009 .

[37]  Florian Nigsch,et al.  Why Are Some Properties More Difficult To Predict than Others? A Study of QSPR Models of Solubility, Melting Point, and Log P , 2008, J. Chem. Inf. Model..

[38]  Shobha Bhattachar,et al.  The road map to oral bioavailability: an industrial perspective , 2006, Expert opinion on drug metabolism & toxicology.

[39]  Samuel H. Yalkowsky,et al.  Solubility and Solubilization in Aqueous Media , 1999 .

[40]  Antonio Tabernero,et al.  Estimation of sublimation enthalpies of solids constituted by aromatic and/or polycyclic aliphatic rings by using a group contribution method , 2012 .

[41]  F. Gharagheizi,et al.  Prediction of Crystal Lattice Energy Using Enthalpy of Sublimation: A Group Contribution-Based Model , 2011 .

[42]  Dave Winkler,et al.  Bayesian Regularization of Neural Networks , 2009, Artificial Neural Networks.

[43]  John C Dearden,et al.  Quantitative structure‐property relationships for prediction of boiling point, vapor pressure, and melting point , 2003, Environmental toxicology and chemistry.

[44]  William J. Welsh,et al.  Three-Dimensional Quantitative Structure-Property Relationship (3D-QSPR) Models for Prediction of Thermodynamic Properties of Polychlorinated Biphenyls (PCBs): Enthalpy of Sublimation , 2002, J. Chem. Inf. Comput. Sci..

[45]  Barbara Charton,et al.  The nature of topological parameters. II. The composition of topological parameters , 2003, J. Comput. Aided Mol. Des..

[46]  Tonglei Li,et al.  Predicting Lattice Energy of Organic Crystals by Density Functional Theory with Empirically Corrected Dispersion Energy. , 2006, Journal of chemical theory and computation.

[47]  James S. Chickos,et al.  A protocol for correcting experimental fusion enthalpies to 298.15 K and it's application in indirect measurements of sublimation enthalpy at 298.15 K , 1998 .

[48]  F. Gharagheizi A new molecular-based model for prediction of enthalpy of sublimation of pure components , 2008 .