Prediction of Standard Enthalpies of Formation Based on Hydrocarbon Molecular Descriptors and Active Subspace Methodology

Standard enthalpy of formation is an important fuel thermochemical property for energy balance calculation and equilibrium dynamics determination in combustion reactions. In this study, an approach combining the quantitative structure–property relation and the active subspace method was applied to predict the standard enthalpies of formation of different hydrocarbon classes, including alkanes, cycloalkanes, alkenes, alkynes, and aromatics. The standard enthalpies of formation in a database of 1020 hydrocarbons were predicted by a one-dimensional active subspace model, with 902 molecule topological indices being used as input descriptors. The correlation coefficient between the measured and predicted values was 0.99, and the average absolute error was 6.74 kJ/mol. The sensitivity of the prediction output to each descriptor was further evaluated, which enables us to build simplified models by only considering the most representative descriptors. Also, to assess the influences of model simplification on the prediction performance, the relation between the descriptor number and the prediction accuracy was explored. It was observed that simplified but well-predictive models can be established when the top 10 influential descriptors were selected for model building, and the correlation coefficient between the predicted and measured standard enthalpies of formation was as high as 0.96.

[1]  J. B. Pedley,et al.  Thermochemical data of organic compounds , 1986 .

[2]  Tatsuya Takagi,et al.  Mordred: a molecular descriptor calculator , 2018, Journal of Cheminformatics.

[3]  R. Gani,et al.  New group contribution method for estimating properties of pure compounds , 1994 .

[4]  Dong-Sheng Cao,et al.  PyDPI: Freely Available Python Package for Chemoinformatics, Bioinformatics, and Chemogenomics Studies , 2013, J. Chem. Inf. Model..

[5]  Eduardo A. Castro,et al.  QSPR modeling of the enthalpy of formation based on Partial Order Ranking , 2005 .

[6]  N. L. Allinger,et al.  Enthalpies of hydrogenation of phenylalkynes: indirect determination of the enthalpy of formation of diphenylcyclopropenone , 1985 .

[7]  Paul G. Constantine,et al.  Active subspaces for sensitivity analysis and dimension reduction of an integrated hydrologic model , 2015, Comput. Geosci..

[8]  F. Gharagheizi,et al.  Standard molar chemical exergy: A new accurate model , 2018, Energy.

[9]  S. Verevkin,et al.  Renewable platform chemicals: Evaluation of thermochemical data of alkyl lactates with complementary experimental and computational methods , 2019, The Journal of Chemical Thermodynamics.

[10]  Anton J. Hopfinger,et al.  Application of Genetic Function Approximation to Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[11]  Svetoslav H. Slavov,et al.  Quantitative correlation of physical and chemical properties with chemical structure: utility for prediction. , 2010, Chemical reviews.

[12]  P. Seybold,et al.  Molecular modeling of the physical properties of the alkanes , 1988 .

[13]  Dong Han,et al.  Cetane number prediction for hydrocarbons from molecular structural descriptors based on active subspace methodology , 2019, Fuel.

[14]  U. Schubert,et al.  Predicting thermochemical parameters of oxygen-containing heterocycles using simple QSPR models , 2006 .

[15]  F. Gharagheizi,et al.  Prediction of Standard Enthalpy of Formation by a QSPR Model , 2007, International Journal of Molecular Sciences.

[16]  Yaxiong Zhang,et al.  An improved QSPR study of standard formation enthalpies of acyclic alkanes based on artificial neural networks and genetic algorithm , 2009 .

[17]  Zhuyin Ren,et al.  Shared low-dimensional subspaces for propagating kinetic uncertainty to multiple outputs , 2018 .

[18]  R. Notario,et al.  Enthalpies of formation of four isoxazole derivatives in the solid and gas phases: application to the study of chemical equilibria , 2017, Structural Chemistry.

[19]  Paul G. Constantine,et al.  Active Subspaces - Emerging Ideas for Dimension Reduction in Parameter Studies , 2015, SIAM spotlights.

[20]  William J. Rogers,et al.  Predicting the Impact Sensitivities of Polynitro Compounds Using Quantum Chemical Descriptors , 2006 .

[21]  M. Jia,et al.  Construction of a skeletal oxidation mechanism of n-pentanol by integrating decoupling methodology, genetic algorithm, and uncertainty quantification , 2018, Combustion and Flame.

[22]  K. Joback,et al.  ESTIMATION OF PURE-COMPONENT PROPERTIES FROM GROUP-CONTRIBUTIONS , 1987 .

[23]  G. Pilcher,et al.  Measurements of heats of combustion by flame calorimetry. Part 8.—Methane, ethane, propane, n-butane and 2-methylpropane , 1972 .

[24]  Daniel Berrar,et al.  Introduction to the Non-Parametric Bootstrap , 2019, Encyclopedia of Bioinformatics and Computational Biology.

[25]  Zhen Huang,et al.  Influences of isomeric butanol addition on anti-knock tendency of primary reference fuel and toluene primary reference fuel gasoline surrogates , 2021, International Journal of Engine Research.

[26]  Qingsong Xu,et al.  Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions , 2015, Bioinform..

[27]  B. J. Neely,et al.  Molecular modeling of the standard state heat of formation , 2013 .

[28]  G. Stridh Enthalpies of formation of 1-dodecene and 1-hexadecene and the CH2-increment in the 1-alkene series , 1976 .