Flash point prediction of organic compounds using a group contribution and support vector machine

The flash point is one of the most important properties of flammable liquids. This study proposes a support vector regression (SVR) model to predict the flash points of 792 organic compounds from the DIPPR 801 database. The input variables of the model consist of 65 different functional groups, logarithm of molecular weight and their boiling points in this study. Cross-validation and particle swarm optimization were adopted to find three optimal parameters for the SVR model. Since the prediction largely relies on the selection of training data, 100 training data sets were randomly generated and tested. Moreover, all of the organic compounds used in this model were divided into three major classes, which are non-ring, aliphatic ring, and aromatic ring, and a prediction model was built accordingly for each class. The prediction results from the three-class model were much improved than those obtained from the previous works, with the average absolute error being 5.11–7.15 K for the whole data set. The errors in calculation were comparable with the ones from experimental measurements. Therefore, the proposed model can be implemented to determine the initial flash point for any new organic compounds.

[1]  K. Satyanarayana,et al.  Note: Correlation of flash points , 1991 .

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  M. Mannan,et al.  A review of estimation methods for flash points and flammability limits , 2004 .

[4]  Laurent Catoire,et al.  A Unique Equation to Estimate Flash Points of Selected Pure Liquids Application to the Correction of Probably Erroneous Flash Point Values , 2004 .

[5]  Tareq A. Albahri,et al.  Structural Group Contribution Method for Predicting the Octane Number of Pure Hydrocarbon Liquids , 2003 .

[6]  Takahiro Suzuki,et al.  Quantitative Structure-Property Relationships for the Estimation of Boiling Point and Flash Point Using a Radial Basis Function Neural Network , 1999, J. Chem. Inf. Comput. Sci..

[7]  K. Satyanarayana,et al.  Improved equation to estimate flash points of organic compounds , 1992 .

[8]  Iva B. Stoyanova-Slavova,et al.  QSPR modeling of flash points: an update. , 2007, Journal of Molecular Graphics and Modelling.

[9]  Suhani J. Patel,et al.  QSPR Flash Point Prediction of Solvents Using Topological Indices for Application in Computer Aided Molecular Design , 2009 .

[10]  F. Gharagheizi,et al.  Prediction of Flash Point Temperature of Pure Components Using a Quantitative Structure–Property Relationship Model , 2008 .

[11]  Ritu Jain,et al.  QSPR Analysis of Flash Points , 2001, J. Chem. Inf. Comput. Sci..

[12]  V. K. Jayaraman,et al.  Development of support vector regression (SVR)-based correlation for prediction of overall gas hold-up in bubble column reactors for various gas–liquid systems , 2007 .

[13]  K. Ohtaguchi,et al.  A method for estimating flash points of organic compounds from molecular structures. , 1991 .

[14]  E. Biscaia,et al.  Nonlinear parameter estimation through particle swarm optimization , 2008 .

[15]  E. Metcalfe,et al.  Communication: On the correlation of flash points , 1992 .

[16]  Xu Wen,et al.  A New Group Contribution Method for Estimating Critical Properties of Organic Compounds , 2001 .

[17]  G. S. Patil Estimation of flash point , 1988 .

[18]  Yong Pan,et al.  Quantitative Structure–Property Relationship Studies for Predicting Flash Points of Organic Compounds using Support Vector Machines , 2008 .

[19]  Fu‐Yu Hshieh Correlation of closed‐cup flash points with normal boiling points for silicone and general organic compounds , 1997 .

[20]  Gibaek Lee,et al.  A new estimation algorithm of physical properties based on a group contribution and support vector machine , 2008 .

[21]  R. Gani,et al.  New group contribution method for estimating properties of pure compounds , 1994 .

[22]  Jaromír Kukal,et al.  Estimation of the Heat Capacity of Organic Liquids as a Function of Temperature by a Three-Level Group Contribution Method , 2008 .