Prediction of boiling points of organic heterocyclic compounds using regression and neural network techniques

High quality models which relate structural descriptors to normal boiling points have been developed for large, diverse groups of heterocyclic compounds using both linear regression and neural network techniques. Parallel experiments were designed to compare the performance of these complementary modeling techniques on two different data sets. A formerly studied data set comprised of 299 tetrahydrofuran (THF), thiophene, furan, and pyran compounds was reexamined using neural networks. In addition, a new data set of 572 pyridine compounds was investigated to increase our understanding of the nitrogen-containing heterocycles. First, several new descriptors were developed to explore chemical principles which govern the boiling point process. In particular, descriptors that reflect hydrogen bonding and dipole-dipole interactions proved especially useful for improving the predictive models in the pyridine regression work. With each data set, neural networks were trained to predict boiling points with close to experimental accuracy using the back-propagation learning algorithm. Results from these boiling point investigations show that once the key structural features are indentified through traditional regression techniques, neural networks generally provide access to superior predictive equations. On the basis of this information, further studies were initiated to explore using neural networks as a tool to upgrade structural feature selection. Results from this phase of the study demonstrate that this methodology can be used to identify the most informationally rich descriptors.

[1]  Peter C. Jurs,et al.  Prediction of carbon-13 nuclear magnetic resonance chemical shifts , 1978 .

[2]  青山 智夫,et al.  Basic Operating Characteristics of Neural Networks When Applied to Structure-Activity Studies , 1991 .

[3]  Hiroyuki Uesaka,et al.  Neural Network System for the Identification of Infrared Spectra , 1992 .

[4]  H Ichikawa,et al.  Neural networks applied to structure-activity relationships. , 1990, Journal of medicinal chemistry.

[5]  I. B. Sladkov ESTIMATION OF MOLAR VOLUME OF INORGANIC LIQUIDS AT BOILING AND CRITICAL POINTS , 1991 .

[6]  Morton E. Munk,et al.  A neural network approach to infrared spectrum interpretation , 1990 .

[7]  Kenneth J. Miller,et al.  Additions and Corrections - A New Empirical Method to Calculate Average Molecular Polarizabilities , 1979 .

[8]  K. Héberger,et al.  Discrimination between linear and non-linear models describing retention data of alkylbenzenes in gas-chromatography , 1990 .

[9]  David E. Rumelhart,et al.  MSnet: A Neural Network which Classifies Mass Spectra , 1990 .

[10]  Keith L. Peterson,et al.  Counter-propagation neural networks in the modeling and prediction of Kovats indexes for substituted phenols , 1992 .

[11]  M. Munk,et al.  Neural network models for infrared spectrum interpretation , 1991 .

[12]  Peter C. Jurs,et al.  Estimation of autoignition temperatures of hydrocarbons, alcohols, and esters from molecular structure , 1992 .

[13]  青山 智夫,et al.  Obtaining the Correlation Indices between Drug Activity and Structural Parameters Using a Neural Network , 1991 .

[14]  Howard T. Mayfield,et al.  Pattern recognition of jet fuel chromatographic data by artificial neural networks with back-propagation of error , 1991 .

[15]  Peter C. Jurs,et al.  Atomic charge calculations for quantitative structure—property relationships , 1992 .

[16]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[17]  David T. Stanton,et al.  Computer‐Assisted Prediction of Normal Boiling Points of Pyrans and Pyrroles. , 1992 .

[18]  David T. Stanton,et al.  Computer‐Assisted Prediction of Normal Boiling Points of Furans, Tetrahydrofurans, and Thiophenes. , 1991 .

[19]  J. Zupan,et al.  Neural networks: A new method for solving chemical problems or just a passing phase? , 1991 .

[20]  Milan Randic,et al.  On molecular identification numbers , 1984, J. Chem. Inf. Comput. Sci..