Combination of genetic algorithm and partial least squares for cloud point prediction of nonionic surfactants from molecular structures.

Quantitative structure-property relationship (QSPR) analysis has been directed to a series of pure nonionic surfactants containing linear alkyl, cyclic alkyl, and alkey phenyl ethoxylates. Modeling of cloud point of these compounds as a function of the theoretically derived descriptors was established by multiple linear regression (MLR) and partial least squares (PLS) regression. In this study, a genetic algorithm (GA) was applied as a variable selection method in QSPR analysis. The results indicate that the GA is a very effective variable selection approach for QSPR analysis. The comparison of the two regression methods used showed that PLS has better prediction ability than MLR.

[1]  Riccardo Leardi,et al.  Application of genetic algorithm–PLS for feature selection in spectral data sets , 2000 .

[2]  Hxugo Kubiny Variable Selection in QSAR Studies. I. An Evolutionary Algorithm , 1994 .

[3]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: GA-Based PLS Analysis of Calcium Channel Antagonists , 1997, J. Chem. Inf. Comput. Sci..

[4]  P. Broto,et al.  Molecular structures: perception, autocorrelation descriptor and sar studies. Autocorrelation descriptor , 1984 .

[5]  Jorge Gálvez,et al.  Charge Indexes. New Topological Descriptors , 1994, J. Chem. Inf. Comput. Sci..

[6]  Shah,et al.  Predicting Surfactant Cloud Point from Molecular Structure , 1997, Journal of colloid and interface science.

[7]  H. Tani,et al.  Micelle-mediated extraction , 1997 .

[8]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[9]  E. Pramauro,et al.  A Critical Review of Surfactant-Mediated Phase Separations (Cloud-Point Extractions): Theory and Applications , 1993 .

[10]  K. Roy,et al.  QSAR by LFER model of cytotoxicity data of anti-HIV 5-phenyl-1-phenylamino-1H-imidazole derivatives using principal component factor analysis and genetic function approximation. , 2005, Bioorganic & medicinal chemistry.

[11]  R. Leardi Application of a genetic algorithm to feature selection under full validation conditions and to outlier detection , 1994 .

[12]  R. H. Myers Classical and modern regression with applications , 1986 .

[13]  J. Sjöblom,et al.  Surfactant structure and its relation to the Krafft point, cloud point and micellization : some empirical relationships , 1992 .

[14]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[15]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[16]  K. Unger,et al.  Application of 0.5-μm porous silanized silica beads in electrochromatography , 1997 .

[17]  M. J. Rosen Surfactants and Interfacial Phenomena , 1978 .

[18]  I. W Nowell,et al.  Molecular Connectivity in Structure-Activity Analysis , 1986 .

[19]  R. Leardi,et al.  Genetic algorithms applied to feature selection in PLS regression: how and when to use them , 1998 .

[20]  F. Quina,et al.  Surfactant-Mediated Cloud Point Extractions: An Environmentally Benign Alternative Separation Approach , 1999 .

[21]  Bhupinder S. Dayal,et al.  Improved PLS algorithms , 1997 .

[22]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[23]  Elena V. Konstantinova,et al.  The Discrimination Ability of Some Topological and Information Distance Indices for Graphs of Unbranched Hexagonal Systems , 1996, J. Chem. Inf. Comput. Sci..

[24]  Gerta Rücker,et al.  Counts of all walks as atomic and molecular descriptors , 1993, J. Chem. Inf. Comput. Sci..

[25]  R. Boggia,et al.  Genetic algorithms as a strategy for feature selection , 1992 .

[26]  A. Berthod,et al.  Polyoxyethylene alkyl ether nonionic surfactants: physicochemical properties and use for cholesterol determination in food. , 2001, Talanta.

[27]  H Ichikawa,et al.  Neural networks applied to quantitative structure-activity relationship analysis. , 1990, Journal of medicinal chemistry.

[28]  Hugo Kubinyi,et al.  Evolutionary variable selection in regression and PLS analyses , 1996 .

[29]  Martin J. Schick,et al.  Nonionic Surfactants: Physical Chemistry , 1987 .

[30]  Anton J. Hopfinger,et al.  Application of Genetic Function Approximation to Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[31]  H. Kubinyi Variable Selection in QSAR Studies. II. A Highly Efficient Combination of Systematic Search and Evolution , 1994 .

[32]  L. Rupert,et al.  Physico-Chemical Properties of Selected Anionic, Cationic and Nonionic Surfactants , 1993 .

[33]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .