Publisher Summary
One of the main problems in processing large data sets is detecting the relevant variables (i.e., the variables that carry information) and eliminating the noise. The goal of feature selection is to eliminate noise, simplify the mathematical model, and reduce the number of variables involved as far as possible. Genetic algorithms (GAs) can be applied to feature selection very easily. This chapter shows that very good results are obtained with a tailor-made GA configuration, in which the classical GA is slightly modified to take into account several peculiarities of the feature selection problem. Hybrid algorithms are conceptually very simple: after a certain number of generations of the genetic algorithm, the best experimental condition found so far undergoes a classical optimization method (in the case of feature selection, stepwise selection); the results thus obtained can enter the population, and a new genetic algorithm run is then started with the updated population. This approach further improves the performance of the genetic algorithm. The chapter presents the application of genetic algorithms to two quantitative structure–activity relationship (QSAR) data sets and compares the results with those described in the literature.
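The hybrid scheme summarized above can be illustrated with a minimal Python sketch. It is not the chapter's tailor-made configuration: the summary does not specify the genetic operators, so truncation selection, uniform crossover, and bit-flip mutation are assumed here, and all names (`hybrid_ga`, `stepwise_refine`, `toy_fitness`, `RELEVANT`) are hypothetical. The toy fitness stands in for a real criterion such as a cross-validated model score.

```python
import random

def stepwise_refine(mask, fitness):
    """Greedy stepwise pass: add or drop one variable at a time while the score improves."""
    best = list(mask)
    best_score = fitness(best)
    improved = True
    while improved:
        improved = False
        for i in range(len(best)):
            trial = best[:]
            trial[i] = 1 - trial[i]              # add or drop variable i
            score = fitness(trial)
            if score > best_score:
                best, best_score, improved = trial, score, True
    return best

def hybrid_ga(n_vars, fitness, pop_size=20, cycles=5, gens_per_cycle=10,
              p_mut=0.05, seed=0):
    """Alternate GA generations with a stepwise refinement of the best chromosome."""
    rng = random.Random(seed)
    # Each chromosome is a 0/1 mask: 1 means the variable is selected.
    pop = [[rng.randint(0, 1) for _ in range(n_vars)] for _ in range(pop_size)]
    for _ in range(cycles):
        for _ in range(gens_per_cycle):
            pop.sort(key=fitness, reverse=True)
            parents = pop[:pop_size // 2]        # truncation selection (assumed)
            children = []
            while len(parents) + len(children) < pop_size:
                a, b = rng.sample(parents, 2)
                child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
                child = [1 - g if rng.random() < p_mut else g for g in child]
                children.append(child)
            pop = parents + children
        # Hybrid step: refine the best chromosome found so far and let the
        # result enter the population in place of the worst one.
        pop.sort(key=fitness, reverse=True)
        pop[-1] = stepwise_refine(pop[0], fitness)
    return max(pop, key=fitness)

# Toy fitness (an assumption for illustration): reward selecting the
# informative variables and penalize every noise variable that is kept.
RELEVANT = {0, 3, 7}

def toy_fitness(mask):
    hits = sum(1 for i in RELEVANT if mask[i])
    noise = sum(mask) - hits
    return hits - 0.5 * noise

best = hybrid_ga(n_vars=12, fitness=toy_fitness)
print(best, toy_fitness(best))
```

In this sketch the refined chromosome replaces the worst member of the population before the next run of generations, which is one simple way to let the stepwise result "enter the population"; other insertion rules are equally plausible.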