Feature selection by a genetic algorithm. Application to seed discrimination by artificial vision

Abstract: Genetic algorithms (GAs) are efficient search methods based on theparadigm of natural selection and population genetics. A simple GA was appliedfor selecting the optimal feature subset among an initial feature set of larger size.The performances were tested on a practical pattern recognition problem, whichconsisted on the discrimination between four seed species (two cultivated andtwo adventitious seed species) by artiÐcial vision. A set of 73 features, describingsize, shape and texture, were extracted from colour images in order to character-ise each seed. The goal of the GA was to select the best subset of features whichgave the highest classiÐcation rates when using the nearest neighbour as a classi-Ðcation method. The selected features were represented by binary chromosomeswhich had 73 elements. The number of selected features was directly related tothe probability of initialisation of the population at the Ðrst generation of theGA. When this probability was Ðxed to 0E1, the GA selected about Ðve features.The classiÐcation performances increased with the number of generations. Forexample, 6E25% of the seeds were misclassiÐed by using Ðve features at gener-ation 140, whereas another subset of the same size led to 3% misclassiÐcation atgeneration 400. The present work shows the great potential of GAs for featureselection (dimensionality reduction) problems. 1998 SCI.(J Sci Food Agric 76,77E86 (1998)Key words: feature selection; genetic algorithm; seed; colour image analysis;classiÐcation; discrimination

[1]  Steffen Schulze-Kremer,et al.  Genetic Algorithms for Protein Tertiary Structure Prediction , 1993, ECML.

[2]  William H. Press,et al.  Numerical recipes in C (2nd ed.): the art of scientific computing , 1992 .

[3]  Samir W. Mahfoud Crowding and Preselection Revisited , 1992, PPSN.

[4]  Yuval Davidor,et al.  Genetic algorithms and robotics , 1991 .

[5]  Hugues Bersini,et al.  The Immune Recruitment Mechanism: A Selective Evolutionary Strategy , 1991, ICGA.

[6]  David E. Goldberg,et al.  Sizing Populations for Serial and Parallel Genetic Algorithms , 1989, ICGA.

[7]  L. Darrell Whitley,et al.  The GENITOR Algorithm and Selection Pressure: Why Rank-Based Allocation of Reproductive Trials is Best , 1989, ICGA.

[8]  Lawrence Davis,et al.  Adapting Operator Probabilities in Genetic Algorithms , 1989, ICGA.

[9]  D G Myers,et al.  The application of image processing techniques to the identification of Australian wheat varieties. , 1989 .

[10]  Kanti V. Mardia,et al.  A Spatial Thresholding Method for Image Segmentation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Jack Sklansky,et al.  On Automatic Feature Selection , 1988, Int. J. Pattern Recognit. Artif. Intell..

[12]  E. Shwedyk,et al.  An instrumental system for cereal grain classification using digital image analysis , 1987 .

[13]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[14]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[15]  Mary M. Galloway,et al.  Texture analysis using gray level run lengths , 1974 .