Wavelength selection with Tabu Search

This paper introduces Tabu Search in analytical chemistry by applying it to wavelength selection. Tabu Search is a deterministic global optimization technique loosely based on concepts from artificial intelligence. Wavelength selection is a method which can be used for improving the quality of calibration models. Tabu Search uses basic, problem‐specific operators to explore a search space, and memory to keep track of parts already visited. Several implementational aspects of wavelength selection with Tabu Search will be discussed. Two ways of memorizing the search space are investigated: storing the actual solutions and storing the steps necessary to create them. Parameters associated with Tabu Search are configured with a Plackett–Burman design. In addition, two extension schemes for Tabu Search, intensification and diversification, have been implemented and are applied with good results. Eventually, two implementations of wavelength selection with Tabu Search are tested, one which searches for a solution with a constant number of wavelengths and one with a variable number of wavelengths. Both implementations are compared with results obtained by wavelength selection methods based on simulated annealing (SA) and genetic algorithms (GAs). It is demonstrated with three real‐world data sets that Tabu Search performs equally well as and can be a valuable alternative to SA and GAs. The improvements in predictive abilities increased by a factor of 20 for data set 1 and by a factor of 2 for data sets 2 and 3. In addition, when the number of wavelengths in a solution is variable, measurements on the coverage of the search space show that the coverage is usually higher for Tabu Search compared with SA and GAs. Copyright © 2003 John Wiley & Sons, Ltd.

[1]  D. Werra,et al.  Tabu search: a tutorial and an application to neural networks , 1989 .

[2]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[3]  Fred Glover,et al.  Tabu Search - Part II , 1989, INFORMS J. Comput..

[4]  John H. Kalivas,et al.  Further investigation on a comparative study of simulated annealing and genetic algorithm for wavelength selection , 1995 .

[5]  R. Boggia,et al.  Genetic algorithms as a strategy for feature selection , 1992 .

[6]  D. Massart,et al.  Application of wavelet transform to extract the relevant component from spectral data for multivariate calibration. , 1997, Analytical chemistry.

[7]  Fred W. Glover,et al.  Tabu Search - Part I , 1989, INFORMS J. Comput..

[8]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[9]  C. B. Lucasius,et al.  Genetic algorithms in wavelength selection: a comparative study , 1994 .

[10]  John H. Kalivas,et al.  Simulated‐annealing‐based optimization algorithms: Fundamentals and wavelength selection applications , 1995 .

[11]  Fred W. Glover,et al.  A user's guide to tabu search , 1993, Ann. Oper. Res..

[12]  David E. Clark,et al.  A comparison of heuristic search algorithms for molecular docking , 1997, J. Comput. Aided Mol. Des..

[13]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[14]  William H. Press,et al.  Numerical recipes in C , 2002 .

[15]  Jiri Pospichal,et al.  Fast Evaluation of Chemical Distance by Tabu Search Algorithm , 1994, Journal of chemical information and computer sciences.

[16]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[17]  Riccardo Leardi,et al.  Application of genetic algorithm–PLS for feature selection in spectral data sets , 2000 .

[18]  Lutgarde M. C. Buydens,et al.  Possibilities of visible–near-infrared spectroscopy for the assessment of soil contamination in river floodplains , 2001 .

[19]  Eric R. Ziegel,et al.  Handbook of Chemometrics and Qualimetrics, Part B , 2000, Technometrics.

[20]  R. Plackett,et al.  THE DESIGN OF OPTIMUM MULTIFACTORIAL EXPERIMENTS , 1946 .