Feature Selection and Sensor Array Optimization in Machine Olfaction

In the last few years, growing attention has been given to strategies for feature and sensor selection in multi-sensor systems for machine olfaction. The two main approaches consist of selecting the features extracted from the sensor response to be used to build a multivariate model, or selecting an optimal subset of factors; for example, principal components or latent variables. Selecting from the full set of features is challenging because there is considerable overlapping among them. Furthermore, features are affected by noise. However, methods based on selected features are interesting because the variables chosen carry direct and relevant chemical information; i.e., response time is connected to chemical kinetics. Therefore, these methods are expected to be robust toward the experimental conditions of each specific application. Unlike feature selection, factor selection uses the full set of variables, including noisy variables, to compute the factors before selecting from among them. The selection of an optimal subset of factors is not necessarily straightforward because the magnitude of an eigenvalue is not always a measure of its significance for the calibration. 2 Feature Selection and Sensor Array Optimization in Machine Olfaction Copyright © 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. INTRODUCTION Machine olfaction applications would greatly benefit from gas sensors with high sensitivity and specificity or selectivity to target analytes, low cross-sensitivity to interfering species, fast response and full reversibility of the detection mechanism Several variable selection methods have been reported as useful (Blum & Langely 1997, Guyon et al. 2006, Naes & Martens 1998, Sun 1995). These include deterministic methods such as forward or backward selection methods, correlated principal component regression analysis of weights resulting from multiple linear regression, branch and bound regression, and stochastic methods such as generalized simulated annealing or genetic algorithms. In most machine olfaction applications, it is usually out of the question to make an exhaustive search because it is a very timeconsuming process, given the large number of variables to be considered for selection. Deterministic methods are, most of the times, greedy methods in which, once a choice has been made, e.g. the selection or elimination of a variable, this decision is never reconsidered. Such techniques can make a good selection with relatively few operations but can get easily trapped in a local optimum of the search space. Unlike deterministic methods, stochastic methods such as simulated annealing or genetic algorithms are more likely to find a global optimum in reasonable computational time. In the case of stochastic methods, the next point to be explored in a solution space is chosen by stochastic rather than deterministic rules, and no assumptions about the characteristics of the problem to be solved are needed. Therefore, they are normally more generally applicable. Although stochastic methods are useful for selecting features, it has been shown (Jouan-Rimbaud, Massart, & Noord 1996, Llobet et al. 2004) that the solution found should be investigated carefully because these algorithms do not prevent meaningless features, such as random non-relevant variables, from being selected. In this context, the main objective of this chapter is to provide the reader with a thorough review of feature or sensor selection for machine olfaction. The organization of the chapter is as follows. First the ‘curse of dimensionality’ and the need for variable selection in gas sensor and direct mass spectrometry based artificial olfaction is discussed. A critical review of the different techniques employed for reducing dimensionality follows. Then, examples taken from the literature showing how these techniques have actually been employed in machine olfaction applications are reviewed and discussed. This is followed by a section devoted to sensor selection and array optimization. The chapter ends with some conclusions drawn from the results presented and a visionary look toward the future in terms of how the field may evolve. 59 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/chapter/feature-selection-sensor-arrayoptimization/52447?camid=4v1 This title is available in InfoSci-Intelligent Technologies, InfoSci-Books, Science, Engineering, and Information Technology, InfoSci-Computer Science and Information Technology, InfoSci-Select, InfoSci-Select, InfoSci-Select. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=16

[1]  T. Næs,et al.  Principal component regression in NIR analysis: Viewpoints, background details and selection of components , 1988 .

[2]  J. Brezmes,et al.  Optimised temperature modulation of metal oxide micro-hotplate gas sensors through multilevel pseudo random sequences , 2005 .

[3]  J. Kauer,et al.  A chemical-detecting system based on a cross-reactive optical sensor array , 1996, Nature.

[4]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[5]  Keith R. Godfrey,et al.  Perturbation signals for system identification , 1993 .

[6]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[7]  T. Eklöv,et al.  Selection of variables for interpreting multivariate gas sensor data , 1999 .

[8]  Konrad Colbow,et al.  Algorithms to improve the selectivity of thermally-cycled tin oxide gas sensors , 1989 .

[9]  P. Corcoran,et al.  Optimal configuration of a thermally cycled gas sensor array with neural network pattern recognition , 1998 .

[10]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[11]  J. Brezmes,et al.  A fuzzy ARTMAP- and PLS-based MS e-nose for the qualitative and quantitative assessment of rancidity in crisps , 2005 .

[12]  A. Hierlemann,et al.  Higher-order Chemical Sensing , 2007 .

[13]  Satoshi Nakata,et al.  Detection and Quantification of CO Gas Based on the Dynamic Response of a Ceramic Sensor , 1991 .

[14]  Joachim Denzler,et al.  Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Zengliang Yu,et al.  Rectangular mode of operation for detecting pesticide residue by using a single SnO2-based gas sensor , 2003 .

[16]  Dario Floreano,et al.  Coevolution of active vision and feature selection , 2004, Biological Cybernetics.

[17]  E. Llobet,et al.  Discrimination between different samples of olive oil using variable selection techniques and modified fuzzy artmap neural networks , 2005, IEEE Sensors Journal.

[18]  J. Brezmes,et al.  Building parsimonious fuzzy ARTMAP models by variable selection with a cascaded genetic algorithm: application to multisensor systems for gas analysis , 2004 .

[19]  Konrad Colbow,et al.  General characteristics of thermally cycled tin oxide gas sensors , 1989 .

[20]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[21]  Avraham Lorber,et al.  Estimation of prediction error for multivariate calibration , 1988 .

[22]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Eduard Llobet,et al.  Selectivity Enhancement in Multisensor Systems Using Flow Modulation Techniques , 2008, Sensors.

[24]  Ricardo Gutierrez-Osuna,et al.  Active Chemical Sensing With Partially Observable Markov Decision Processes , 2009 .

[25]  P. K. Clifford,et al.  Characteristics of semiconductor gas sensors II. transient response to temperature change , 1982 .

[26]  Tom Artursson,et al.  Wavelet transform of electronic tongue data , 2002 .

[27]  J. Brezmes,et al.  Coupling fast variable selection methods to neural network-based classifiers: Application to multisensor systems , 2006 .

[28]  K. Yoshikawa,et al.  Gas Sensing Based on a Nonlinear Response:  Discrimination between Hydrocarbons and Quantification of Individual Components in a Gas Mixture. , 1996, Analytical chemistry.

[29]  Duk-Dong Lee,et al.  Classification of workplace gases using temperature modulation of two SnO2 sensing films on substrate , 2002 .

[30]  J. Brezmes,et al.  Quantitative gas mixture analysis using temperature-modulated micro-hotplate gas sensors: Selection and validation of the optimal modulating frequencies , 2007 .

[31]  J. Brezmes,et al.  Optimized temperature modulation of micro-hotplate gas sensors through pseudorandom binary sequences , 2005, IEEE Sensors Journal.

[32]  Kurt Hornik,et al.  The support vector machine under test , 2003, Neurocomputing.

[33]  Takamichi Nakamoto,et al.  Improvement of optimization algorithm in active gas/odor sensing system , 1995 .

[34]  Eduard Llobet,et al.  Reducing power consumption via a discontinuous operation of temperature-modulated micro-hotplate gas sensors: Application to the logistics chain of fruit☆ , 2008 .

[35]  R. Gosangi,et al.  Active Temperature Programming for Metal-Oxide Chemoresistors , 2010, IEEE Sensors Journal.

[36]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[37]  P. Moseley,et al.  Solid state gas sensors , 1997 .

[38]  Davide Ballabio,et al.  Prediction of Italian red wine sensorial descriptors from electronic nose, electronic tongue and spectrophotometric measurements by means of Genetic Algorithm regression models , 2007 .

[39]  Ricard Boqué,et al.  Determination of ageing time of spirits in oak barrels using a headspace–mass spectrometry (HS-MS) electronic nose system and multivariate calibration , 2005, Analytical and bioanalytical chemistry.

[40]  N. Barsan,et al.  Fundamental and practical aspects in the design of nanoscaled SnO2 gas sensors: a status report , 1999 .

[41]  S. Wlodek,et al.  Kinetic model of thermally cycled tin oxide gas sensor , 1991 .

[42]  Fredrik Winquist,et al.  Extraction and selection of parameters for evaluation of breath alcohol measurement with an electronic nose , 2000 .

[43]  J. Kauer,et al.  Rapid analyte recognition in a device based on optical sensors and the olfactory system. , 1996, Analytical chemistry.

[44]  Ricardo Gutierrez-Osuna,et al.  A dimensionality-reduction technique inspired by receptor convergence in the olfactory system , 2006 .

[45]  John S. Suehle,et al.  Optimized temperature-pulse sequences for the enhancement of chemically specific response patterns from micro-hotplate gas sensors , 1996 .

[46]  G. Sberveglieri,et al.  Comparing the performance of different features in sensor arrays , 2007 .

[47]  Daniel Cozzolino,et al.  Usefulness of chemometrics and mass spectrometry-based electronic nose to classify Australian white wines by their varietal origin. , 2005, Talanta.

[48]  Julian W. Gardner,et al.  Performance definition and standardization of electronic noses , 1996 .

[49]  Gerhard Niebling,et al.  Design of sensor arrays by use of an inverse feature space , 1995 .

[50]  Francisco López-Ferreras,et al.  Feature Reduction Using Support Vector Machines for Binary Gas Detection , 2003, IWANN.

[51]  Richard E. Cavicchi,et al.  KINETICALLY CONTROLLED CHEMICAL SENSING USING MICROMACHINED STRUCTURES , 1998 .

[52]  Desire L. Massart,et al.  Random correlation in variable selection for multivariate calibration with a genetic algorithm , 1996 .

[53]  W. Maziarz,et al.  Gas sensors in a dynamic operation mode , 2008 .

[54]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[55]  K. Persaud,et al.  Analysis of discrimination mechanisms in the mammalian olfactory system using a model nose , 1982, Nature.

[56]  R T Marsili,et al.  SPME-MS-MVA as an electronic nose for the study of off-flavors in milk. , 1999, Journal of agricultural and food chemistry.

[57]  J. Brezmes,et al.  Early detection of fungal growth in bakery products by use of an electronic nose based on mass spectrometry. , 2004, Journal of agricultural and food chemistry.

[58]  Ada Fort,et al.  Tin oxide gas sensing: comparison among different measurement techniques for gas mixture classification , 2003, IEEE Trans. Instrum. Meas..

[59]  Steve Semancik,et al.  Designing and optimizing microsensor arrays for recognizing chemical hazards in complex environments , 2009 .

[60]  J. Kauer,et al.  A computational system for simulating and analyzing arrays of biological and artificial chemical sensors. , 2002, Chemical senses.

[61]  Eduard Llobet,et al.  Efficient feature selection for mass spectrometry based electronic nose applications , 2007 .

[62]  R. Huerta,et al.  Information-theoretic optimization of chemical sensors , 2010 .

[63]  S. Nakata,et al.  Gas sensing based on the dynamic nonlinear responses of a semiconductor gas sensor: dependence on the range and frequency of a cyclic temperature change , 1998 .

[64]  R. Cavicchi,et al.  Optimization of temperature programmed sensing for gas identification using micro-hotplate sensors , 1998 .

[65]  Hartwig Schulz,et al.  Characterisation of ‘Galia’ melon aroma by GC and mass spectrometric sensor measurements after prolonged storage , 2001 .

[66]  C. Pérès,et al.  Fast characterization of foodstuff by headspace mass spectrometry (HS-MS) , 2003 .

[67]  Konrad Colbow,et al.  Selective thermally cycled gas sensing using fast Fourier-transform techniques , 1990 .

[68]  M. Martí,et al.  Fast screening method for determining 2,4,6-trichloroanisole in wines using a headspace–mass spectrometry (HS–MS) system and multivariate calibration , 2003, Analytical and bioanalytical chemistry.

[69]  J. Brezmes,et al.  Variable selection for support vector machine based multisensor systems , 2007 .

[70]  Alexander Vergara,et al.  A sensor conditioning principle for odor identification , 2010 .

[71]  Lucas Paletta,et al.  Active object recognition by view integration and reinforcement learning , 2000, Robotics Auton. Syst..

[72]  W. Gőpel Chemisorption and charge transfer at ionic semiconductor surfaces: Implications in designing gas sensors , 1985 .

[73]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[74]  J. Brezmes,et al.  MS-electronic nose performance improvement using the retention time dimension and two-way and three-way data processing methods , 2010 .

[75]  J. Stetter,et al.  Theoretical basis for identification and measurement of air contaminants using an array of sensors having partly overlapping selectivities , 1984 .

[76]  A J Ramos,et al.  Use of a MS-electronic nose for prediction of early fungal spoilage of bakery products. , 2007, International journal of food microbiology.

[77]  Alexander Vergara,et al.  Kullback-Leibler distance optimization for artificial chemo-sensors , 2009, 2009 IEEE Sensors.

[78]  T. Becker,et al.  Gas-kinetic interactions of nitrous oxides with SnO2 surfaces , 1998 .

[79]  Wolfgang Göpel,et al.  Chemical imaging: I. Concepts and visions for electronic and bioelectronic noses 1 Presented in part , 1998 .

[80]  B Dittmann,et al.  Strategies for the development of reliable QA/QC methods when working with mass spectrometry-based chemosensory systems , 2000 .

[81]  N. Zierler Linear Recurring Sequences , 1959 .

[82]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[83]  Paul Geladi,et al.  Analysis of multi-way (multi-mode) data , 1989 .

[84]  Valerio Vignoli,et al.  Selectivity enhancement of SnO2 sensors by means of operating temperature modulation , 2002 .

[85]  J. Brezmes,et al.  Fast detection of rancidity in potato crisps using e-noses based on mass spectrometry or gas sensors , 2005 .

[86]  N. Lewis,et al.  A chemically diverse conducting polymer-based "electronic nose". , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[87]  Jianguo Sun,et al.  A correlation principal component regression analysis of NIR data , 1995 .

[88]  Evor L. Hines,et al.  Enhancing electronic nose performance by sensor selection using a new integer-based genetic algorithm approach , 2005 .

[89]  P. K. Clifford,et al.  Characteristics of semiconductor gas sensors I. Steady state gas response , 1982 .