An Empirical Study for PCA- and LDA-Based Feature Reduction for Gas Identification

Increasing the number of sensors in a gas identification system generally improves its performance because each sensor adds features for analysis. However, it also increases the computational complexity, especially if the identification algorithm is to be implemented on a hardware platform. Therefore, feature reduction is required to extract the most important information from the sensors for processing. In this paper, linear discriminant analysis (LDA)- and principal component analysis (PCA)-based feature reduction algorithms are analyzed using data obtained from two different types of gas sensors: seven commercial Figaro sensors and an in-house fabricated $4 \times 4$ tin-oxide gas sensor array. A decision tree-based classifier is used to examine the performance of both the PCA and LDA approaches. The software implementation is carried out in MATLAB and the hardware implementation is performed on the Zynq system-on-chip (SoC) platform. With the $4 \times 4$ sensor array, two discriminant functions (DFs) of LDA provide 3.3% better classification than five PCA components, while for the seven Figaro sensors, two principal components and one DF show the same performance. The hardware implementation results on the programmable logic of the Zynq SoC show that LDA outperforms PCA by using 50% fewer resources and by being 11% faster, with a maximum running frequency of 122 MHz.
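
The contrast between the two reduction methods can be illustrated with a minimal sketch. It is not the paper's MATLAB or Zynq implementation and does not use the paper's sensor data. It assumes a synthetic two-feature, two-class data set in which most of the variance is shared by both classes while the class offset lies along a low-variance feature. All function names (`pca_direction`, `lda_direction`, `threshold_accuracy`) are hypothetical, and a simple midpoint threshold stands in for the decision tree-based classifier.

```python
import random

def mean_vec(rows):
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]

def scatter(rows, mu):
    """2x2 scatter matrix: sum of outer products of the centered samples."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for r in rows:
        d = [r[0] - mu[0], r[1] - mu[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def normalize(v):
    norm = (v[0] ** 2 + v[1] ** 2) ** 0.5
    return [v[0] / norm, v[1] / norm]

def pca_direction(rows, iters=200):
    """First principal component via power iteration on the total scatter."""
    s = scatter(rows, mean_vec(rows))
    v = [1.0, 1.0]
    for _ in range(iters):
        v = normalize([s[0][0] * v[0] + s[0][1] * v[1],
                       s[1][0] * v[0] + s[1][1] * v[1]])
    return v

def lda_direction(class_a, class_b):
    """Two-class Fisher discriminant: w proportional to Sw^-1 (mu_a - mu_b)."""
    mu_a, mu_b = mean_vec(class_a), mean_vec(class_b)
    sa, sb = scatter(class_a, mu_a), scatter(class_b, mu_b)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    d = [mu_a[0] - mu_b[0], mu_a[1] - mu_b[1]]
    return normalize([inv[0][0] * d[0] + inv[0][1] * d[1],
                      inv[1][0] * d[0] + inv[1][1] * d[1]])

def threshold_accuracy(class_a, class_b, w):
    """Accuracy of a midpoint threshold on the 1-D projection onto w."""
    pa = [r[0] * w[0] + r[1] * w[1] for r in class_a]
    pb = [r[0] * w[0] + r[1] * w[1] for r in class_b]
    ma, mb = sum(pa) / len(pa), sum(pb) / len(pb)
    cut = (ma + mb) / 2
    if ma > mb:
        correct = sum(p > cut for p in pa) + sum(p <= cut for p in pb)
    else:
        correct = sum(p < cut for p in pa) + sum(p >= cut for p in pb)
    return correct / (len(pa) + len(pb))

# Synthetic data (an assumption, not sensor measurements): feature 0 has
# large variance shared by both classes, feature 1 carries the class offset.
rng = random.Random(0)
gas_a = [[rng.gauss(0, 5), rng.gauss(+1, 0.3)] for _ in range(200)]
gas_b = [[rng.gauss(0, 5), rng.gauss(-1, 0.3)] for _ in range(200)]

w_pca = pca_direction(gas_a + gas_b)   # unsupervised: follows total variance
w_lda = lda_direction(gas_a, gas_b)    # supervised: follows class separation
acc_pca = threshold_accuracy(gas_a, gas_b, w_pca)
acc_lda = threshold_accuracy(gas_a, gas_b, w_lda)
print("PCA 1-D accuracy:", acc_pca)
print("LDA 1-D accuracy:", acc_lda)
```

In this constructed case the first principal component aligns with the high-variance feature and carries little class information, whereas the single discriminant function aligns with the class offset. This is one plausible reason a small number of DFs can match or exceed a larger number of principal components; the behavior on real sensor data depends on how the variance and the class separation are distributed.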
