Fuzzy k -NN Lung Cancer Identification by an Electronic Nose

We present a method to recognize the presence of lung cancer in individuals by classifying the olfactory signal acquired through an electronic nose based on an array of MOS sensors. We analyzed the breath of 101 persons, of which 58 as control and 43 suffering from different types of lung cancer (primary and not) at different stages. In order to find the components able to discriminate between the two classes `healthy' and `sick' as best as possible and to reduce the dimensionality of the problem, we extracted the most significative features and projected them into a lower dimensional space, using Nonparametric Linear Discriminant Analysis. Finally, we used these features as input to a pattern classification algorithm, based on Fuzzy k-Nearest Neighbors (Fuzzy k-NN). The observed results, all validated using cross-validation, have been satisfactory achieving an accuracy of 92.6%, a sensitivity of 95.3% and a specificity of 90.5%. These results put the electronic nose as a valid implementation of lung cancer diagnostic technique, being able to obtain excellent results with a non invasive, small, low cost and very fast instrument.

[1]  R. Lyman Ott,et al.  Introduction to Statistical Methods and Data Analysis (with CD-ROM) , 2006 .

[2]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[3]  E. Martinelli,et al.  Feature Extraction of chemical sensors in phase space , 2003 .

[4]  Ping Wang,et al.  A study of an electronic nose for detection of lung cancer based on a virtual SAW gas sensors array and imaging recognition method , 2005 .

[5]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[6]  E. Martinelli,et al.  Lung cancer identification by the analysis of breath by means of an array of non-selective gas sensors. , 2003, Biosensors & bioelectronics.

[7]  J W Gardner and P N Bartlett,et al.  Electronic Noses: Principles and Applications , 1999 .

[8]  P. Mazzone,et al.  Detection of lung cancer by sensor array analyses of exhaled breath. , 2005, American journal of respiratory and critical care medicine.

[9]  K. Fukunaga,et al.  Nonparametric Discriminant Analysis , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Frederick E. Petry,et al.  Principles and Applications , 1997 .

[11]  A. R. Newman Electronic noses. , 1991, Analytical Chemistry.

[12]  H. Groen,et al.  Preoperative staging of non-small-cell lung cancer with positron-emission tomography. , 2000, The New England journal of medicine.

[13]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  R. Lyman Ott.,et al.  An introduction to statistical methods and data analysis , 1977 .

[15]  Ricardo Gutierrez-Osuna,et al.  The how and why of electronic noses , 1998 .

[16]  C. Borror An Introduction to Statistical Methods and Data Analysis, 5th Ed. , 2002 .

[17]  G. Sberveglieri,et al.  Electronic Olfactory Systems Based on Metal Oxide Semiconductor Sensor Arrays , 2004 .

[18]  Kevin Gleeson,et al.  Detection of lung cancer with volatile markers in the breath. , 2003, Chest.