Feature selection for classification of animal feed ingredients from near infrared microscopy spectra

The classification of animal feed ingredients has become a challenging computational task since the food crisis that arose in the European Union after the outbreak of bovine spongiform encephalopathy (BSE). The most interesting alternative to replace visual observation under classical microscopy is based on the use of near infrared reflectance microscopy (NIRM). This technique collects spectral information from a set of microscopic particles of animal feeds. These spectra can be classified using maximum margin classifiers with good results. However, it is difficult to interpret the models in terms of the contribution of features. To gain insight into the interpretability of such classifications, we propose a method that learns accurate classifiers defined on a small set of narrow intervals of wavelengths. The proposed method is a greedy bipartite procedure that may be successfully compared with other state-of-the-art feature selectors and can be scaled up efficiently to deal with other classification tasks of higher dimensionality.

[1]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[2]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[3]  Ian Murray,et al.  Use of Discriminant Analysis on Visible and near Infrared Reflectance Spectra to Detect Adulteration of Fishmeal with Meat and Bone Meal , 2001 .

[4]  Eric C. C. Tsang,et al.  On fuzzy approximation operators in attribute reduction with fuzzy rough sets , 2008, Inf. Sci..

[5]  William H. Hsu,et al.  Genetic wrappers for feature selection in decision tree induction and variable ordering in Bayesian network structure learning , 2004, Inf. Sci..

[6]  Chih-Jen Lin,et al.  Trust Region Newton Method for Logistic Regression , 2008, J. Mach. Learn. Res..

[7]  P. Williams,et al.  Near-Infrared Technology in the Agricultural and Food Industries , 1987 .

[8]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[9]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[10]  Yiyu Yao,et al.  Attribute reduction in decision-theoretic rough set models , 2008, Inf. Sci..

[11]  Fernando Vicente,et al.  Particle Size Optimisation in Development of near Infrared Microscopy Methodology to Build Spectral Libraries of Animal Feeds , 2008 .

[12]  Tom Fearn,et al.  Development and validation of near infrared microscopy spectral libraries of ingredients in animal feed as a first step to adopting traceability and authenticity as guarantors of food safety , 2010 .

[13]  Juan José del Coz,et al.  Trait Selection for Assessing Beef Meat Quality Using Non-linear SVM , 2004, NIPS.

[14]  Eric P. Xing,et al.  Feature Selection via Block-Regularized Regression , 2008, UAI.

[15]  Michel Verleysen,et al.  Fast Selection of Spectral Variables with B-Spline Compression , 2007, ArXiv.

[16]  C von Holst,et al.  An overview of tests for animal tissues in feeds applied in response to public health concerns regarding bovine spongiform encephalopathy. , 2003, Revue scientifique et technique.

[17]  José Ramón Quevedo,et al.  Analyzing Sensory Data Using Non-linear Preference Learning with Feature Subset Selection , 2004, ECML.

[18]  Vincent Baeten,et al.  Combination of support vector machines (SVM) and near‐infrared (NIR) imaging spectroscopy for the detection of meat and bone meal (MBM) in compound feeds , 2004 .

[19]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[20]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[21]  Yanjun Li,et al.  Regularized orthogonal forward feature selection for spectral data , 2010, 9th IEEE International Conference on Cognitive Informatics (ICCI'10).

[22]  Chris Cornelis,et al.  Attribute selection with fuzzy decision reducts , 2010, Inf. Sci..

[23]  Michel Verleysen,et al.  Mutual information for the selection of relevant variables in spectrometric nonlinear modelling , 2006, ArXiv.

[24]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[25]  Chen Degang,et al.  A new approach to attribute reduction of consistent and inconsistent covering decision systems with covering rough sets , 2007 .

[26]  Bruno Apolloni,et al.  Feature selection via Boolean independent component analysis , 2009, Inf. Sci..

[27]  J. R. Quevedo,et al.  Improving the Discriminatory Power of a Near-Infrared Microscopy Spectral Library with a Support Vector Machine Classifier , 2010, Applied spectroscopy.

[28]  L. G. Weyer,et al.  Spectra– Structure Correlations in the Near‐Infrared , 2006 .

[29]  Richard Weber,et al.  A wrapper method for feature selection using Support Vector Machines , 2009, Inf. Sci..

[30]  Dolores Pérez-Marín,et al.  Application of near-infrared microscopy (NIRM) for the detection of meat and bone meals in animal feeds: A tool for food and feed safety , 2007 .

[31]  R. Leardi,et al.  Sequential application of backward interval partial least squares and genetic algorithms for the selection of relevant spectral regions , 2004 .

[32]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.