Combining optical spectroscopy and machine learning to improve food classification

Abstract Near-infrared spectroscopic data, used for non-destructive product identification, are traditionally processed using multivariate data analysis techniques. However, these methods often cover only a limited product variability. We target the development of a novel machine learning based algorithm enabling the identification of foreign objects, in combination with food safety and quality evaluation in a product stream, by combining the information from ultraviolet, visible, near-infrared reflection spectroscopy and fluorescence spectroscopy. Therefore, we implemented a novel classification scheme using a cascade of individual classifiers combining both types of spectral data. In addition, to ease implementation in industrial applications and reduce processing time, we applied a feature selection search, limiting the considered illumination and detection wavelengths to 8. As an illustration of our novel classification algorithm, we present the processing of walnuts in this paper. The optimal cascade consists of a first classifier based on reflection measurements using Extreme Learning Machine and a second classifier based on fluorescence measurements using Support Vector Machines. A false negative rate of the good nuts of 5.54% was found, while the maximal false positive rate equals 8.34%, for shriveled walnuts. All other sample defects, including both foreign objects and molds, show a correct classification rate exceeding 98%. Consequently, this excellent performance indicates the strength of machine learning processing for multipurpose food processing applications.

[1]  Chandra Verma,et al.  Modelling study of dimerization in mammalian defensins , 2006, BMC Bioinformatics.

[2]  Anna Grazia Mignani,et al.  Optical spectroscopy for food and beverages control , 2010, International Conference on Fiber Optics and Photonics.

[3]  Da-Wen Sun Infrared spectroscopy for food quality analysis and control , 2009 .

[4]  Hugo Thienpont,et al.  Optical detection of aflatoxins in maize using one- and two-photon induced fluorescence spectroscopy , 2015 .

[5]  David C. Slaughter,et al.  Detection of fungal infection in almond kernels using near-infrared reflectance spectroscopy , 2015 .

[6]  Peng Liu,et al.  Detection of Aspergillus spp. contamination levels in peanuts by near infrared spectroscopy and electronic nose , 2018, Food Control.

[7]  Agnieszka Nawrocka,et al.  Determination of Food Quality by Using Spectroscopic Methods , 2013 .

[8]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[9]  Hugo Thienpont,et al.  Photonics enhanced sensors for food monitoring: part 2 , 2017, IEEE Instrumentation & Measurement Magazine.

[10]  Jens Pietzsch,et al.  Carboranyl Derivatives of Rofecoxib with Cytostatic Activity against Human Melanoma and Colon Cancer Cells , 2020, Scientific Reports.

[11]  Hugo Thienpont,et al.  Photonics enhanced sensors for food monitoring: Part 3 , 2017, IEEE Instrumentation & Measurement Magazine.

[12]  Efstathios Z. Panagou,et al.  Data mining derived from food analyses using non-invasive/non-destructive analytical techniques; determination of food authenticity, quality & safety in tandem with computer science disciplines , 2016 .

[13]  Hugo Thienpont,et al.  Photonics enhanced sensors for food monitoring: part 1 , 2016, IEEE Instrumentation & Measurement Magazine.

[15]  Raffaele Romano,et al.  Non-destructive detection of flawed hazelnut kernels and lipid oxidation assessment using NIR spectroscopy , 2015 .

[16]  Songyot Nakariyakul,et al.  Internal damage inspection of almond nuts using optimal near-infrared waveband selection technique , 2014 .

[17]  Yukihiro Ozaki,et al.  Near-Infrared Spectroscopy in Food Science and Technology: Ozaki/Near-Infrared Spectroscopy in Food Science and Technology , 2006 .

[18]  Benyamin Ghojogh,et al.  Linear and Quadratic Discriminant Analysis: Tutorial , 2019, ArXiv.

[19]  Viv Bewick,et al.  Statistics review 13: Receiver operating characteristic curves , 2004, Critical care.

[20]  Danilo Monarca,et al.  Nondestructive detection of insect infested chestnuts based on NIR spectroscopy , 2014 .

[21]  Richard Simon,et al.  Bias in error estimation when using cross-validation for model selection , 2006, BMC Bioinformatics.

[22]  Danilo Monarca,et al.  Feasibility of Vis/NIR spectroscopy for detection of flaws in hazelnut kernels , 2013 .

[23]  H. Thienpont,et al.  Non-destructive detection of mycotoxins in maize kernels using diffuse reflectance spectroscopy , 2016 .

[24]  Songyot Nakariyakul,et al.  Classification of internally damaged almond nuts using hyperspectral imagery , 2011 .

[25]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[26]  P. Mallikarjunan,et al.  Discrimination of moldy peanuts with reference to aflatoxin using FTIR-ATR system , 2014 .

[27]  J. Sádecká,et al.  Fluorescence spectroscopy and chemometrics in the food classification - : a review , 2018 .

[28]  Panagiotis Tsakanikas,et al.  A machine learning workflow for raw food spectroscopic classification in a future industry , 2020, Scientific Reports.

[29]  Hugo Thienpont,et al.  Optical detection techniques for laser sorting machines , 2006, SPIE Photonics Europe.

[30]  Danilo Monarca,et al.  Detection of Mold-Damaged Chestnuts by Near-Infrared Spectroscopy , 2014 .

[31]  R. Boulton,et al.  Use of Near-Infrared Spectroscopy and Chemometrics for the Nondestructive Identification of Concealed Damage in Raw Almonds (Prunus dulcis). , 2016, Journal of agricultural and food chemistry.

[32]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[33]  David W. Aha,et al.  A Comparative Evaluation of Sequential Feature Selection Algorithms , 1995, AISTATS.

[34]  Yukihiro Ozaki,et al.  Near-infrared spectroscopy in food science and technology , 2007 .