Recent applications of multivariate data analysis methods in the authentication of rice and the most analyzed parameters: A review

ABSTRACT Rice is one of the most important staple foods around the world. Authentication of rice is one of the most addressed concerns in the present literature, which includes recognition of its geographical origin and variety, certification of organic rice and many other issues. Good results have been achieved by multivariate data analysis and data mining techniques when combined with specific parameters for ascertaining authenticity and many other useful characteristics of rice, such as quality, yield and others. This paper brings a review of the recent research projects on discrimination and authentication of rice using multivariate data analysis and data mining techniques. We found that data obtained from image processing, molecular and atomic spectroscopy, elemental fingerprinting, genetic markers, molecular content and others are promising sources of information regarding geographical origin, variety and other aspects of rice, being widely used combined with multivariate data analysis techniques. Principal component analysis and linear discriminant analysis are the preferred methods, but several other data classification techniques such as support vector machines, artificial neural networks and others are also frequently present in some studies and show high performance for discrimination of rice.

[1]  Dirk Thorleuchter,et al.  Predicting customer profitability during acquisition: Finding the optimal combination of data source and data mining technique , 2013, Expert Syst. Appl..

[2]  Saurabh Chaudhury,et al.  Efficient technique for rice grain classification using back-propagation neural network and wavelet decomposition , 2016, IET Comput. Vis..

[3]  Lorlyn Reidy,et al.  Elemental fingerprinting of soils using ICP-MS and multivariate statistics: a study for and by forensic chemistry majors. , 2013, Forensic science international.

[4]  Amit Yerpude,et al.  Classification of Basmati Rice Grain Variety using Image Processing and Principal Component Analysis , 2014, ArXiv.

[5]  Wei Wu,et al.  Assessing the relative importance of climate variables to rice yield variation using support vector machines , 2016, Theoretical and Applied Climatology.

[6]  Bruno Lemos Batista,et al.  Monitoring the authenticity of organic rice via chemometric analysis of elemental data , 2015 .

[7]  Margaret R. Karagas,et al.  Rice Consumption and Urinary Arsenic Concentrations in U.S. Children , 2012, Environmental health perspectives.

[8]  Akbar Montaser,et al.  Inductively coupled plasma mass spectrometry , 1998 .

[9]  Charles Elkan,et al.  Learning classifiers from only positive and unlabeled data , 2008, KDD.

[10]  Margaret R Karagas,et al.  Rice consumption contributes to arsenic exposure in US women , 2011, Proceedings of the National Academy of Sciences.

[11]  S. Muthayya,et al.  An overview of global rice production, supply, trade, and consumption , 2014, Annals of the New York Academy of Sciences.

[12]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[13]  Rommel M. Barbosa,et al.  Establishing chemical profiling for ecstasy tablets based on trace element levels and support vector machine , 2018, Neural Computing and Applications.

[14]  Ill-Min Chung,et al.  Geographic authentication of Asian rice (Oryza sativa L.) using multi-elemental and stable isotopic data combined with multivariate analysis. , 2018, Food chemistry.

[15]  Roman M. Balabin,et al.  Near-infrared (NIR) spectroscopy for motor oil classification: From discriminant analysis to support vector machines , 2011 .

[16]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[17]  Dennis Eichmann,et al.  Food Composition And Analysis , 2016 .

[18]  Rommel M. Barbosa,et al.  Comparative study of data mining techniques for the authentication of organic grape juice based on ICP-MS analysis , 2016, Expert Syst. Appl..

[19]  F. Barbosa,et al.  The use of advanced chemometric techniques and trace element levels for controlling the authenticity of organic coffee , 2014 .

[20]  J. M. Jurado,et al.  Recognition of the geographical origin of beer based on support vector machines applied to chemical descriptors , 2012 .

[21]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[22]  Peisheng Cong,et al.  Preliminary study on classification of rice and detection of paraffin in the adulterated samples by Raman spectroscopy combined with multivariate analysis. , 2013, Talanta.

[23]  Joanna Szpunar,et al.  Discrimination of geographical origin of rice based on multi-element fingerprinting by high resolution inductively coupled plasma mass spectrometry. , 2013, Food chemistry.

[24]  An Pan,et al.  White rice consumption and risk of type 2 diabetes: meta-analysis and systematic review , 2012, BMJ : British Medical Journal.

[25]  Rommel M. Barbosa,et al.  Classification of geographic origin of rice by data mining and inductively coupled plasma mass spectrometry , 2016, Comput. Electron. Agric..

[26]  J. Zupan,et al.  On the use of counterpropagation artificial neural networks to characterize Italian rice varieties , 2004 .

[27]  Zhen Zhu,et al.  Cluster Analysis on Japonica Rice (Oryza sativa L.) with Good Eating Quality Based on SSR Markers and Phenotypic Traits , 2010 .

[28]  Rommel M. Barbosa,et al.  Finding the Most Significant Elements for the Classification of Organic Orange Leaves: A Data Mining Approach , 2017 .

[29]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[30]  M. de la Guardia,et al.  Geographical traceability of “Arròs de Valencia” rice grain based on mineral element composition , 2011 .

[31]  Julie Upton,et al.  Rice consumption in the United States: recent evidence from food consumption surveys. , 2009, Journal of the American Dietetic Association.

[32]  R. Moench-Pfanner,et al.  Rice Fortification: An Emerging Opportunity to Contribute to the Elimination of Vitamin and Mineral Deficiency Worldwide , 2012, Food and nutrition bulletin.

[33]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[34]  Rommel M. Barbosa,et al.  Recognition of organic rice samples based on trace elements and support vector machines , 2016 .

[35]  L. Nunes,et al.  Profiling the ionome of rice and its use in discriminating geographical origins at the regional scale, China. , 2013, Journal of environmental sciences.

[36]  Phaiwan Pramai,et al.  Chemometric classification of pigmented rice varieties based on antioxidative properties in relation to color , 2016 .

[37]  Fernando Barbosa,et al.  Speciation of arsenic in rice and estimation of daily intake of different arsenic species by Brazilians through rice consumption. , 2011, Journal of hazardous materials.

[38]  Yan-Fu Kuo,et al.  Identifying rice grains using image analysis and sparse-representation-based classification , 2016, Comput. Electron. Agric..

[39]  Yong He,et al.  Quantification of Nitrogen Status in Rice by Least Squares Support Vector Machines and Reflectance Spectroscopy , 2009, Food and Bioprocess Technology.

[40]  Wei Liu,et al.  Application of terahertz spectroscopy imaging for discrimination of transgenic rice seeds with chemometrics. , 2016, Food chemistry.

[41]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[42]  Ill-Min Chung,et al.  Discrimination of geographical origin of rice (Oryza sativa L.) by multielement analysis using inductively coupled plasma atomic emission spectroscopy and multivariate analysis , 2015 .

[43]  Hoeil Chung,et al.  Enhanced Raman spectroscopic discrimination of the geographical origins of rice samples via transmission spectral collection through packed grains. , 2012, Talanta.

[44]  Detlef Günther,et al.  Elemental imaging and classifying rice grains by using laser ablation inductively coupled plasma mass spectrometry and linear discriminant analysis , 2016 .

[45]  Hu Zhengyi,et al.  1H NMR-based metabolomics for discrimination of rice from different geographical origins of China , 2017 .

[46]  Changyeun Mo,et al.  Combination of mass spectrometry-based targeted lipidomics and supervised machine learning algorithms in detecting adulterated admixtures of white rice. , 2017, Food research international.

[47]  Sanghoon Ko,et al.  Classification of rice cultivars based on cluster analysis of hydration and pasting properties of their starches , 2012 .

[48]  Francesco Cubadda,et al.  Chapter 19 – Inductively coupled plasma mass spectrometry , 2007 .

[49]  Zhiwei Zhu,et al.  Classification of Rice by Combining Electronic Tongue and Nose , 2015, Food Analytical Methods.

[50]  Feng Shi,et al.  Support vector machine method on predicting resistance gene against Xanthomonas oryzae pv. oryzae in rice , 2010, Expert Syst. Appl..

[51]  Wei Zheng,et al.  Classification of colonic tissues using near-infrared Raman spectroscopy and support vector machines. , 2008, International journal of oncology.

[52]  Tibérius O. Bonates,et al.  Multi-element determination in Brazilian honey samples by inductively coupled plasma mass spectrometry and estimation of geographic origin with data mining techniques , 2012 .

[53]  Saeid Minaei,et al.  Qualitative classification of milled rice grains using computer vision and metaheuristic techniques , 2015, Journal of Food Science and Technology.

[54]  Sharifuddin M. Zain,et al.  Milk authentication and discrimination via metal content clustering – A case of comparing milk from Malaysia and selected countries of the world , 2016 .

[55]  Wei Gong,et al.  Monitoring of Paddy Rice Varieties Based on the Combination of the Laser-Induced Fluorescence and Multivariate Analysis , 2017, Food Analytical Methods.

[56]  Vincent Baeten,et al.  Combination of support vector machines (SVM) and near‐infrared (NIR) imaging spectroscopy for the detection of meat and bone meal (MBM) in compound feeds , 2004 .

[57]  Fernando Barbosa,et al.  A simple and practical control of the authenticity of organic sugarcane samples based on the use of machine-learning algorithms and trace elements determination by inductively coupled plasma mass spectrometry. , 2015, Food chemistry.

[58]  Mohamad Rafi,et al.  Discrimination of red and white rice bran from Indonesia using HPLC fingerprint analysis combined with chemometrics. , 2017, Food chemistry.