Classification of specialty coffees using machine learning techniques

Specialty coffees are economically important, and their sensory quality is valued by both the productive sector and the market. Research is continually carried out in search of better blends, in order to add value and to differentiate prices according to product quality. To accomplish that, new methodologies must be explored that take into account the factors distinguishing each consumer and/or product. This article therefore proposes the use of machine learning techniques to build supervised classification and identification models. In a sensory evaluation test of consumer acceptance, four classes of specialty coffees were presented to four groups of trained and untrained consumers, who evaluated features such as flavor, body, sweetness and general grade. The use of machine learning is viable because it allows the classification and identification of specialty coffees produced at different altitudes and by different processing methods.
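As an illustration of what a supervised classification model over such sensory features might look like, the following is a minimal sketch only. It uses a nearest-centroid classifier, a technique chosen here for brevity and not taken from the article. The class labels, the 0 to 10 score scale, the class centers and all scores are synthetic placeholders, not data from the study.

```python
import random
from statistics import mean

# Feature names follow the abstract; the order is an assumption.
FEATURES = ("flavor", "body", "sweetness", "general_grade")


def make_synthetic_scores(n_per_class=30, seed=0):
    """Generate placeholder sensory scores; NOT data from the study."""
    rng = random.Random(seed)
    # Hypothetical class centers on an assumed 0-10 scale, one per coffee class.
    centers = {
        "class_A": (8.0, 7.5, 7.0, 8.0),
        "class_B": (6.0, 6.5, 5.5, 6.0),
        "class_C": (4.5, 5.0, 6.5, 5.0),
        "class_D": (3.0, 3.5, 3.0, 3.0),
    }
    rows = []
    for label, center in centers.items():
        for _ in range(n_per_class):
            scores = tuple(rng.gauss(c, 0.5) for c in center)
            rows.append((scores, label))
    rng.shuffle(rows)
    return rows


def fit_centroids(rows):
    """Compute the mean feature vector of each class."""
    by_label = {}
    for scores, label in rows:
        by_label.setdefault(label, []).append(scores)
    return {
        label: tuple(mean(column) for column in zip(*group))
        for label, group in by_label.items()
    }


def predict(centroids, scores):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    def squared_distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(scores, centroid))
    return min(centroids, key=lambda label: squared_distance(centroids[label]))


def holdout_accuracy(rows, train_fraction=0.7):
    """Fit on the first part of the shuffled rows, score on the remainder."""
    split = int(len(rows) * train_fraction)
    train, test = rows[:split], rows[split:]
    centroids = fit_centroids(train)
    hits = sum(predict(centroids, scores) == label for scores, label in test)
    return hits / len(test)


rows = make_synthetic_scores()
accuracy = holdout_accuracy(rows)
print(f"hold-out accuracy on synthetic data: {accuracy:.2f}")
```

The accuracy printed here reflects only how well separated the made-up class centers are; it says nothing about the results reported in the article.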
