Support Vector Machine and Artificial Neural Network Models for the Classification of Grapevine Varieties Using a Portable NIR Spectrophotometer

The identification of grapevine varieties, currently addressed using visual ampelometry, DNA analysis and, very recently, hyperspectral analysis under laboratory conditions, is an issue of great importance in the wine industry. This work presents support vector machine and artificial neural network modelling for grapevine varietal classification from in-field leaf spectroscopy. Modelling was attempted at two scales: site-specific and global. Spectral measurements were acquired in the near-infrared (NIR) range from 1600 to 2400 nm, under field conditions and non-destructively, using a portable spectrophotometer. For the site-specific approach, spectra were collected from the adaxial side of 400 individual leaves of 20 grapevine (Vitis vinifera L.) varieties one week after veraison. For the global model, two additional sets of spectra were collected one week before harvest from two different vineyards in another vintage, each consisting of 48 measurements from individual leaves of six varieties. Several combinations of spectral scatter correction and smoothing filters were studied. To train the models, support vector machines and artificial neural networks were employed, using the pre-processed spectra as input and the varieties as the classes. The pre-processing study showed that applying scatter correction had no influence on the results, and that Savitzky-Golay filtering with a second-order derivative and a window size of 5 yielded the best results. For the site-specific model, with 20 classes, the best classifier achieved an overall score of 87.25% of correctly classified samples. These results were compared, under the same conditions, with a model trained using partial least squares discriminant analysis, which performed worse in every case.
For the global model, a 6-class dataset comprising samples from three different vineyards, two years, and leaves measured at post-veraison and at harvest was also built, reaching 77.08% of correctly classified samples. These outcomes demonstrate that the method is reliable for fast, in-field, non-destructive grapevine varietal classification, at either a global or a site-specific scale, and that it could be very useful in viticulture and the wine industry.
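The pre-processing step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that the scatter correction is a standard normal variate (SNV) transform, that the Savitzky-Golay filter uses a polynomial order of 2 (the abstract gives only the derivative order and the window size of 5), and that each spectrum has 200 points. The function names and the synthetic spectra are hypothetical and serve only to make the example runnable.

```python
import numpy as np
from scipy.signal import savgol_filter


def snv(spectra):
    """Standard normal variate: centre and scale each spectrum (row) on its own.

    Assumption: SNV stands in for the scatter correction, which the abstract
    does not name.
    """
    spectra = np.asarray(spectra, dtype=float)
    mean = spectra.mean(axis=1, keepdims=True)
    std = spectra.std(axis=1, ddof=1, keepdims=True)
    return (spectra - mean) / std


def second_derivative(spectra, window=5, polyorder=2):
    """Savitzky-Golay second derivative along the wavelength axis.

    window=5 follows the abstract; polyorder=2 is an assumption.
    """
    return savgol_filter(
        np.asarray(spectra, dtype=float),
        window_length=window,
        polyorder=polyorder,
        deriv=2,
        axis=1,
    )


def preprocess(spectra, scatter_correction=True):
    """Optionally apply SNV, then the Savitzky-Golay second derivative."""
    if scatter_correction:
        spectra = snv(spectra)
    return second_derivative(spectra)


# Hypothetical data: 400 synthetic leaf spectra sampled from 1600 to 2400 nm.
rng = np.random.default_rng(0)
wavelengths = np.linspace(1600.0, 2400.0, 200)
baseline = np.exp(-((wavelengths - 1940.0) / 120.0) ** 2)
synthetic = baseline + 0.01 * rng.standard_normal((400, wavelengths.size))

features = preprocess(synthetic)  # one row of model inputs per leaf
```

The rows of `features` would then serve as inputs to a classifier such as a support vector machine or a neural network, with the variety of each leaf as the class label.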

[19]  Juan Fernández-Novales,et al.  Assessment of quality parameters in grapes during ripening using a miniature fiber-optic near-infrared spectrometer , 2009, International journal of food sciences and nutrition.

[20]  Pedro Melo-Pinto,et al.  Identification of grapevine varieties using leaf spectroscopy and partial least squares , 2013 .

[21]  H. Barrs,et al.  A Re-Examination of the Relative Turgidity Technique for Estimating Water Deficits in Leaves , 1962 .

[22]  M. Diago,et al.  Automatic discrimination of grapevine (Vitis vinifera L.) clones using leaf hyperspectral imaging and partial least squares , 2014, The Journal of Agricultural Science.

[23]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[24]  Xiaoli Li,et al.  Non-destructive discrimination of Chinese bayberry varieties using Vis/NIR spectroscopy , 2007 .

[25]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[26]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[27]  Jitendra Paliwal,et al.  Spectral Data Compression and Analyses Techniques to Discriminate Wheat Classes , 2006 .

[28]  J. Ibáñez,et al.  Genetic Study of Malvasia and Torrontes Groups through Molecular Markers , 2002, American Journal of Enology and Viticulture.

[29]  Y. Ying,et al.  On-site variety discrimination of tomato plant using visible-near infrared reflectance spectroscopy , 2009, Journal of Zhejiang University SCIENCE B.

[30]  Douglas Fernandes Barbin,et al.  Prediction of water and protein contents and quality classification of Spanish cooked ham using NIR hyperspectral imaging , 2013 .

[31]  Dolores Pérez-Marín,et al.  Non-destructive characterization and quality control of intact strawberries based on NIR spectral data , 2012 .

[32]  Dolores Pérez-Marín,et al.  Miniature handheld NIR sensor for the on-site non-destructive assessment of post-harvest quality and refrigerated storage behavior in plums , 2010 .

[33]  P. Galet A practical ampelography , 1979 .

[34]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[35]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[36]  A. Garrido-Varo,et al.  Evaluation of Pretreatment Strategies for Near-Infrared Spectroscopy Calibration Development of Unground and Ground Compound Feedingstuffs , 2006, Applied spectroscopy.

[37]  Ronald,et al.  Learning representations by backpropagating errors , 2004 .

[38]  S. Delwiche,et al.  The Effect of Spectral Pre-Treatments on the Partial Least Squares Modelling of Agricultural Products , 2004 .

[39]  Yong He,et al.  Identification of Different Varieties of Sesame Oil Using Near-Infrared Hyperspectral Imaging and Chemometrics Algorithms , 2014, PloS one.

[40]  Samuel Verdú,et al.  Detection of expired vacuum-packed smoked salmon based on PLS-DA method using hyperspectral images , 2013 .

[41]  Chu Zhang,et al.  Rice Seed Cultivar Identification Using Near-Infrared Hyperspectral Imaging and Multivariate Data Analysis , 2013, Sensors.