Assessment of grapevine variety discrimination using stem hyperspectral data and AdaBoost of random weight neural networks

Abstract Grapevine variety discrimination is important for growers to be able to certify plants, however, this is not simple to do using conventional ampelometric methods or DNA based methods. The present work proposes, to the best of the authors' knowledge, for the first time, to differentiate grapevine varieties using only spectroscopic stem information. A total of 1200 measurements were gathered non-destructively in the field. The spectroscopic data was processed with a combination of AdaBoost and random weight neural networks (RWNN) which is still rare in scientific literature. Ten grapevine varieties, 5 red and 5 white, were used: Cabernet Sauvignon (CSvar), one of the most widely planted varieties worldwide, and nine Portuguese autochthonous varieties, including Touriga Franca and Touriga Nacional (TNvar) that are very important in Port wine production. Portugal is known to be one of the countries with the largest number of autochthonous varieties in the world. In general, the overall correct classification percentages were significantly better than the 10% level of random classification. True positive rates for a classifier separating ten varieties varied between 41.7 and 70.8%, depending on the variety. The false positive rate (FPR) varied between 2.5 and 7.3%. TNvar and CSvar measurements were correctly classified in 70.8 and 70% of cases with FPR of 5.3 and 3.3%, respectively. The algorithms AdaBoost of extreme learning machines, random vector functional link with pseudoinverse and no regularization, support vector machines and random forest did not provide better results than the AdaBoost of RWNN.

[1]  Ponnuthurai N. Suganthan,et al.  Random vector functional link network for short-term electricity load demand forecasting , 2016, Inf. Sci..

[2]  Javier Tardáguila,et al.  Data mining and non-invasive proximal sensing for precision viticulture , 2015, ECSA 2015.

[3]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[4]  Ping Li,et al.  Dynamic Adaboost ensemble extreme learning machine , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[5]  María-Paz Diago,et al.  Data Mining and NIR Spectroscopy in Viticulture: Applications for Plant Phenotyping under Field Conditions , 2016, Sensors.

[6]  Xiaoli Li,et al.  Non-destructive discrimination of Chinese bayberry varieties using Vis/NIR spectroscopy , 2007 .

[7]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[8]  Juan Fernández-Novales,et al.  Support Vector Machine and Artificial Neural Network Models for the Classification of Grapevine Varieties Using a Portable NIR Spectrophotometer , 2015, PloS one.

[9]  Yan-Ping Zhou,et al.  Rapid and non-destructive discrimination of tea varieties by near infrared diffuse reflection spectroscopy coupled with classification and regression trees , 2012 .

[10]  Zhixin Yang,et al.  A Robust AdaBoost.RT Based Ensemble Extreme Learning Machine , 2015 .

[11]  P. N. Suganthan,et al.  Benchmarking Ensemble Classifiers with Novel Co-Trained Kernal Ridge Regression and Random Vector Functional Link Ensembles [Research Frontier] , 2017, IEEE Computational Intelligence Magazine.

[12]  Mark D. McDonnell,et al.  Fast, Simple and Accurate Handwritten Digit Classification by Training Shallow Neural Network Classifiers with the ‘Extreme Learning Machine’ Algorithm , 2015, PloS one.

[13]  Jianping Yin,et al.  Boosting weighted ELM for imbalanced learning , 2014, Neurocomputing.

[14]  David Skibinski,et al.  Genetic markers , 1993, Nature.

[15]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[16]  Adi Ben-Israel,et al.  Generalized inverses: theory and applications , 1974 .

[17]  M. Diago,et al.  Automatic discrimination of grapevine (Vitis vinifera L.) clones using leaf hyperspectral imaging and partial least squares , 2014, The Journal of Agricultural Science.

[18]  Pedro Melo-Pinto,et al.  Identification of grapevine varieties using leaf spectroscopy and partial least squares , 2013 .

[19]  Zhi-Zhong Mao,et al.  An Ensemble ELM Based on Modified AdaBoost.RT Algorithm for Predicting the Temperature of Molten Steel in Ladle Furnace , 2010, IEEE Transactions on Automation Science and Engineering.

[20]  P. Filzmoser,et al.  Repeated double cross validation , 2009 .

[21]  Richard Simon,et al.  Bias in error estimation when using cross-validation for model selection , 2006, BMC Bioinformatics.

[22]  Silvia Arazuri,et al.  Maturity, Variety and Origin Determination in White Grapes (Vitis Vinifera L.) Using near Infrared Reflectance Technology , 2005 .

[23]  Shi-Miao Tan,et al.  Boosting partial least‐squares discriminant analysis with application to near infrared spectroscopic tea variety discrimination , 2012 .

[24]  Raid Saabni Ada-boosting Extreme learning machines for handwritten digit and digit strings recognition , 2015, 2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC).

[25]  Di Wu,et al.  Soluble solids content and pH prediction and varieties discrimination of grapes based on visible-near infrared spectroscopy , 2010 .

[26]  B. Javornik,et al.  Characterization of Grapevines by the Use of Genetic Markers , 2013 .

[27]  Haiqing Yang,et al.  Nondestructive Discrimination of Grape Seed Varieties Using UV-VIS-NIR Spectroscopy and Chemometrics , 2012 .

[28]  Dejan J. Sobajic,et al.  Learning and generalization characteristics of the random vector Functional-link net , 1994, Neurocomputing.

[29]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[30]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[31]  Yiqiang Chen,et al.  Weighted extreme learning machine for imbalance learning , 2013, Neurocomputing.

[32]  F. M. Lacar,et al.  Use of hyperspectral reflectance for discrimination between grape varieties , 2001, IGARSS 2001. Scanning the Present and Resolving the Future. Proceedings. IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217).

[33]  Use of Hyperspectral Reflectance for Discrimination between Grape Varieties , 2001 .

[34]  Sergios Theodoridis,et al.  Machine Learning: A Bayesian and Optimization Perspective , 2015 .

[35]  Hubert A.B. Te Braake,et al.  Random activation weight neural net (RAWN) for fast non-iterative training. , 1995 .

[36]  P. N. Suganthan,et al.  A comprehensive evaluation of random vector functional link networks , 2016, Inf. Sci..

[37]  Robert P. W. Duin,et al.  Feedforward neural networks with random weights , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[38]  Y. Ying,et al.  On-site variety discrimination of tomato plant using visible-near infrared reflectance spectroscopy , 2009, Journal of Zhejiang University SCIENCE B.