Analysis of distance metric variations in KNN for agarwood oil compounds differentiation

This paper analyzes how the choice of distance metric affects k-Nearest Neighbor (KNN) classification of agarwood oil compounds. The work involved developing KNN models while varying the distance metric. The inputs are the abundances (%) of agarwood oil compounds, and the output is the agarwood oil quality, either high or low. The data were divided into training and testing datasets in a ratio of 80% to 20%, respectively. The training dataset was used to develop KNN models for K = 1 through K = 5, and the testing dataset was used to test the developed models. During training, the distance metric was varied among Euclidean, City-block, Cosine, and Correlation, and the performance of each metric was recorded and observed. All analyses were performed in MATLAB version R2014b. The results showed that, among the four distance metrics, Euclidean and City-block yielded 100% accuracy on both the training and testing datasets, followed by Cosine and Correlation with 89.5% accuracy. Overall, the accuracy of every distance metric was above 80%, indicating a good KNN model. This finding demonstrates the capability of KNN to differentiate agarwood oil compounds into high and low quality. The results of this study are important and contribute to further research on agarwood oil grading systems.
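
The procedure summarized above (an 80%/20% split, K from 1 to 5, and four distance metrics) can be illustrated with a minimal sketch. It is not the authors' MATLAB implementation: it is written in plain Python, and the data are synthetic. The four "compound abundance" features, the cluster centres, the sample count, and every function name are assumptions made only for illustration. Cosine and Correlation distances are taken as one minus the cosine similarity and one minus the Pearson correlation, which is the usual convention.

```python
import math
import random
from collections import Counter


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cityblock(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine(a, b):
    # One minus the cosine of the angle between the two vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def correlation(a, b):
    # Cosine distance after centring each vector on its own mean.
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine([x - mean_a for x in a], [y - mean_b for y in b])


METRICS = {
    "euclidean": euclidean,
    "cityblock": cityblock,
    "cosine": cosine,
    "correlation": correlation,
}


def knn_predict(train_x, train_y, query, k, metric):
    # Rank training samples by distance to the query, then take a majority
    # vote among the k nearest; ties go to the label seen first in the ranking.
    dist = METRICS[metric]
    ranked = sorted(zip(train_x, train_y), key=lambda pair: dist(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def accuracy(train_x, train_y, eval_x, eval_y, k, metric):
    hits = sum(
        knn_predict(train_x, train_y, x, k, metric) == y
        for x, y in zip(eval_x, eval_y)
    )
    return hits / len(eval_x)


def make_synthetic_data(n_per_class=20, seed=0):
    # Hypothetical abundances (%) of four compounds; NOT the paper's data.
    rng = random.Random(seed)
    centres = {"high": [8.0, 6.0, 1.0, 1.0], "low": [1.0, 1.0, 8.0, 6.0]}
    samples = []
    for label, centre in centres.items():
        for _ in range(n_per_class):
            features = [max(0.01, rng.gauss(c, 0.5)) for c in centre]
            samples.append((features, label))
    rng.shuffle(samples)
    return samples


samples = make_synthetic_data()
split = int(0.8 * len(samples))  # 80% training, 20% testing
train_x = [s[0] for s in samples[:split]]
train_y = [s[1] for s in samples[:split]]
test_x = [s[0] for s in samples[split:]]
test_y = [s[1] for s in samples[split:]]

# (metric, K) -> (training accuracy, testing accuracy)
results = {}
for metric in METRICS:
    for k in range(1, 6):
        results[(metric, k)] = (
            accuracy(train_x, train_y, train_x, train_y, k, metric),
            accuracy(train_x, train_y, test_x, test_y, k, metric),
        )

for (metric, k), (train_acc, test_acc) in results.items():
    print(f"{metric:12s} K={k}  train={train_acc:.3f}  test={test_acc:.3f}")
```

Because the synthetic classes are well separated by construction, the accuracies printed by this sketch say nothing about the figures reported above; the sketch only shows where the distance metric enters the KNN decision.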
