Improving the Performance of CBIR Using XGBoost Classifier with Deep CNN-Based Feature Extraction

The main challenge faced by CBIR systems is the semantic gap, namely the existence of semantic differences between low-level pixel images captured by machines and high-level semantics perceived by humans. This is caused by the CBIR system which is very dependent on the feature extraction used. The success of deep learning techniques, especially Convolutional Neural Networks (CNN) in solving the problem of computer vision applications has inspired researchers to overcome the problem of the semantic gap. Until now, what is known by researchers, Deep CNN research for CBIR uses softmax and SVM classifiers. However, the XGBoost classifier achieves extraordinary performance even though compared to SVM. By applying the advantages of Deep CNN techniques and XGBoost classifier, the Deep CNN model is proposed as a feature extractor and XGBoost to form a classification model replacing the softmax and SVM classifier. From the proposed Deep CNN model, two Fully Connected (FC) layers (FC1 and FC2) are taken as feature representations. In this study, Features Vector (FV) taken from the Deep CNN model that have been produced are used as extractor features (FV.FC1 and FV.FC2). The performance of the Deep CNN for CBIR tasks generated using softmax, SVM, and XGBoost classifier observed. The performance evaluated is accuracy, precision, recall, and f1-score. Based on experimental results on the Wang, GHIM-10k, and Fruit-360 dataset, XGBoost classifier can increase accuracy, precision, recall, and f1-score in all datasets. The best performance of the CBIR system is using XGBoost classifier with the best feature extractor taken from FV.FC2. Whereas the FV.FC1 feature extractor does not produce significant performance when compared to the resources needed.

[1]  Roberto J. Bayardo,et al.  PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce , 2009, Proc. VLDB Endow..

[2]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  David Stutz,et al.  Neural Codes for Image Retrieval , 2015 .

[4]  Nikhil Ketkar,et al.  Deep Learning with Python , 2017 .

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Rehan Ashraf,et al.  Content Based Image Retrieval Using Embedded Neural Networks with Bandletized Regions , 2015, Entropy.

[7]  Jinyu Li,et al.  Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks. , 2013, ICLR 2013.

[8]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[9]  Mrs. P. Nalini Review on Content Based Image Retrieval : From Its Origin to the New Age , 2016 .

[10]  Giorgio Giacinto,et al.  Information fusion in content based image retrieval: A comprehensive overview , 2017, Inf. Fusion.

[11]  Geoffrey E. Hinton,et al.  Simplifying Neural Networks by Soft Weight-Sharing , 1992, Neural Computation.

[12]  Xiangliang Zhang,et al.  An up-to-date comparison of state-of-the-art classification algorithms , 2017, Expert Syst. Appl..

[13]  Pushparaja Murugan,et al.  Implementation of Deep Convolutional Neural Network in Multi-class Categorical Image Classification , 2018, ArXiv.

[14]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[15]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[16]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[17]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[18]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[19]  Qing Wang,et al.  Deep Convolutional Neural Networks for Classifying Body Constitution Based on Face Image , 2017, Comput. Math. Methods Medicine.

[20]  Jasman Pardede,et al.  Re-weighting Relevance Feedback in HSV Quantization for CBIR , 2018, 2018 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD).

[21]  Markus Maurer,et al.  Assessment of Deep Convolutional Neural Networks for Road Surface Classification , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[22]  Yann LeCun,et al.  Large-scale Learning with SVM and Convolutional for Generic Object Categorization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23]  Islam,et al.  A New Approach of Content Based Image Retrieval Using Color and Texture Features , 2017 .

[24]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[25]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[26]  Joseph K. Bradley,et al.  Yggdrasil: An Optimized System for Training Deep Decision Trees at Scale , 2016, NIPS.

[27]  Ola Spjuth,et al.  Transfer Learning with Deep Convolutional Neural Networks for Classifying Cellular Morphological Changes , 2018, bioRxiv.

[28]  Abdolreza Rashno,et al.  An efficient content-based image retrieval with ant colony optimization feature selection schema based on wavelet and color features , 2015, 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP).

[29]  Mihai Oltean,et al.  Fruit recognition from images using deep learning , 2017, Acta Universitatis Sapientiae, Informatica.

[30]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Jasman Pardede,et al.  Comparison of similarity measures in HSV quantization for CBIR , 2017, 2017 International Conference on Data and Software Engineering (ICoDSE).

[32]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[33]  Jasman Pardede,et al.  COMPARISON OF VSM, GVSM, AND LSI IN INFORMATION RETRIEVAL FOR INDONESIAN TEXT , 2016 .

[34]  Yang Wang,et al.  Malware Classification with Deep Convolutional Neural Networks , 2018, 2018 9th IFIP International Conference on New Technologies, Mobility and Security (NTMS).

[35]  Xingyuan Wang,et al.  A novel method for image retrieval based on structure elements' descriptor , 2013, J. Vis. Commun. Image Represent..

[36]  Jasman Pardede,et al.  Boosting-Based Relevance Feedback for CBIR , 2018, 2018 5th International Conference on Data and Software Engineering (ICoDSE).

[37]  Fabian Schilling,et al.  The Effect of Batch Normalization on Deep Convolutional Neural Networks , 2016 .

[38]  Jasman Pardede,et al.  SVM Relevance Feedback in HSV Quantization for CBIR , 2018, J. Comput..

[39]  Li Deng,et al.  A tutorial survey of architectures, algorithms, and applications for deep learning , 2014, APSIPA Transactions on Signal and Information Processing.

[40]  El Asnaoui Khalid,et al.  Content-Based Image Retrieval Using Convolutional Neural Networks , 2017 .

[41]  Inderjit S. Dhillon,et al.  Gradient Boosted Decision Trees for High Dimensional Sparse Output , 2017, ICML.

[42]  Rahib H Abiyev,et al.  Deep Convolutional Neural Networks for Chest Diseases Detection , 2018, Journal of healthcare engineering.

[43]  Yichuan Tang,et al.  Deep Learning using Linear Support Vector Machines , 2013, 1306.0239.

[44]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[45]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[46]  Hong Zhao,et al.  A deep convolutional neural network for classification of red blood cells in sickle cell anemia , 2017, PLoS Comput. Biol..

[47]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[48]  Jasman Pardede,et al.  Reduce Semantic Gap in Content-Based Image Retrieval , 2017 .

[49]  Tie-Yan Liu,et al.  A Communication-Efficient Parallel Algorithm for Decision Tree , 2016, NIPS.

[50]  P.manikandaprabhu S.nithya T.karthikeyan A Survey on Text and Content Based Image Retrieval System for Image Mining , 2014 .