Comparing bags of features, conventional convolutional neural network and AlexNet for fruit recognition

This paper presents a comparative study between Bag of Features (BoF), Conventional Convolutional Neural Network (CNN) and Alexnet for fruit recognition.  Automatic fruit recognition can minimize human intervention in their fruit harvesting operations, operation time and harvesting cost.  On the other hand, this task is very challenging because of the similarities in shapes, colours and textures among various types of fruits. Thus, a robust technique that can produce good result is necessary. Due to the outstanding performance of deep learning like CNN and its pre-trained models like AlexNet in image recognition, this paper investigates the accuracy of conventional CNN, and Alexnet in recognizing thirty different types of fruits from a publicly available dataset.  Besides that, the recognition performance of BoF is also examined since it is one of the machine learning techniques that achieves good result in object recognition.   The experimental results indicate that all of these three techniques produce excellent recognition accuracy. Furthermore, conventional CNN achieves the fastest recognition result compared to BoF, and Alexnet.

[1]  Przemyslaw Prusinkiewicz,et al.  The use of plant models in deep learning: an application to leaf counting in rosette plants , 2018, Plant Methods.

[2]  Sae Hwang Bag-Of-Visual-Words Approach based on SURF Features to Polyp Detection in Wireless Capsule Endoscopy Videos , 2011 .

[3]  Nurbaity Sabri,et al.  Performance evaluation for vision-based vehicle classification using Convolutional Neural Network , 2018 .

[4]  Bruce A. Draper,et al.  Introduction to the Bag of Features Paradigm for Image Classification and Retrieval , 2011, ArXiv.

[5]  Nur Nabilah Abu Mangshor,et al.  Celebrity Face Recognition using Deep Learning , 2018, Indonesian Journal of Electrical Engineering and Computer Science.

[6]  Nurbaity Sabri,et al.  Evaluation of CNN, Alexnet and GoogleNet for Fruit Recognition , 2018, Indonesian Journal of Electrical Engineering and Computer Science.

[7]  K SoumyaGeorge,et al.  Text Classification by Augmenting Bag of Words (BOW) Representation with Co-occurrence Feature , 2014 .

[8]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[9]  Olarik Surinta,et al.  Comparative study between deep learning and bag of visual words for wild-animal recognition , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[10]  Nurbaity Sabri,et al.  Comparing Convolution Neural Network models for leaf recognition , 2018 .

[11]  Dino Isa,et al.  Palm Oil Fresh Fruit Bunch Ripeness Grading Recognition Using Convolutional Neural Network , 2018 .

[12]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  Nur Nabilah Abu Mangshor,et al.  Leaf Recognition using Texture Features for Herbal Plant Identification , 2018 .

[14]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[15]  Mihai Oltean,et al.  Fruit recognition from images using deep learning , 2017, Acta Universitatis Sapientiae, Informatica.

[16]  Bernhard Egger,et al.  Empirically Analyzing the Effect of Dataset Biases on Deep Face Recognition Systems , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[17]  Chougrad Hiba,et al.  Bag of Features Model Using the New Approaches: A Comprehensive Study , 2016 .

[18]  Ke Chen,et al.  Identity-aware convolutional neural networks for facial expression recognition , 2010 .

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.