Fused Deep Convolutional Neural Networks Based on Voting Approach for Efficient Object Classification

Object classification has been one of the main tasks in computer vision. With the fast development of deep learning, its performance in image classification and object recognition has presented dramatic improvements. In this paper, we propose a new deep convolutional neural network (CNN) architecture for robust object classification. The proposed model is fused with three traditional CNN approaches, Densenet201, Resnet50, and our proposed residual CNN. The fused network architecture allows parallel processing of the multiple networks for keeping the system sped up. A single shot deep convolution network is trained as an object detector to generate all possible candidates of different object classes. The output of each neural network is representing a single vote that is used in the classification process. 3-to-1 voting criteria are applied in the final classification decision between the candidate object classes. Several experiments were conducted to evaluate the performance of the proposed network. The experimental results show that the proposed approach has better performance than the networks used in the fusion process when they act individually. It also has lower miss rates when compared to several states of art methodologies.

[1]  Almabrok E. Essa,et al.  A Leaf Recognition Approach to Plant Classification Using Machine Learning , 2018, NAECON 2018 - IEEE National Aerospace and Electronics Conference.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Nick G. Kingsbury,et al.  Dual-Tree wavelet scattering network with parametric log transformation for object classification , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[9]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11]  Russell C. Hardie,et al.  Recursive non-local means filter for video denoising , 2017, 2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS).

[12]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[13]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[14]  Chris Yakopcic,et al.  Inception recurrent convolutional neural network for object recognition , 2017, Mach. Vis. Appl..

[15]  Vijayan K. Asari,et al.  Histogram of Oriented Phase and Gradient (HOPG) Descriptor for Improved Pedestrian Detection , 2016 .

[16]  Thi-Thuy Nguyen,et al.  Aerial Image Semantic Segmentation Using Neural Search Network Architecture , 2018, MIWAI.

[17]  Vijayan K. Asari,et al.  Histogram of oriented phase (HOP): a new descriptor based on phase congruency , 2016, Commercial + Scientific Sensing and Imaging.

[18]  Е. А. Краснобаев Распознавание дорожных знаков на изображениях методом Speeded Up Robust Features (SURF) , 2013 .

[19]  Russell C. Hardie,et al.  Skin Lesion Segmentation and Classification for ISIC 2018 by Combining Deep CNN and Handcrafted Features , 2019, ArXiv.

[20]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[22]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[23]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).