A Novel Fused Random Pooling Method for Convolutional Neural Network to Improve Image Classification Accuracy

A convolutional neural network (CNN) is a type of artificial neural network used in image recognition and processing. A CNN comprises an input layer, an output layer, and hidden layers that include multiple convolutional layers, pooling layers, and fully connected layers. However, CNNs are prone to overfitting, which occurs when a model learns the detail and noise in the training data to the extent that its performance on new data suffers. In this study, a new pooling technique called fused random pooling replaces the deterministic pooling of CNNs with a random approach to choosing activations, creating better pooled feature maps. Overfitting is reduced through this regularization and through hyper-parameter tuning, which defines the network's best parameters such as the number of layers, filter size, and optimization algorithm. Compared with studies of various CNN pooling methods, a decrease in training error and an increase in accuracy were realized on the CIFAR-10, CIFAR-100, and Street View House Numbers (SVHN) datasets. On CIFAR-10, the test error and accuracy were 4.11 percent and 92.37 percent, respectively. On CIFAR-100, a 17.76 percent training error and 69.19 percent accuracy were realized. On SVHN, fused random pooling achieved a training error of 3.10 percent and an accuracy of 96.90 percent. Therefore, fused random pooling proved effective in decreasing training error and increasing accuracy.
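The abstract does not spell out how fused random pooling selects or fuses activations, so the following is only a minimal sketch of the general idea of replacing deterministic pooling with a random choice of activation. It assumes non-overlapping windows and a uniform random pick per window; the function name `random_pool2d` and its parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def random_pool2d(x, size=2, rng=None):
    """Pool a 2-D feature map by keeping one randomly chosen activation
    from each non-overlapping size x size window.

    Assumption: the choice is uniform over the window. This illustrates
    random pooling in general, not the paper's fused random pooling.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w = x.shape
    h_out, w_out = h // size, w // size
    # Crop so the map tiles evenly, then flatten each window into the last axis.
    windows = (
        x[: h_out * size, : w_out * size]
        .reshape(h_out, size, w_out, size)
        .transpose(0, 2, 1, 3)
        .reshape(h_out, w_out, size * size)
    )
    # Draw one random position per window and gather the activation there.
    idx = rng.integers(0, size * size, size=(h_out, w_out))
    return np.take_along_axis(windows, idx[..., None], axis=-1)[..., 0]


feature_map = np.arange(16, dtype=float).reshape(4, 4)
pooled = random_pool2d(feature_map, size=2, rng=np.random.default_rng(0))
```

Because a different activation can be kept on each forward pass, the pooled map varies during training, which is the source of the regularizing effect described above. Max pooling, by contrast, would always keep the largest value in each window.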
