Shallow convolutional neural network for image classification

Deep convolutional neural networks show great advantages in computer vision tasks, such as image classification and object detection. However, the networks have complex network structure which include a large number of layers such as convolutional layers and pooling layers. They greatly consume valuable computing and memory resources, and also hugely waste training time. Therefore, we propose a novel shallow convolutional neural network (SCNNB) to overcome the above limitations for image classification, which uses batch normalization techniques to accelerate training convergence and improve the accuracy. The SCNNB network has only 4 layers with small size of convolution kernels, which requires low time complexity and space complexity. In the experiments, we compare the SCNNB model with two variant models and the classical SCNN model on the two benchmark image datasets. Experimental results show that compared to SCNN model, the SCNNB model can quickly learn the features of the data and achieve the highest classification accuracy of 93.69% with 3.8 M time complexity on fashion-MNIST.

[1]  Yong Man Ro,et al.  Convolution with Logarithmic Filter Groups for Efficient Shallow CNN , 2017, MMM.

[2]  Roman Neruda,et al.  Asynchronous Evolution of Convolutional Networks , 2018, ITAT.

[3]  Gang Hua,et al.  Ordinal Regression with Multiple Output CNN for Age Estimation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Lu Lu,et al.  Shallow Convolutional Neural Networks for Acoustic Scene Classification , 2018, Wuhan University Journal of Natural Sciences.

[6]  Rahul Chauhan,et al.  Recognition of Handwritten Digits Using DNN, CNN, and RNN , 2018 .

[7]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[8]  Jian Sun,et al.  Convolutional neural networks at constrained time cost , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Dae-Ki Kang,et al.  Biased Dropout and Crossmap Dropout: Learning towards effective Dropout regularization in convolutional neural network , 2018, Neural Networks.

[10]  Jiebo Luo,et al.  Boundary-based Image Forgery Detection by Fast Shallow CNN , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[11]  Maheshkumar H. Kolekar,et al.  Classification of fashion article images using convolutional neural networks , 2017, 2017 Fourth International Conference on Image Information Processing (ICIIP).

[12]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[13]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[14]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[15]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[16]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Oleksii Gorokhovatskyi,et al.  Shallow Convolutional Neural Networks for Pattern Recognition Problems , 2018, 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP).

[19]  Kyung-shik Shin,et al.  Hierarchical convolutional neural networks for fashion image classification , 2019, Expert Syst. Appl..

[20]  Takio Kurita,et al.  Convolutional Neural Network with Discriminant Criterion for Input of Each Neuron in Output Layer , 2018, ICONIP.

[21]  Jianping Gou,et al.  Collaboratively Weighting Deep and Classic Representation via $l_2$ Regularization for Image Classification , 2018, ACML.