Restricted stochastic pooling for convolutional neural network

Deep learning has been demonstrated to be a universal and successful tool for classification, detection and segmentation. Convolutional neural network (CNN), as a representative model, achieves excellent performance in many image process areas due to its special structural design---shared weights, local receptive field and pooling layer. Large efforts are devoted to studying and improving model performance based on these three parts. In this paper, we proposed a novel pooling layer named restricted stochastic pooling layer. Restricted stochastic pooling randomly select activation from the top n maximum of each pooling region instead of the largest one (max pooling), or stochastic one (stochastic pooling). Our proposed method can not only obtain representative activation but also add randomness to the model. Through large amounts of experiments, we achieve good performance on SVHN and CIFAR-10 which demonstrates the effectiveness of restricted stochastic pooling.

[1]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[2]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Rob Fergus,et al.  Stochastic Pooling for Regularization of Deep Convolutional Neural Networks , 2013, ICLR.

[5]  Konstantinos Kamnitsas,et al.  Efficient multi‐scale 3D CNN with fully connected CRF for accurate brain lesion segmentation , 2016, Medical Image Anal..

[6]  Yiannis Aloimonos,et al.  LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning , 2016, ACM Multimedia.

[7]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[8]  Nikos Komodakis,et al.  Object Detection via a Multi-region and Semantic Segmentation-Aware CNN Model , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  S. Thrun,et al.  Corrigendum: Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[10]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[11]  Jean Ponce,et al.  Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.