Trainable Co-Occurrence Activation Unit for Improving Convnet

A deep neural network is one of the promising approach to produce state-of-the-art performance on various fields such as pattern recognition and signal processing. While the network architecture is intensively studied, as to the network components, non-linear activation functions are the main subject of research in the literature. Most of the activation functions, such as a rectified linear unit (ReLU), operate on each of feature channels in an element-wise manner and thus can be regarded as extracting occurrence characteristics from the input feature map. In this paper, we propose a co-occurrence activation unit to work across feature channels by extending the element-wise activation function. In contrast to the original co-occurrence formulation applied to hand-crafted feature extraction methods, the proposed co-occurrence unit is trainable by a gradient-based optimization through back-propagation learning and exploits the co-occurrence relationships among the feature channels. The experimental results on image classification datasets show that the proposed co-occurrence activation unit embedded into various types of ConvNets favorably improve classification performance.

[1]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[3]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[4]  Yoshua Bengio,et al.  Maxout Networks , 2013, ICML.

[5]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[6]  Andrea Vedaldi,et al.  MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[7]  Tianqi Chen,et al.  Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.

[8]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[9]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[10]  Takeshi Mita,et al.  Discriminative Feature Co-Occurrence Selection for Object Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Takumi Kobayashi,et al.  Color image feature extraction using color index local auto-correlations , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[15]  Amnon Shashua,et al.  Deep SimNets , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[17]  Takio Kurita,et al.  A New Scheme for Practical Flexible and Intelligent Vision Systems , 1988, MVA.

[18]  Atsuto Maki,et al.  From generic to specific deep representations for visual recognition , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[19]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[20]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[22]  Takumi Kobayashi,et al.  Image Feature Extraction Using Gradient Local Auto-Correlations , 2008, ECCV.

[23]  Ze-Nian Li,et al.  Object Detection Using Generalization and Efficiency Balanced Co-Occurrence Features , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).