TV logo classification based on convolutional neural network

This paper presents region-based, convolutional neural network for accurate and efficient TV logo classification. Although many previous methods have been applied to TV logo classification, most of them only process the video images. For images captured by smartphone, these methods often can't achieve satisfactory performance. Three contributions are given to the problem of TV logo detection and classification. Firstly, maximally stable extremal region (MSER) is used as a method of generating candidate boxes per image. Geometry constraint is introduced to filter certain boxes which are obviously different from TV logo shape in geometry space. Secondly, we design an efficient Convolutional Neural Network (CNN) for classifying TV logo. Thirdly, we create a large TV logo dataset, all of which are collected from video streaming. Some augmentation methods have been taken to improve the diversity of TV logo images. Experimental results show that the proposed method outperformed previous methods.

[1]  Gueesang Lee,et al.  Robust Text Detection in Natural Scene Images , 2016, Australasian Conference on Artificial Intelligence.

[2]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[3]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[4]  Antonio Albiol,et al.  Detection of TV commercials , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Andreas Dengel,et al.  ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[8]  Hou Sheng Automatic detection and recognition of TV logos , 2013 .

[9]  Shi Ping,et al.  A method of TV Logo Recognition based on SIFT , 2013, ICMT 2013.

[10]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[12]  Pin Xu,et al.  Multiple feature fusion via hierarchical matching for TV logo recognition , 2015, 2015 8th International Congress on Image and Signal Processing (CISP).

[13]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[14]  Yuan Dong,et al.  Supervised TV logo detection based on SVMS , 2010, 2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content.