New Fuzzy-Mass Based Features for Video Image Type Categorization

Due to the large variety of video type collections, it becomes difficult to achieve good text detection and recognition accuracy. We propose a new fuzzy-mass based method for classifying (categorizing) text frames from different types of video. For each frame of a video type, we formulate Fuzzy logic to identify straight and curved edge components from edge images. We then estimate mass locally and globally by drawing consecutive ellipses over edge images with respect to straight and curved edge components. Further, we extract features based on spatial proximity between centroid of classified straight/curved edge components and that of the whole image. This results local features. Next, the features are extracted for the whole image without ellipse drawing, which results in global features. The combination of both local and global features is then fed to an SVM classifier for video type classification. Experimental results on the proposed and existing classification methods show that the proposed classification outperforms the stat of art methods. Furthermore, experiments on before and after classification with several text detection and binarization methods show that the proposed classification is significant in improving text detection and recognition performance.

[1]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Kai Ming Ting,et al.  Mass estimation , 2012, Machine Learning.

[3]  Palaiahnakote Shivakumara,et al.  Graphics and Scene Text Classification in Video , 2014, 2014 22nd International Conference on Pattern Recognition.

[5]  Tatiana Novikova,et al.  Image Binarization for End-to-End Text Understanding in Natural Images , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[6]  Guoyou Wang,et al.  Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification , 2015, Pattern Recognit. Lett..

[7]  Tao Mei,et al.  Predicting Failing Queries in Video Search , 2014, IEEE Transactions on Multimedia.

[8]  Chew Lim Tan,et al.  Bayesian classifier for multi-oriented video text recognition system , 2015, Expert Syst. Appl..

[9]  Aditya Tiwari,et al.  Ticker text extraction from Bangla news videos , 2010, 2010 Annual IEEE India Conference (INDICON).

[10]  Palaiahnakote Shivakumara,et al.  A New Method for Arbitrarily-Oriented Text Detection in Video , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[11]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Palaiahnakote Shivakumara,et al.  Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[13]  Palaiahnakote Shivakumara,et al.  Video scene text frames categorization for text detection and recognition , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[14]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Zohreh Azimifar,et al.  Document image binarization using a discriminative structural classifier , 2015, Pattern Recognit. Lett..

[16]  Palaiahnakote Shivakumara,et al.  New Fourier-Statistical Features in RGB Space for Video Text Detection , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Nicholas R. Howe,et al.  Document binarization with automatic parameter tuning , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[18]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Shijian Lu,et al.  Multioriented Video Scene Text Detection Through Bayesian Classification and Boundary Growing , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Vladik Kreinovich,et al.  WHAT NON-LINEARITY TO CHOOSE? MATHEMATICAL FOUNDATIONS OF FUZZY CONTROL , 2008 .

[21]  Palaiahnakote Shivakumara,et al.  A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video , 2015, IEEE Transactions on Multimedia.

[22]  Zhixin Shi,et al.  A Two Level Algorithm for Text Detection in Natural Scene Images , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.