MFSR: Maximum feature score region-based captions locating in news video images

For news video images, caption recognizing is a useful and important step for content understanding. Caption locating is usually the first step of caption recognizing and this paper proposes a simple but effective caption locating algorithm called maximum feature score region (MFSR) based method, which mainly consists of two stages: In the first stage, up/down boundaries are attained by turning to edge map projection. Then, maximum feature score region is defined and left/right boundaries are achieved by utilizing MFSR. Experiments show that the proposed MFSR based method has superior and robust performance on news video images of different types.

[1]  Qifeng Liu,et al.  A new approach for text segmentation using a stroke filter , 2008, Signal Process..

[2]  Avideh Zakhor,et al.  Applications of Video-Content Analysis and Retrieval , 2002, IEEE Multim..

[3]  Ioannis Pratikakis,et al.  Multiresolution text detection in video frames , 2007, VISAPP.

[4]  Palaiahnakote Shivakumara,et al.  Accurate video text detection through classification of low and high contrast images , 2010, Pattern Recognit..

[5]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Xinbo Gao,et al.  Detection and recognition of text superimposed in images base on layered method , 2014, Neurocomputing.

[7]  Jean-Philippe Thiran,et al.  A localization/verification scheme for finding text in images and video frames based on contrast independent features and machine learning methods , 2004, Signal Process. Image Commun..

[8]  Akhil Sahai,et al.  Web E-Speak: Facilitating Web-Based E-Services , 2002, IEEE Multim..

[9]  Ioannis Pratikakis,et al.  A two-stage scheme for text detection in video images , 2010, Image Vis. Comput..

[10]  Palaiahnakote Shivakumara,et al.  Video text detection based on filters and edge features , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[11]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[12]  He Huang,et al.  A method of caption location and segmentation in news video , 2014, 2014 7th International Congress on Image and Signal Processing.

[13]  Jun Guo,et al.  Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[14]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[15]  Chang Hong Lin,et al.  A robust video text detection approach using SVM , 2012, Expert Syst. Appl..

[16]  Xinbo Gao,et al.  Chinese text location under complex background using Gabor filter and SVM , 2011, Neurocomputing.

[17]  Chunheng Wang,et al.  Scene text detection using graph model built upon maximally stable extremal regions , 2013, Pattern Recognit. Lett..

[18]  Wen Gao,et al.  Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[19]  Chitra Dorai,et al.  Automatic text extraction from video for content-based annotation and retrieval , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[20]  Datong Chen,et al.  Text enhancement with asymmetric filter for video OCR , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[21]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Jean-Marc Odobez,et al.  Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[23]  Qingshan Liu,et al.  Robust Text Detection in Natural Scenes Using Text Geometry and Visual Appearance , 2014, Int. J. Autom. Comput..