Size-Independent Caption Extraction for Korean Captions with Edge Connected Components

Captions include information which relates to the images. In order to obtain the information in the captions, text extraction methods from images have been developed. However, most existing methods can be applied to captions with a fixed height or stroke width using fixed pixel-size or block-size operators which are derived from morphological supposition. We propose an edge connected components based method that can extract Korean captions that are composed of various sizes and fonts. We analyze the properties of edge connected components embedding captions and build a decision tree which discriminates edge connected components which include captions from ones which do not. The images for the experiment are collected from broadcast programs such as documentaries and news programs which include captions with various heights and fonts. We evaluate our proposed method by comparing the performance of the latent caption area extraction. The experiment shows that the proposed method can efficiently extract various sizes of Korean captions.

[1]  Youngbin Im,et al.  Somewhat fuzzy irresolute continuous mappings , 2012, Int. J. Fuzzy Log. Intell. Syst..

[2]  Hyeran Byun,et al.  Text Extraction in Digital News Video Using Morphology , 2002, Document Analysis Systems.

[3]  Michael R. Lyu,et al.  A robust statistic method for classifying color polarity of video text , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[4]  Kyu-Heon Kim,et al.  A Stroke-Based Text Extraction Algorithm for Digital Videos , 2007 .

[5]  Lina J. Karam,et al.  Morphological text extraction from images , 2000, IEEE Trans. Image Process..

[6]  Jee-Hyong Lee,et al.  Connected Component-Based and Size-Independent Caption Extraction with Neural Networks , 2007 .

[7]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[8]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Edward K. Wong,et al.  A robust algorithm for text extraction in color video , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[10]  Eun Yi Kim,et al.  Automatic Text Extraction for Content-Based Image Indexing , 2004, PAKDD.

[11]  Jiaying He,et al.  Hybrid Chinese/English text identification in Web images , 2004, Third International Conference on Image and Graphics (ICIG'04).