Apply SOM to Video Artificial Text Area Detection

Video artificial text detection is a challenging problem of pattern recognition. Current methods which are usually based on edge, texture, connected domain, feature or learning are always limited by size, location, language of artificial text in video. To solve the problems mentioned above, this paper applied SOM (Self-Organizing Map) based on supervised learning to video artificial text detection. First, text features were extracted. And considering the video artificial text's limitations mentioned, artificial text’s location and gradient of each pixel were used as the features which were used to classify. Then three layers supervised SOM was proposed to classify the text and non-text areas in video image. At last, the morphologic operating was used to get a much more accurate result of text area. Experiments showed that this method could locate and detect artificial text area in video efficiently.

[1]  Zhang Yi,et al.  Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network And CED , 2003 .

[2]  Martin T. Hagan,et al.  Neural network design , 1995 .

[3]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[4]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[5]  Korris Fu-Lai Chung,et al.  Hybrid Chinese/English text detection in images and video frames , 2002, Object recognition supported by user interaction for service robots.

[6]  Guangshe Chen,et al.  An Edge Detection Method Based on Optimized BP Neural Network , 2008, 2008 International Symposium on Information Science and Engineering.