A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition

With the rapid growth of the number of TV channels, the internet and online information services, more and more information becomes available and accessible. The digitization enhances preservation of records and makes the access to documents easier. However, when the quantity of documents become important the digitalization is not enough to ensure an efficient access. Indeed, we need to extract semantic information to help users to find what we need quickly. The text included in video sequences is highly needed for indexing and searching system. However, this text is difficult to detect and recognize because of the variability of its size, low resolution characters and the complexity of the backgrounds. To resolve these shortcomings, we propose a two task system: As a first step, we extract the textual information from video sequences and second, we recognize this text. Our system is tested on a diverse database composed of several Arabic news broadcast. The obtained results are encouraging and prove the qualities of our approach.

[1]  Walid Mahdi,et al.  Improving the Spatial-Temporal Clue Based Segmentation by the Use of Rhythm , 1998, ECDL.

[2]  Adel M. Alimi,et al.  Toward an interactive device for quick news story browsing , 2008, 2008 19th International Conference on Pattern Recognition.

[3]  Xinbo Gao,et al.  Automatic News Video Caption Extraction and Recognition , 2000, IDEAL.

[4]  Edward M. Riseman,et al.  TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[6]  Atreyi Kankanhalli,et al.  Automatic Extraction of Characters in Complex Scene Images , 1995, Int. J. Pattern Recognit. Artif. Intell..

[7]  Nevenka Dimitrova,et al.  Text detection for video analysis , 1999, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99).

[8]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.

[9]  Rainer Lienhart,et al.  Automatic text recognition in digital videos , 1995, Electronic Imaging.

[10]  C. Garcia,et al.  Text detection and segmentation in complex color images , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[11]  Nevenka Dimitrova,et al.  Multi-layered videotext extraction method , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[12]  Adel M. Alimi,et al.  Indexing Video Summaries for Quick Video Browsing , 2010, Pervasive Computing, Innovations in Intelligent Multimedia and Applications.

[13]  Xian-Sheng Hua,et al.  Automatic location of text in video frames , 2001, MULTIMEDIA '01.

[14]  Adel M. Alimi,et al.  Detection and extraction of the text in a video sequence , 2005, 2005 12th IEEE International Conference on Electronics, Circuits and Systems.