A System to Retrieve Text / Symbols from Color Maps using Connected Component and Skeleton Analysis

Automatic separation of Text and Graphic Symbols in document images is one of the fundamental aims in Graphics Recognition. In Maps, separation of Text and Graphic Symbols involves many challenges because the text and symbol frequently touches/overlaps to the long lines of street, river, border of the regions etc. of the maps with similar color. Sometimes the colors in a simple character are gradually distributed which adds extra difficulty in the problem. In this paper we proposed a system to retrieve text/symbols from maps. Here, at first, we separate the maps into different layers according to color features and then connected component features and skeleton information are used to identify text characters, symbols from graphics on the basis of their geometrical features. From the experiment we obtained encouraging results.

[1]  Rabab Kreidieh Ward,et al.  A Rotation Invariant Rule-Based Thinning Algorithm for Character Recognition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Bart Lamiroy,et al.  Text/Graphics Separation Revisited , 2002, Document Analysis Systems.

[3]  María Vanrell,et al.  Topological Histogram Reduction Towards Colour Segmentation , 2007, IbPRIA.

[4]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Bhabatosh Chanda,et al.  Extraction and recognition of geographical features from paper maps , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[6]  Chew Lim Tan,et al.  Text/Graphics Separation in Maps , 2001, GREC.

[7]  Joan Serrat,et al.  Multilocal Creaseness Based on the Level-Set Extrinsic Curvature , 2000, Comput. Vis. Image Underst..

[8]  Joan Serrat,et al.  Evaluation of Methods for Ridge and Valley Detection , 1999, IEEE Trans. Pattern Anal. Mach. Intell..