Toponym Recognition in Scanned Color Topographic Maps

Topographic paper maps are a common medium for geographic information. This paper proposes an automatic approach, within document analysis of such maps, for extracting and recognizing toponyms. The technique is based on image segmentation and connected component processing. Successive filtering stages check that candidate characters and strings are plausible and consistent. The detected text areas are passed to OCR software, and the recognized words are then analyzed and corrected. The main advantage of the technique is that it makes no assumption about character font, size, or orientation. Experimental results are encouraging in terms of recognition efficiency.
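To make the pipeline more concrete, the following minimal Python sketch illustrates two of the stages named above: connected component extraction with a filtering step, and correction of an OCR result. It is not the authors' implementation. The function names, the 4-connectivity choice, the side-length thresholds, and the use of a place-name list with fuzzy matching are all assumptions made for illustration; the abstract does not specify the actual filtering criteria or the correction method.

```python
from collections import deque
from difflib import get_close_matches


def connected_components(mask):
    """Return bounding boxes (top, left, bottom, right) of the
    4-connected foreground regions of a binary mask (list of rows)."""
    rows = len(mask)
    cols = len(mask[0]) if mask else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def plausible_characters(boxes, min_side, max_side):
    """Keep boxes whose longest side lies in [min_side, max_side].
    Hypothetical criterion: it rejects specks and long line work."""
    kept = []
    for top, left, bottom, right in boxes:
        longest = max(bottom - top + 1, right - left + 1)
        if min_side <= longest <= max_side:
            kept.append((top, left, bottom, right))
    return kept


def correct_word(word, gazetteer, cutoff=0.7):
    """Replace an OCR result with the closest known place name, if any.
    The gazetteer and the cutoff value are assumptions of this sketch."""
    matches = get_close_matches(word, gazetteer, n=1, cutoff=cutoff)
    return matches[0] if matches else word


# Toy mask: one isolated pixel, one character-sized blob, one long line.
mask = [
    [1, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
]
candidates = plausible_characters(connected_components(mask), 2, 5)
corrected = correct_word("Grenob1e", ["Grenoble", "Lyon"])
```

In this toy example, the filter is meant to discard the isolated pixel and the long horizontal line and keep only the character-sized blob. A real system would apply such a step to a segmented color layer of the scanned map and would need criteria that tolerate varying character size and orientation, as the abstract requires.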
