Recognizing Text Elements for SVG Comic Compression and Its Novel Applications

SVG (Scalable Vector Graphics) has become the standard format for 2D vector graphics in HTML5. Although several image-to-SVG conversion systems have been proposed, the files they produce are still large. In [1], we proposed a system that converts raster comic images into vector SVG files with a better compression ratio than previous methods. However, none of these methods processes the text in raster images. In this paper, we extend our system to recognize text elements in comics and use them to improve compression and enable novel applications. The proposed method uses SCW (sliding concentric windows) and an SVM (support vector machine) to identify text regions; OCR (optical character recognition) is then applied to recognize the text elements in those regions. Instead of encoding the text regions as vectors, the recognized text elements are embedded in the SVG file along with their coordinate values. Experimental results show that this reduces file sizes to about 52% of the original SVG files. With these text elements, comics can easily be translated into other languages to provide multilingual services, text/content-based image search can be supported efficiently, and a novel storytelling application becomes possible.
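The embedding step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that OCR has already produced a list of recognized strings with coordinates, and the names `build_svg`, `translate_svg_text`, and the dictionary keys `text`, `x`, `y`, and `font_size` are hypothetical. Each recognized string is written as an SVG `<text>` element at its coordinates rather than as vector paths, and a second helper shows how the same elements could be swapped for translated strings.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"
# Serialize SVG elements without an "ns0:" prefix.
ET.register_namespace("", SVG_NS)


def build_svg(width, height, text_items):
    """Build an SVG document holding one <text> element per recognized string.

    text_items is an assumed OCR output format: a list of dicts with
    "text", "x", "y" and an optional "font_size".
    """
    root = ET.Element(
        f"{{{SVG_NS}}}svg",
        {"width": str(width), "height": str(height)},
    )
    for item in text_items:
        node = ET.SubElement(
            root,
            f"{{{SVG_NS}}}text",
            {
                "x": str(item["x"]),
                "y": str(item["y"]),
                "font-size": str(item.get("font_size", 12)),
            },
        )
        # Store the characters themselves, not their outlines.
        node.text = item["text"]
    return ET.tostring(root, encoding="unicode")


def translate_svg_text(svg_string, translate):
    """Replace the content of every <text> element using a caller-supplied function."""
    root = ET.fromstring(svg_string)
    for node in root.iter(f"{{{SVG_NS}}}text"):
        node.text = translate(node.text)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    svg = build_svg(200, 100, [{"text": "Hello", "x": 10, "y": 20}])
    # A toy lookup table stands in for a real translation service.
    lookup = {"Hello": "Hola"}
    print(translate_svg_text(svg, lambda s: lookup.get(s, s)))
```

Because the text is stored as characters, the same `<text>` elements can also be indexed for text-based search without re-running OCR.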

[1] Shu-Yuan Chen, et al. Adaptive page segmentation for color technical journals' cover images, 1998, Image Vis. Comput.

[2] Ioannis Anagnostopoulos, et al. A License Plate-Recognition Algorithm for Intelligent Transportation System Applications, 2006, IEEE Transactions on Intelligent Transportation Systems.

[3] Andreas Neumann. Scalable Vector Graphics (SVG), 2008, Encyclopedia of GIS.

[4] Makoto Tanaka, et al. Text-Tracking Wearable Camera System for the Blind, 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5] Ray-I Chang, et al. An XML-Based Comic Image Compression, 2008, PCM.

[6] C. Peng, et al. Scalable Vector Graphics (SVG), 2000.

[7] Jung H. Kim, et al. English to Spanish translation of signboard images from mobile phone camera, 2009, IEEE Southeastcon 2009.

[8] Sebastiano Battiato, et al. SVG rendering of real images using data dependent triangulation, 2004, SCCG '04.

[9] Ralph R. Martin, et al. Vectorizing Cartoon Animations, 2009, IEEE Transactions on Visualization and Computer Graphics.

[10] Masakazu Iwamura, et al. Real-Time Retrieval for Images of Documents in Various Languages Using a Web Camera, 2009, 2009 10th International Conference on Document Analysis and Recognition.