Detecting Text in Manga Using Stroke Width Transform

The Japanese comic-book style known as manga is becoming a popular topic for researchers. This paper focuses on the problem of detecting text regions in manga pages. Because it is time-consuming and laborious to identify the text regions in images manually, an automatic approach is highly desirable. Here, we propose a new text-detection method for manga using a Stroke Width Transform (SWT) technique in conjunction with a Support Vector Machine (SVM). Conventional SWT-based text-detection techniques perform poorly with manga because both text and non-text objects have similar characteristics for strokes, lines, and shapes. To better suit manga, we propose modifying the rules for finding letter candidates, which improves the ability to capture text. An SVM is then used to classify image patches into letter and nonletter regions. We compared our proposed framework with a conventional framework and other text-detection methods including deep-learning techniques. In the results, our proposed method achieved the highest F-measure of 0.506.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Kiyoharu Aizawa,et al.  Object Detection for Comics using Manga109 Annotations , 2018, ArXiv.

[3]  Kiyoharu Aizawa,et al.  A layered method for determining manga text bubble reading order , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[4]  Kohei Arai,et al.  Manga content extraction method for automatic mobile comic content creation , 2013, 2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[5]  William T. Freeman,et al.  Orientation Histograms for Hand Gesture Recognition , 1995 .

[6]  Motoi Iwata,et al.  Semi-automatic Text and Graphics Extraction of Manga Using Eye Tracking Information , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[7]  Kiyoharu Aizawa,et al.  Text detection in manga by combining connected-component-based and region-based classifications , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[8]  Joost van de Weijer,et al.  Automatic Text Localisation in Scanned Comic Books , 2013, VISAPP.

[9]  Kiyoharu Aizawa,et al.  DrawFromDrawings: 2D Drawing Assistance via Stroke Interpolation with a Sketch Database , 2017, IEEE Transactions on Visualization and Computer Graphics.

[10]  Kiyoharu Aizawa,et al.  Interactive segmentation for manga using lossless thinning and coarse labeling , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[11]  Jon Almazán,et al.  ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[12]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Dimosthenis Karatzas,et al.  Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  Rynson W. H. Lau,et al.  A Robust Panel Extraction Method for Manga , 2014, ACM Multimedia.

[15]  Kiyoharu Aizawa,et al.  Sketch-based manga retrieval using manga109 dataset , 2015, Multimedia Tools and Applications.

[16]  Hiroshi Watanabe,et al.  A study on object detection method from manga images using CNN , 2018, 2018 International Workshop on Advanced Image Technology (IWAIT).

[17]  Yusuke Matsui,et al.  Illustration2Vec: a semantic vector representation of illustrations , 2015, SIGGRAPH Asia Technical Briefs.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Xueting Liu,et al.  Text-aware balloon extraction from manga , 2015, The Visual Computer.

[20]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.