An Active Contour Model for Speech Balloon Detection in Comics

Comic books constitute an important cultural heritage asset in many countries. Digitization combined with subsequent comic book understanding would enable a variety of new applications, including content-based retrieval and content retargeting. Document understanding in this domain is challenging because comics are semi-structured documents that combine semantically important graphical and textual parts, and few studies have addressed this problem. In this work, we detail a novel approach for localizing closed and non-closed speech balloons in scanned comic book pages, an essential step towards fully automatic comic book understanding. The approach is compared with existing methods for closed balloon localization from the literature, and results are presented.
