Automatic comic page segmentation based on polygon detection

Comic page segmentation aims to automatically decompose scanned comic images into storyboards (frames), a key technique for producing digital comic documents that are suitable for reading on mobile devices. In this paper, we propose a novel method for comic page segmentation that finds the quadrilateral enclosing box of each storyboard. We first compute the edge image of the input comic image and then extract line segments with a heuristic line segment detection algorithm. We perform line clustering to merge overlapping line segments and remove redundant ones. Finally, we perform another round of line clustering and post-processing to compose the remaining line segments into complete quadrilateral enclosing boxes of the storyboards. The proposed method is tested on 2,237 comic images from 12 different printed comic series, and the experimental results demonstrate that our method is effective for comic image segmentation and outperforms existing methods.
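
The abstract does not state the criteria used for line clustering, so the following is only a minimal sketch of the merging step under assumed rules: two segments are merged when they are nearly parallel, lie close to the same line, and overlap (or nearly touch) along that line. The function names and the tolerances `angle_tol`, `dist_tol`, and `gap_tol` are hypothetical and are not taken from the paper.

```python
import math


def _direction(seg):
    """Unit direction vector of a non-degenerate segment (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = seg
    length = math.hypot(x2 - x1, y2 - y1)
    return (x2 - x1) / length, (y2 - y1) / length


def _try_merge(a, b, angle_tol, dist_tol, gap_tol):
    """Return the merged segment if a and b look redundant, else None."""
    ux, uy = _direction(a)
    vx, vy = _direction(b)
    # For unit vectors, |cross product| is the sine of the angle between them.
    if abs(ux * vy - uy * vx) > math.sin(angle_tol):
        return None

    ax, ay = a[0], a[1]

    def offset(px, py):
        # Perpendicular distance from (px, py) to the line through a.
        return abs((px - ax) * (-uy) + (py - ay) * ux)

    def along(px, py):
        # Signed position of (px, py) along the direction of a.
        return (px - ax) * ux + (py - ay) * uy

    if max(offset(b[0], b[1]), offset(b[2], b[3])) > dist_tol:
        return None

    ta = sorted((along(a[0], a[1]), along(a[2], a[3])))
    tb = sorted((along(b[0], b[1]), along(b[2], b[3])))
    # Reject if the two extents are separated by more than gap_tol.
    if tb[0] - ta[1] > gap_tol or ta[0] - tb[1] > gap_tol:
        return None

    lo, hi = min(ta[0], tb[0]), max(ta[1], tb[1])
    return (ax + lo * ux, ay + lo * uy, ax + hi * ux, ay + hi * uy)


def merge_segments(segments, angle_tol=math.radians(3), dist_tol=3.0, gap_tol=5.0):
    """Greedily merge redundant segments until no pair can be merged."""
    merged = [tuple(float(v) for v in s) for s in segments]
    changed = True
    while changed:
        changed = False
        for i in range(len(merged)):
            for j in range(i + 1, len(merged)):
                combined = _try_merge(merged[i], merged[j],
                                      angle_tol, dist_tol, gap_tol)
                if combined is not None:
                    merged[i] = combined
                    del merged[j]
                    changed = True
                    break
            if changed:
                break
    return merged


if __name__ == "__main__":
    # Two overlapping pieces of a top border, a bottom border, and a left border.
    detected = [(0, 0, 50, 0), (45, 1, 100, 1), (0, 100, 100, 100), (0, 0, 0, 100)]
    print(merge_segments(detected))
```

In this toy input the two near-collinear pieces of the top border collapse into a single segment, while the bottom and left borders are left untouched. A greedy pairwise merge is chosen here only for brevity; it is quadratic per pass and is not claimed to match the clustering used in the paper.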
