Look over here

Picture subjects and text balloons are basic elements in comics, working together to propel the story forward. Japanese comics artists often use a carefully designed composition of subjects and balloons (generally referred to as panel elements) to provide a continuous and fluid reading experience. However, such a composition is hard to produce for people without the required experience and knowledge. In this paper, we propose an approach for novices to synthesize a composition of panel elements that can effectively guide the reader's attention to convey the story. Our primary contribution is a probabilistic graphical model that describes the relationships among the artist's guiding path, the panel elements, and the viewer's attention, and that can be effectively learned from a small set of existing manga pages. We show that the proposed approach can measurably improve the readability, visual appeal, and communication of the story of the resulting pages, as compared to an existing method. We also demonstrate that the proposed approach enables novice users to create higher-quality compositions in less time than with commercially available programs.
