论文信息 - Effective Comic-Like Representations with Embedded Regions of Interest

Effective Comic-Like Representations with Embedded Regions of Interest

Comic-like summaries exploit the narrative structure of comics to create intuitive and easily readable abstracts. However, real comics use complex composition techniques which are difficult to mimic in an unsupervised way as they involve high level semantic understanding. This paper explores the use of visual attention analysis and face detection to embed regions of interest in adjacent images, obtaining more compact yet informative representations. This paper also addresses the generation of the layout, which involves combinatorial optimization problems. In practice, using exhaustive search to solve the problem is not feasible due to the large number of images. A split and merge approach is proposed to effectively address the layout problem, thus the limitations of finding solutions in a wide range of row widths can be avoided. A user study conducted on several episodes of TV series confirmed the utility of the proposed approach.

Luis Herranz | Shuqiang Jiang | Huiying Liu

[1] Janko Calic,et al. Efficient Layout of Comic-Like Video Summaries , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[2] R. Likert. “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[3] Andreas Girgensohn,et al. A fast layout algorithm for visual video summaries , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[4] Janko Calic,et al. Compact Visualisation of Video Summaries , 2007, EURASIP J. Adv. Signal Process..

[5] Delbert Dueck,et al. Clustering by Passing Messages Between Data Points , 2007, Science.

[6] N. Otsu. A threshold selection method from gray level histograms , 1979 .

[7] Boon-Lock Yeo,et al. Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[8] Scott McCloud. Understanding comics: the invisible art = Memahami komik / Scott McCloud; penerjemah S. Kinanti , 2001 .

[9] Meng Wang,et al. Movie2Comics: a feast of multimedia artwork , 2010, ACM Multimedia.

[10] Changsheng Xu,et al. A generic virtual content insertion system based on visual attention analysis , 2008, ACM Multimedia.

[11] Shingo Uchihashi,et al. Video Manga: generating semantically meaningful video summaries , 1999, MULTIMEDIA '99.