Scalable storyboards in handheld devices: applications and evaluation metrics

Summaries are an essential component of video retrieval and browsing systems. Most research in video summarization has focused on content analysis to obtain compact yet comprehensive representations of video items. However, important aspects such as how they can be effectively integrated in mobile interfaces and how to predict the quality and usability of the summaries have not been investigated. Conventional summaries are limited to a single instance with certain length (i.e. a single scale). In contrast, scalable summaries target representations with multiple scales, that is, a set of summaries with increasing length in which longer summaries include more information about the video. Thus, scalability provides high flexibility that can be exploited in devices such as smartphones or tablets to provide versions of the summary adapted to the limited visualization area. In this paper, we explore the application of scalable storyboards to summary adaptation and zoomable video navigation in handheld devices. By introducing a new adaptation dimension related with the summarization scale, we can formulate navigation and adaptation in a two-dimensional adaptation space, where different navigation actions modify the trajectory in that space. We also describe the challenges to evaluate scalable summaries and some usability issues that arise from having multiple scales, proposing some objective metrics that can provide useful insight about their potential quality and usability without requiring very costly user studies. Experimental results show a reasonable agreement with the trends shown in subjective evaluations. Experiments also show that content-based scalable storyboards are less redundant and useful than the content-blind baselines.

[1]  Jun Xin,et al.  Video Adaptation : Concepts , Technologies , and Open Issues , .

[2]  Bernard Mérialdo,et al.  VERT: automatic evaluation of video summaries , 2010, ACM Multimedia.

[3]  Gerald Friedland,et al.  Narrative theme navigation for sitcoms supported by fan-generated scripts , 2010, AIEMPro '10.

[4]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[5]  Marie-Francine Moens,et al.  Finding a needle in a haystack: an interactive video archive explorer for professional video searchers , 2011, Multimedia Tools and Applications.

[6]  Yi Yang,et al.  How Related Exemplars Help Complex Event Detection in Web Videos? , 2013, 2013 IEEE International Conference on Computer Vision.

[7]  Dimitrios Paris Darzentas,et al.  HiStory: a hierarchical storyboard interface design for video browsing on mobile devices , 2012, MUM.

[8]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  José María Martínez Sanchez,et al.  Scalable Comic-Like Video Summaries and Layout Disturbance , 2012, IEEE Transactions on Multimedia.

[10]  Jianping Fan,et al.  Hierarchical video content description and summarization using unified semantic and visual similarity , 2003, Multimedia Systems.

[11]  Jesús Bescós,et al.  Content-Driven Adaptation of On-Line Video , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[12]  Hongxun Yao,et al.  Flexible Presentation of Videos Based on Affective Content Analysis , 2013, MMM.

[13]  Klaus Schöffmann,et al.  3-D Interfaces to Improve the Performance of Visual Known-Item Search , 2014, IEEE Transactions on Multimedia.

[14]  Debargha Mukherjee,et al.  A framework for fully format-independent adaptation of scalable bit streams , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  José María Martínez Sanchez,et al.  A Framework for Scalable Summarization of Video , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Riccardo Leonardi,et al.  Statistical Skimming of Feature Films , 2010, Int. J. Digit. Multim. Broadcast..

[17]  R. Likert “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[18]  Yu Huang,et al.  Video summarization with semantic concept preservation , 2011, MUM.

[19]  S. Santini Who needs video summarization anyway , 2007 .

[20]  V. S. Subrahmanian,et al.  The priority curve algorithm for video summarization , 2006, Inf. Syst..

[21]  Robin Sibson,et al.  SLINK: An Optimally Efficient Algorithm for the Single-Link Cluster Method , 1973, Comput. J..

[22]  José María Martínez Sanchez,et al.  An integrated approach to summarization and adaptation using H.264/MPEG-4 SVC , 2009, Signal Process. Image Commun..

[23]  Ansgar Scherp,et al.  Survey on modeling and indexing events in multimedia , 2014, Multimedia Tools and Applications.

[24]  Ming-Ting Sun,et al.  Digital Video Transcoding , 2005, Proceedings of the IEEE.

[25]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.

[26]  Bhabatosh Chanda,et al.  Generation of size constrained video storyboard using spanning tree , 2009, ICIMCS '09.

[27]  Jens-Rainer Ohm,et al.  Advances in Scalable Video Coding , 2005, Proceedings of the IEEE.

[28]  Paul Over,et al.  The trecvid 2008 BBC rushes summarization evaluation , 2008, TVS '08.

[29]  Hongxun Yao,et al.  Video classification and recommendation based on affective analysis of viewers , 2013, Neurocomputing.

[30]  José María Martínez Sanchez,et al.  Generation of scalable summaries based on iterative GoP ranking , 2008, 2008 15th IEEE International Conference on Image Processing.

[31]  Fuchun Sun,et al.  Video key-frame extraction for smart phones , 2014, Multimedia Tools and Applications.

[32]  Jianping Fan,et al.  Exploring video content structure for hierarchical summarization , 2004, Multimedia Systems.

[33]  Xin Liu,et al.  Video summarization using singular value decomposition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[34]  Yu Sun,et al.  Video transcoding: an overview of various techniques and research issues , 2005, IEEE Transactions on Multimedia.

[35]  Kiyoharu Aizawa,et al.  Automatic trailer generation , 2010, ACM Multimedia.

[36]  Riccardo Leonardi,et al.  Extraction of Significant Video Summaries by Dendrogram Analysis , 2006, 2006 International Conference on Image Processing.

[37]  Li Zhuo,et al.  An iteratively reweighting algorithm for dynamic video summarization , 2014, Multimedia Tools and Applications.

[38]  Bernard Mérialdo,et al.  Rushes video summarization and evaluation , 2009, Multimedia Tools and Applications.

[39]  Harry W. Agius,et al.  Video summarisation: A conceptual framework and survey of the state of the art , 2008, J. Vis. Commun. Image Represent..

[40]  Gary Marchionini,et al.  The Open Video Digital Library: A Möbius strip of research and practice , 2006, J. Assoc. Inf. Sci. Technol..

[41]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[42]  Jiebo Luo,et al.  Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection , 2012, IEEE Transactions on Multimedia.

[43]  Gary Marchionini,et al.  The Open Video Digital Library: A Möbius strip of research and practice , 2006 .

[44]  Anthony Vetro,et al.  MPEG-21 digital item adaptation: enabling universal multimedia access , 2004, IEEE MultiMedia.

[45]  Paul Over,et al.  The trecvid 2007 BBC rushes summarization evaluation pilot , 2007, TVS '07.

[46]  Alberto Signoroni,et al.  State-of-the-Art and Trends in Scalable Video Compression With Wavelet-Based Approaches , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[47]  Klaus Schöffmann,et al.  A User-Centric Media Retrieval Competition: The Video Browser Showdown 2012-2014 , 2014, IEEE Multim..

[48]  Yelena Yesha,et al.  Keyframe-based video summarization using Delaunay clustering , 2006, International Journal on Digital Libraries.

[49]  José María Martínez Sanchez,et al.  Automatic evaluation of video summaries , 2012, TOMCCAP.

[50]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[51]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[52]  Yue Gao,et al.  Feature Correlation Hypergraph: Exploiting High-order Potentials for Multimodal Recognition , 2014, IEEE Transactions on Cybernetics.