Personalized video summarization based on Multi-Layered Probabilistic Latent Semantic Analysis with shared topics

In this paper, we propose a multi-layered Probabilistic Latent Semantic Analysis (PLSA) model for personalized video summarization problem based on time synchronous comments offered by multiple users. Preliminary evaluations performed on an animation series of 624 minutes long with 12212 users show that the proposed model is able to capture the relationships among the preference of each individual user and the various video events, therefore is able to generate personalized summaries of unseen videos for different users.

[1]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[2]  John Hannon,et al.  Personalized and automatic social summarization of events in video , 2011, IUI '11.

[3]  Paris Smaragdis,et al.  Collaborative audio enhancement using probabilistic latent component sharing , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Zhenglu Yang,et al.  Harnessing the wisdom of crowds: video event detection based on synchronous comments , 2011, WWW.

[5]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[6]  Thorsten Brants,et al.  Test Data Likelihood for PLSA Models , 2005, Information Retrieval.

[7]  Luís Carriço,et al.  Proceedings of the 2012 ACM international conference on Intelligent User Interfaces , 2012 .

[8]  Noboru Babaguchi,et al.  Event based indexing of broadcasted sports video by intermodal collaboration , 2002, IEEE Trans. Multim..

[9]  David A. Cohn,et al.  The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity , 2000, NIPS.

[10]  P. Herman,et al.  Eighth International Conference on Laser Ablation , 2007 .

[11]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[12]  Atsuo Yoshitaka,et al.  Personalized Video Summarization Based on Behavior of Viewer , 2012, 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems.

[13]  Paris Smaragdis,et al.  Missing data imputation for spectral audio signals , 2009, 2009 IEEE International Workshop on Machine Learning for Signal Processing.

[14]  Minje Kim,et al.  Nonnegative matrix partial co-factorization for drum source separation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Anoop Gupta,et al.  Auto-summarization of audio-video presentations , 1999, MULTIMEDIA '99.

[16]  Ching-Yung Lin,et al.  Personalized video summary using visual semantic annotations and automatic speech transcriptions , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[17]  Thomas Hofmann,et al.  Topic-based language models using EM , 1999, EUROSPEECH.

[18]  Qiang Yang,et al.  Topic-bridged PLSA for cross-domain text classification , 2008, SIGIR '08.

[19]  Tat-Seng Chua,et al.  Fusion of AV features and external information sources for event detection in team sports video , 2006, TOMCCAP.

[20]  Meng Wang,et al.  Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification , 2012, IEEE Transactions on Multimedia.

[21]  Harry W. Agius,et al.  Video summarisation: A conceptual framework and survey of the state of the art , 2008, J. Vis. Commun. Image Represent..

[22]  Deepayan Chakrabarti,et al.  Event Summarization Using Tweets , 2011, ICWSM.