News videos anchor person detection by shot clustering

In recent years, extensive research efforts have been dedicated to automatic news content analysis. In this paper, we propose a novel algorithm for anchorperson detection in news video sequences. In this method, the raw news videos are firstly split into shots by a four-threshold method, and the key frames are extracted from each shot. After that, the anchorperson detection is conducted from these key frames by using a clustering-based method based on a statistical distance of Pearson's correlation coefficient. To evaluate the effectiveness of the proposed method, we have conducted experiments on 10 news sequences. In these experiments, the proposed scheme achieves a recall of 0.96 and a precision of 0.97 for anchorperson detection.

[1]  Hao Wu,et al.  Anchor Shot Detection with Diverse Style Backgrounds Based on Spatial-Temporal Slice Analysis , 2010, MMM.

[2]  Sang-Kyun Kim,et al.  An Effective News Anchorperson Shot Detection Method Based on Adaptive Audio/Visual Model Generation , 2005, CIVR.

[3]  Sang Uk Lee,et al.  Efficient video indexing scheme for content-based retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[4]  Xinbo Gao,et al.  Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing , 2002, IEEE Trans. Circuits Syst. Video Technol..

[5]  N. Nikolaidis,et al.  Video shot detection and condensed representation. a review , 2006, IEEE Signal Processing Magazine.

[6]  Angelo Chianese,et al.  Foveated shot detection for video segmentation , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Mubarak Shah,et al.  Detection and representation of scenes in videos , 2005, IEEE Transactions on Multimedia.

[8]  Alberto Del Bimbo,et al.  Content-based indexing and retrieval of TV news , 2001, Pattern Recognit. Lett..

[9]  Yuting Su,et al.  Anchorperson Shot Detection in MPEG Domain , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[10]  Bingbing Ni,et al.  Assistive tagging: A survey of multimedia tagging with human-computer joint exploration , 2012, CSUR.

[11]  Chong-Wah Ngo,et al.  Video summarization and scene detection by graph modeling , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[13]  Dong Liu,et al.  Image retrieval with query-adaptive hashing , 2013, TOMCCAP.

[14]  Yue Gao,et al.  Cross-View Down/Up-Sampling Method for Multiview Depth Video Coding , 2012, IEEE Signal Processing Letters.

[15]  Yue Gao,et al.  Clip based video summarization and ranking , 2008, CIVR '08.

[16]  Wu Lingda Adaptive Method to Detect Anchorperson Shot in News Video , 2008 .

[17]  WangMeng,et al.  Active learning in multimedia annotation and retrieval , 2011 .

[18]  Ching Y. Suen,et al.  A Method of Combining Multiple Experts for the Recognition of Unconstrained Handwritten Numerals , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[20]  Ioannis Pitas,et al.  Temporal Video Segmentation by Graph Partitioning , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[21]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[22]  Jeho Nam,et al.  Detection of gradual transitions in video sequences using B-spline interpolation , 2005, IEEE Transactions on Multimedia.

[23]  Meng Wang,et al.  Unified Video Annotation via Multigraph Learning , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Guoliang Fan,et al.  Combined key-frame extraction and object-based video segmentation , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Nicu Sebe,et al.  Feature Selection for Multimedia Analysis by Sharing Information Among Multiple Tasks , 2013, IEEE Transactions on Multimedia.

[26]  R. Brunelli,et al.  A Survey on the Automatic Indexing of Video Data, , 1999, J. Vis. Commun. Image Represent..

[27]  Shin Satoh News video analysis based on identical shot detection , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[28]  Yue Gao,et al.  A video summarization tool using two-level redundancy detection for personal video recorders , 2008, IEEE Transactions on Consumer Electronics.

[29]  Yue Gao,et al.  THU-ICRC at rush summarization of TRECVID 2007 , 2007, TVS '07.

[30]  Tsung-Han Tsai,et al.  A robust shot change detection method for content-based retrieval , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[31]  Meng Wang,et al.  Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification , 2012, IEEE Transactions on Multimedia.

[32]  Hao Li,et al.  Weighted Block Matching-Based Anchor Shot Detection with Dynamic Background , 2009, ICIAR.

[33]  Mario Vento,et al.  Combining experts for anchorperson shot detection in news videos , 2004, Pattern Analysis and Applications.

[34]  Qi Tian,et al.  A unified framework for semantic shot classification in sports video , 2005, IEEE Trans. Multim..

[35]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[36]  Alan Hanjalic,et al.  Template-based detection of anchorperson shots in news programs , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[37]  Wen Gao,et al.  A Fast Anchor Shot Detection Algorithm on Compressed Video , 2001, IEEE Pacific Rim Conference on Multimedia.

[38]  Mario Vento,et al.  An Improved Algorithm for Anchor Shot Detection , 2005, ICIAP.

[39]  Wen Gao,et al.  Learning to Distribute Vocabulary Indexing for Scalable Visual Search , 2013, IEEE Transactions on Multimedia.

[40]  Zi Huang,et al.  Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis , 2013, IEEE Transactions on Multimedia.

[41]  Yuan Dong,et al.  Automatic and fast temporal segmentation for personalized news consuming , 2010, Information Systems Frontiers.

[42]  Yue Gao,et al.  Shot-based similarity measure for content-based video summarization , 2008, 2008 15th IEEE International Conference on Image Processing.

[43]  Yue Gao,et al.  Dynamic video summarization using two-level redundancy detection , 2009, Multimedia Tools and Applications.

[44]  Dong-Sik Jang,et al.  Gradual shot boundary detection using localized edge blocks , 2006, Multimedia Tools and Applications.

[45]  Qionghai Dai,et al.  Comparative Interactivity Analysis in Multiview Video Coding Schemes , 2010 .

[46]  Meng Wang,et al.  Active learning in multimedia annotation and retrieval: A survey , 2011, TIST.

[47]  Takeshi Mita,et al.  Joint Haar-like features for face detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[48]  Ramesh C. Jain,et al.  A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video , 2002, Pattern Recognit..

[49]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[50]  Georges Quénot,et al.  Automatic Story Segmentation for TV News Video Using Multiple Modalities , 2012, Int. J. Digit. Multim. Broadcast..

[51]  Joon-Min Gil,et al.  A unified scheme of shot boundary detection and anchor shot detection in news video story parsing , 2011, Multimedia Tools and Applications.

[52]  Edward J. Delp,et al.  The indexing of persons in news sequences using audio-visual data , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[53]  José Manuel Menéndez,et al.  A unified model for techniques on video-shot transition detection , 2005, IEEE Transactions on Multimedia.

[54]  Qi Tian,et al.  News sports video shot classification with sports play field and motion features , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[55]  Patrick P. K. Chan,et al.  L-GEM based RBFNN for news anchorperson detection with Dominant Color Descriptor , 2010, 2010 International Conference on Machine Learning and Cybernetics.

[56]  Bede Liu,et al.  Temporal segmentation of video using frame and histogram space , 2006, IEEE Trans. Multim..

[57]  T. Kanade,et al.  Color information for region segmentation , 1980 .

[58]  Mario Vento,et al.  An Unsupervised Algorithm for Anchor Shot Detection , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[59]  Shuwu Zhang,et al.  Anchor Shot Detection Based on Face Detection and SIFT: Anchor Shot Detection Based on Face Detection and SIFT , 2009 .

[60]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[61]  Hugh E. Williams,et al.  Gradual Transition Detection Using Average Frame Similarity , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[62]  Gao Xinbo,et al.  A graph-theoretical clustering based anchorperson shot detection for news video indexing , 2003, Proceedings Fifth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2003.

[63]  Divakar Yadav,et al.  Topical web crawling using weighted anchor text and web page change detection techniques , 2009 .

[64]  Yap-Peng Tan,et al.  An effective post-refinement method for shot boundary detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[65]  Mario Vento,et al.  A Multi-stage Approach for Anchor Shot Detection , 2006, SSPR/SPR.

[66]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.