Unsupervised anchorpersons differentiation in news video

The automatic extraction of video structure from content is of key importance to enable a variety of multimedia services that span from search and retrieval to content manipulation. An unsupervised independent unimodal clustering method for anchorpersons detection and differentiation in newscasts is presented in this paper. The algorithm exploits audio, frame and face information to identify major cast in the content. These three components are first processed independently during the cluster analysis and then jointly in a compositional mining phase. A differentiation of the role played by the people in the video has been implemented exploiting the temporal characteristics of the detected anchorpersons. Experiments show significant precision/recall results thus opening further research directions in video analysis, particularly when the content is highly structured as in TV newscasts.

[1]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[2]  Xinbo Gao,et al.  Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing , 2002, IEEE Trans. Circuits Syst. Video Technol..

[3]  Chin-Hui Lee,et al.  Unsupervised anchor shot detection using multi-modal spectral clustering , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[5]  Yiannis S. Boutalis,et al.  Selection of the proper Compact Composite Descriptor for improving content based image retrieval , 2009 .

[6]  Hao Wu,et al.  Anchor Shot Detection with Diverse Style Backgrounds Based on Spatial-Temporal Slice Analysis , 2010, MMM.

[7]  Ramesh C. Jain,et al.  Automatic Person Annotation of Family Photo Album , 2006, CIVR.

[8]  Mattia Broilo,et al.  Unsupervised event segmentation of news content with multimodal cues , 2010, AIEMPro '10.

[9]  Liu An-an News Anchorperson Detection Algorithm Based on Spatio-Temporal Slice , 2008 .

[10]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Sheng Tang,et al.  A Novel Anchorperson Detection Algorithm Based on Spatio-temporal Slice , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[12]  Winston H. Hsu,et al.  Anchor Shot Detection in TRECVID-2005 Broadcast News Videos , 2006 .

[13]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14]  Zhu Liu,et al.  Major Cast Detection in Video Using Both Speaker and Face Information , 2007, IEEE Transactions on Multimedia.

[15]  David C. Gibbon,et al.  A Fast, Comprehensive Shot Boundary Determination System , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[16]  Mario Vento,et al.  An Unsupervised Algorithm for Anchor Shot Detection , 2006, 18th International Conference on Pattern Recognition (ICPR'06).