Paper information - Multi-level anchorperson detection using multimodal association

Multi-level anchorperson detection using multimodal association

Contemporary TV news programs often use multi-level anchorpersons, which reflect the inherent hierarchical structure of a news program. However, these diverse anchorperson patterns cause conventional anchorperson detection algorithms to fail. In this paper, we propose a robust approach to anchorperson detection that integrates the visual modality, the auditory modality, and the human appearance modality into multimodal associated clustering. Based on the structure of the clustered multi-level anchorpersons, the table of contents (ToC) of a news video can be generated effectively. The effectiveness and robustness of the proposed approach are demonstrated by experiments on five hours of news programs from different TV channels.

HongJiang Zhang | Yu-Fei Ma | Dong-Jun Lan
