A unified scheme of shot boundary detection and anchor shot detection in news video story parsing

In this paper, we propose an efficient one-pass algorithm for shot boundary detection and a cost-effective anchor shot detection method with search space reduction, which together form a unified scheme for news video story parsing. First, we present the requirements for shot boundary detection from the perspective of news video story parsing, and propose a new shot boundary detection method that meets all of these requirements; it is based on singular value decomposition and a newly developed algorithm, Kernel-ART. Second, we propose a new anchor shot detection system, MASD, which detects anchorperson shots cost-effectively by reducing the search space. It applies, in sequence, a skin color detector, a face detector, and support vector data description with non-negative matrix factorization. Experimental results, together with a qualitative analysis, illustrate the efficiency of the proposed methods.
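
The search space reduction described for anchor shot detection can be illustrated with a minimal sketch of a sequential cascade, in which each stage only examines the candidates that survived the previous one. This is not the authors' implementation: the function `run_cascade`, the shot fields (`skin_ratio`, `has_face`, `anchor_score`), and all thresholds are hypothetical placeholders standing in for the skin color detector, the face detector, and the final support vector data description stage.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Shot = Dict[str, object]           # hypothetical per-shot record
Stage = Tuple[str, Callable[[Shot], bool]]


def run_cascade(shots: Sequence[Shot],
                stages: Sequence[Stage]) -> Tuple[List[Shot], List[Tuple[str, int]]]:
    """Apply the stages in order; return survivors and how many
    candidates each stage had to evaluate."""
    candidates = list(shots)
    evaluated = []
    for name, accept in stages:
        evaluated.append((name, len(candidates)))
        # Later (more expensive) stages see only the survivors.
        candidates = [s for s in candidates if accept(s)]
    return candidates, evaluated


# Toy data: the field values are made up for illustration only.
shots = [
    {"id": 0, "skin_ratio": 0.30, "has_face": True,  "anchor_score": 0.9},
    {"id": 1, "skin_ratio": 0.02, "has_face": False, "anchor_score": 0.1},
    {"id": 2, "skin_ratio": 0.25, "has_face": False, "anchor_score": 0.3},
    {"id": 3, "skin_ratio": 0.40, "has_face": True,  "anchor_score": 0.2},
    {"id": 4, "skin_ratio": 0.35, "has_face": True,  "anchor_score": 0.8},
]

# Placeholder predicates, ordered from cheapest to most expensive.
stages = [
    ("skin_color", lambda s: s["skin_ratio"] >= 0.10),   # assumed threshold
    ("face",       lambda s: bool(s["has_face"])),
    ("svdd_nmf",   lambda s: s["anchor_score"] >= 0.50),  # assumed threshold
]

anchors, evaluated = run_cascade(shots, stages)
anchor_ids = [s["id"] for s in anchors]
```

In this toy run the final, most expensive stage is evaluated on three of the five shots rather than all of them, which is the cost saving that ordering the stages from cheap to expensive is meant to provide.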
