Shot Boundary Determination on MPEC Compressed Domain and Story Segmentation Experiments for TRECVID 2003

1. Briefly, what approach or combination of approaches did you test in each of your submitted runs? 1_kddi_ss_base1_5: “Baseline” method based on SVM, which discriminates shots that contain story boundaries. 1_kddi_ss_c+k1_4: Baseline + section-specialized segmentation (SS-S). 1_kddi_ss_all1_3: Baseline + SS-S + anchor shot segmentation (ASS) based on audio classification results 1_kddi_ss_all1_pfil_1: Baseline + SS-S + ASS and post-filtering (PF) based on audio classification results 1_kddi_ss_all2_pfil_2: Extended baseline + SS-S + ASS + PF based on audio classification results. 1_kddi_ss_all1nsp07_pfil_6: Baseline + SS-S + ASS + PF by HMM-based non-speech detection. 1_kddi_ss_all2nsp07_pfil_7: Extended baseline + SSS + ASS + PF by HMM-based non-speech detection. 2_kddi_ss2_all1_pfil_8: Baseline + SS-S + ASS and PF based on "speech segment" information from LIMSI ASR results[1]. 2_kddi_ss2_all2_pfil_9: Extended baseline + SS-S + ASS and PF based on "speech segment" information from LIMSI ASR results. 3_kddi_ss3_10: Naive TextTiling based story segmentation based on LIMSI ASR data.

[1]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[2]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[3]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[4]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[5]  Yukinobu Taniguchi,et al.  Structured Video Computing , 1994, IEEE MultiMedia.

[6]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[7]  Masaru Sugano,et al.  MPEG content summarization based on compressed domain feature analysis , 2003, SPIE ITCom.

[8]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[9]  Jean-Luc Gauvain,et al.  The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[10]  Ramesh C. Jain,et al.  Digital video segmentation , 1994, MULTIMEDIA '94.

[11]  Ching-Yung Lin,et al.  Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.

[12]  Kunio Kashino,et al.  A quick search method for audio and video signals based on histogram pruning , 2003, IEEE Trans. Multim..

[13]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[14]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[15]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[16]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[17]  Yang Lu,et al.  A fast audio classification from MPEG coded data , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[18]  Yoshinobu Tonomura,et al.  Projection-detecting filter for video cut detection , 1994, MULTIMEDIA '93.

[19]  Yasuyuki Nakajima,et al.  Video Structure Analysis and Its Application to Creation of Video Summary , 1995 .

[20]  Edward J. Delp,et al.  A fast algorithm for video parsing using MPEG compressed sequences , 1995, Proceedings., International Conference on Image Processing.

[21]  Akio Yoneyama,et al.  Universal scene change detection on MPEG-coded data domain , 1997, Electronic Imaging.

[22]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.