Shot Boundary Determination on MPEC Compressed Domain and Story Segmentation Experiments for TRECVID 2003
Abstract:1. Briefly, what approach or combination of approaches did you test in each of your submitted runs? 1_kddi_ss_base1_5: “Baseline” method based on SVM, which discriminates shots that contain story boundaries. 1_kddi_ss_c+k1_4: Baseline + section-specialized segmentation (SS-S). 1_kddi_ss_all1_3: Baseline + SS-S + anchor shot segmentation (ASS) based on audio classification results 1_kddi_ss_all1_pfil_1: Baseline + SS-S + ASS and post-filtering (PF) based on audio classification results 1_kddi_ss_all2_pfil_2: Extended baseline + SS-S + ASS + PF based on audio classification results. 1_kddi_ss_all1nsp07_pfil_6: Baseline + SS-S + ASS + PF by HMM-based non-speech detection. 1_kddi_ss_all2nsp07_pfil_7: Extended baseline + SSS + ASS + PF by HMM-based non-speech detection. 2_kddi_ss2_all1_pfil_8: Baseline + SS-S + ASS and PF based on "speech segment" information from LIMSI ASR results[1]. 2_kddi_ss2_all2_pfil_9: Extended baseline + SS-S + ASS and PF based on "speech segment" information from LIMSI ASR results. 3_kddi_ss3_10: Naive TextTiling based story segmentation based on LIMSI ASR data.
暂无分享,去 创建一个
[1] Marti A. Hearst. Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.
[2] Stephen W. Smoliar,et al. Content based video indexing and retrieval , 1994, IEEE MultiMedia.
[3] J. J. Rocchio,et al. Relevance feedback in information retrieval , 1971 .
[4] Behzad Shahraray,et al. Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.
[5] Yukinobu Taniguchi,et al. Structured Video Computing , 1994, IEEE MultiMedia.
[6] Gerard Salton,et al. The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .
[7] Masaru Sugano,et al. MPEG content summarization based on compressed domain feature analysis , 2003, SPIE ITCom.
[8] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[9] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..
[10] Ramesh C. Jain,et al. Digital video segmentation , 1994, MULTIMEDIA '94.
[11] Ching-Yung Lin,et al. Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.
[12] Kunio Kashino,et al. A quick search method for audio and video signals based on histogram pruning , 2003, IEEE Trans. Multim..
[13] Yihong Gong,et al. Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.
[14] Akio Nagasaka,et al. Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.
[15] Boon-Lock Yeo,et al. Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..
[16] Shih-Fu Chang,et al. Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.
[17] Yang Lu,et al. A fast audio classification from MPEG coded data , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[18] Yoshinobu Tonomura,et al. Projection-detecting filter for video cut detection , 1994, MULTIMEDIA '93.
[19] Yasuyuki Nakajima,et al. Video Structure Analysis and Its Application to Creation of Video Summary , 1995 .
[20] Edward J. Delp,et al. A fast algorithm for video parsing using MPEG compressed sequences , 1995, Proceedings., International Conference on Image Processing.
[21] Akio Yoneyama,et al. Universal scene change detection on MPEG-coded data domain , 1997, Electronic Imaging.
[22] Arding Hsu,et al. Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.