相关论文

CLIPS at TRECVID : Shot Boundary Detection and Feature Detection

Abstract:This paper presents the systems used by CLIPS-IMAG to perform the Shot Boundary Detection (SBD) task and the Feature Extraction (FE) task of the TRECvid workshop. Results obtained for the 2003 evaluation are presented. The CLIPS SBD system based on image difference with motion compensation and direct dissolve detection was second among 14 systems. This system gives control of the silence to noise ratio over a wide range of values and for an equal value of noise and silence (or recall and precision), the value is 12 % for all types of transitions. Detection of person X from speaker recognition alone was deceiving due to the small number of shots containing person X in the overall test collection (about 1/700) and the even small number in which person X was actually speaking (about 1/6000). Detection of person X from speech transcription performed much better but was still lower than other systems using also the image track for the detection.

参考文献
引用
Video Indexing: A Survey
2014
Content based video retrieval systems
ArXiv
2012
Adapting content based video retrieval systems to accommodate the novice user on mobile devices.
2013
Automatic Story Segmentation for TV News Video Using Multiple Modalities
Int. J. Digit. Multim. Broadcast.
2012
From Text Detection in Videos to Person Identification
2012 IEEE International Conference on Multimedia and Expo
2012
Video Segmentation and Shot Boundary Detection Using Self-Organizing Maps
SCIA
2007
LSIS TREC VIDEO 2009 High Level Feature Retrieval using Compact Profile Entropy Descriptors
2009
Multimodal Fusion: Combining Visual and Textual Cues for Concept Detection in Video
Multimedia Data Mining and Analytics
2015
Efficient Video Summarization Based on Motion SIFT-Distribution Histogram
2016 13th International Conference on Computer Graphics, Imaging and Visualization (CGiV)
2016
A Survey on Visual Content-Based Video Indexing and Retrieval
IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
2011
A Local Temporal Context-Based Approach for TV News Story Segmentation
2012 IEEE International Conference on Multimedia and Expo
2012
Analysis and Review of Formal Approaches to Automatic Video Shot Boundary Detection
2014
University of Sheffield at TRECVID 2007: Shot Boundary Detection and Rushes Summarisation
TRECVID
2007
Automatic tag correction in videos : an approach based on frequent pattern mining. (Correction automatique d'annotations de vidéos : une approche à base de fouille de motifs fréquents)
2014
Robust Speaker Diarization for Single Channel Recorded Meetings
2009
Multi-Modal Music Information Retrieval: Augmenting Audio-Analysis with Visual Computing for Improved Music Video Analysis
ArXiv
2020
Step-by-step and integrated approaches in broadcast news speaker diarization
Comput. Speech Lang.
2006
Shot Boundary Detection In The Framework of Rough Indexing Paradigm
TRECVID
2004
On the Unsolved Problem of Shot Boundary Detection for Music Videos
MMM
2018
Video story segmentation with multi-modal features: experiments on TRECvid 2003
MIR '04
2004