论文信息 - Object-Based Access to TV Rushes Video

Object-Based Access to TV Rushes Video

Recent years have seen the development of different modalities for video retrieval. The most common of these are (1) to use text from speech recognition or closed captions, (2) to match keyframes using image retrieval techniques like colour and texture [6] and (3) to use semantic features like “indoor”, “outdoor” or “persons”. Of these, text-based retrieval is the most mature and useful, while image-based retrieval using low-level image features usually depends on matching keyframes rather than whole-shots. Automatic detection of video concepts is receiving much attention and as progress is made in this area we will see consequent impact on the quality of video retrieval. In practice it is the combination of these techniques which realises the most useful, and effective, video retrieval as shown by us repeatedly in TRECVid [5].

Alan F. Smeaton | Noel E. O'Connor | Gareth J. F. Jones | Hyowon Lee | Sorin Sav

[1] Marcel Worring,et al. Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2] Alan F. Smeaton,et al. A usage study of retrieval modalities for video shot retrieval , 2006, Inf. Process. Manag..

[3] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4] Paul Browne. Video information retrieval using objects and ostensive relevance feedback , 2004, SAC '04.

[5] Paul Over,et al. The TREC VIdeo Retrieval Evaluation (TRECVID): A Case Study and Status Report , 2004, RIAO.

[6] Noel E. O'Connor,et al. Efficient contour-based shape representation and matching , 2003, MIR '03.