Video query: Research directions

As digital video databases become more and more pervasive, finding video in large databases becomes a major problem. Because of the nature of video (streamed objects), accessing the content of such databases is inherently a time-consuming operation. Enabling intelligent means of video retrieval and rapid video viewing through the processing, analysis, and interpretation of visual content are, therefore, important topics of research. In this paper, we survey the art of video query and retrieval and propose a framework for video-query formulation and video retrieval based on an iterated sequence of navigating, searching, browsing, and viewing. We describe how the rich information media of video in the forms of image, audio, and text can be appropriately used in each stage of the search process to retrieve relevant segments. Also, we address the problem of automatic video annotation-- attaching meanings to video segments to aid the query steps. Subsequently, we present a novel framework of structural video analysis that focuses on the processing of high-level features as well as low-level visual cues. This processing augments the semantic interpretation of a wide variety of long video segments and assists in the search, navigation, and retrieval of video. We describe several such techniques.

[1]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[2]  Steve Mann,et al.  Virtual bellows: constructing high quality stills from video , 1994, Proceedings of 1st International Conference on Image Processing.

[3]  Salim Roukos,et al.  TREC-5 Ad Hoc Retrieval Using K Nearest-Neighbors Re-Scoring , 1996, TREC.

[4]  Yoshinobu Tonomura,et al.  VideoMAP and VideoSpaceIcon: tools for anatomizing video content , 1993, INTERCHI.

[5]  David K. Gifford,et al.  Composition and Search with a Video Algebra , 1995, IEEE Multim..

[6]  Glorianna Davenport,et al.  Cinematic primitives for multimedia , 1991, IEEE Computer Graphics and Applications.

[7]  Katsumi Tanaka,et al.  OVID: Design and Implementation of a Video-Object Database System , 1993, IEEE Trans. Knowl. Data Eng..

[8]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[9]  S. Eisenstein,et al.  The Film Sense , 1942 .

[10]  John S. Boreczky,et al.  Indexes for user access to large video databases , 1994, Electronic Imaging.

[11]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[12]  Boon-Lock Yeo,et al.  Video content characterization and compaction for digital library applications , 1997, Electronic Imaging.

[13]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[14]  Minerva Ming-Yee Yeung Analysis, modeling and representation of digital video , 1996 .

[15]  Walter Bender,et al.  Salient video stills: content and context preserved , 1993, MULTIMEDIA '93.

[16]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[17]  S. Eisenstein,et al.  Film Form: Essays in Film Theory , 1949 .

[18]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[19]  Boon-Lock Yeo,et al.  Video browsing using clustering and scene transitions on compressed sequences , 1995, Electronic Imaging.

[20]  Nilesh V. Patel,et al.  Statistical approach to scene change detection , 1995, Electronic Imaging.

[21]  Ramesh C. Jain,et al.  Metadata in video databases , 1994, SGMD.

[22]  Ramesh Jain,et al.  Storage and Retrieval for Still Image and Video Databases IV , 1996 .

[23]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[24]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[25]  Remi Depommier,et al.  Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[26]  Harpreet S. Sawhney,et al.  Model-based 2D&3D dominant motion estimation for mosaicing and video representation , 1995, Proceedings of IEEE International Conference on Computer Vision.

[27]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[28]  Ramesh Jain,et al.  Storage and Retrieval for Image and Video Databases III , 1995 .

[29]  Elke A. Rundensteiner,et al.  A visual query language for identifying temporal trends in video data , 1995, Proceedings. International Workshop on Multi-Media Database Management Systems.

[30]  Richard Szeliski,et al.  Image mosaicing for tele-reality applications , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[31]  Boon-Lock Yeo,et al.  Analysis And Presentation Of Soccer Highlights From Digital Video , 1995 .

[32]  Philippe Aigrain,et al.  The automatic real-time analysis of film editing and transition effects and its applications , 1994, Comput. Graph..

[33]  Gregory L. Zick,et al.  Scene decomposition of MPEG-compressed video , 1995, Electronic Imaging.

[34]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[35]  Yoshinobu Tonomura,et al.  Video browsing using brightness data , 1991, Other Conferences.

[36]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[37]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[38]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[39]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[40]  Behzad Shahraray,et al.  Automatic generation of pictorial transcripts of video programs , 1995, Electronic Imaging.

[41]  Steven Ascher,et al.  The filmmaker's handbook , 1984 .

[42]  Boon-Lock Yeo Efficient processing of compressed images and video , 1996 .

[43]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[44]  Ramesh C. Jain,et al.  Digital video segmentation , 1994, MULTIMEDIA '94.

[45]  D. Anastassiou,et al.  Digital television , 1994, Proc. IEEE.