On Video Retrieval: Content Analysis by ImageMinerTM

In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos eeciently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from diierent areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The nal content analysis is done by the ImageMiner system. The ImageMiner system developed at the University of Bremen of the Image Processing Department of the Center for Computing Technology realizes content-based image retrieval for still images through a novel combination of methods and techniques of computer vision and artiical intelligence. The ImageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes.

[1]  Rosalind W. Picard,et al.  Video Orbits of the Projective Group: A New Perspective on Image Mosaicing , 1995 .

[2]  Boon-Lock Yeo,et al.  Video content characterization and compaction for digital library applications , 1997, Electronic Imaging.

[3]  尾島 修一,et al.  IS & T/SPIE's Symposium on Electronic Imaging : Science & Technology , 1995 .

[4]  Thorsten Hermes,et al.  Textures: an approach for new abstract description language , 1996, Electronic Imaging.

[5]  Walter Bender,et al.  Salient Stills: Process and Practice , 1996, IBM Syst. J..

[6]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[7]  Otthein Herzog,et al.  Video retrieval by still-image analysis with ImageMiner , 1997, Electronic Imaging.

[8]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[9]  Calvin C. Gotlieb,et al.  Texture descriptors based on co-occurrence matrices , 1990, Comput. Vis. Graph. Image Process..

[10]  Harpreet S. Sawhney,et al.  Dominant and multiple motion estimation for video representation , 1995, Proceedings., International Conference on Image Processing.

[11]  Peter Alshuth,et al.  IRIS - Image retrieval for images and videos , 1996 .

[12]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Axel Korn,et al.  Toward a Symbolic Representation of Intensity Changes in Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[15]  Azriel Rosenfeld,et al.  A Comparative Study of Texture Measures for Terrain Classification , 1975, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[17]  Thomas Ertl,et al.  Computer Graphics - Principles and Practice, 3rd Edition , 2014 .

[18]  Michael Fröhlich,et al.  Demonstration of the Interactive Graph-Visualization System da Vinci , 1994, GD.

[19]  Ramesh C. Jain,et al.  NSF workshop on visual information management systems: workshop report , 1993, Electronic Imaging.

[20]  A. E. Cawkell,et al.  Imaging systems and picture collection management: a review , 1992 .