We present a framework for content based query and retrieval of information from large video databases. This framework enables content based retrieval of video sequences by characterizing the sequences using motion, texture and colorimetry cues. This characterization is biologically inspired and results in a compact parameter space where every segment of video is represented by an 8 dimensional vector. Searching and retrieval is done in real-time with high accuracy in this parameter space. The present version of the VideoBook has 165 video sequences, each 15 seconds long at 30 frames a second representing storage of 65 Giga bytes. The VideoBook is able to search and retrieve video sequences with 92% accuracy in real-time. Experiments thus demonstrate that the characterization is capable of extracting higher level structure from raw pixel values.
[1]
E. Adelson,et al.
The Plenoptic Function and the Elements of Early Vision
,
1991
.
[2]
D H Hubel,et al.
Brain mechanisms of vision.
,
1979,
Scientific American.
[3]
Dragutin Petkovic,et al.
Query by Image and Video Content: The QBIC System
,
1995,
Computer.
[4]
Wentian Li.
Mutual information functions versus correlation functions
,
1990
.
[5]
Rosalind W. Picard.
A Society of Models for Video and Image Libraries
,
1996,
IBM Syst. J..
[6]
Berthold K. P. Horn.
Robot vision
,
1986,
MIT electrical engineering and computer science series.
[7]
David J. Fleet,et al.
Performance of optical flow techniques
,
1992,
Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.