Light-years from Lena: video and image libraries of the future

The average consumer with a personal computer will soon have access to the world's collections of digital video and images. However, the theory and tools that facilitate browsing, querying, retrieval, and manipulation of imagery are still in their infancy. For example, people would like to access content in movies, e.g. "fast forward to where they bicycle through the sky". This new application area reveals an abundance of unsolved scientific problems for image processing. An overview is provided of the key technical challenges that the image processing community should embrace. The scope of the paper is restricted to image processing, with particular focus on the problems of representation and analysis of image content, and frame-to-frame motion.

[1]  R. Gray,et al.  Combining Image Compression and Classification Using Vector Quantization , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Fang Liu,et al.  Periodicity, directionality, and randomness: Wold features for perceptual pattern recognition , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[3]  Harpreet S. Sawhney,et al.  Model-based 2D&3D dominant motion estimation for mosaicing and video representation , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[5]  Yukinobu Taniguchi,et al.  Structured Video Computing , 1994, IEEE MultiMedia.

[6]  Richard L. Delanoy Machine learning for a Toolkit for Image Mining , 1995 .

[7]  Atreyi Kankanhalli,et al.  A Video Database System for Digital Libraries , 1994, DL.

[8]  R. Nelson,et al.  Low level recognition of human motion (or how to get your man without finding his body parts) , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[9]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[10]  Shih-Fu Chang,et al.  Transform features for texture classification and discrimination in large image databases , 1994, Proceedings of 1st International Conference on Image Processing.

[11]  Rosalind W. Picard,et al.  Orbits': Characterizing the Coordinate Transformation between Two Images Using the Projective Group , 1995 .

[12]  Michael J. Swain,et al.  Indexing via color histograms , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[13]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[14]  Arding Hsu,et al.  Feature management for large video databases , 1993, Electronic Imaging.

[15]  Linda G. Shapiro,et al.  Image Segmentation Techniques , 1984, Other Conferences.

[16]  Michal Irani,et al.  Detecting and Tracking Multiple Moving Objects Using Temporal Integration , 1992, ECCV.

[17]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[18]  John S. Boreczky,et al.  Indexes for user access to large video databases , 1994, Electronic Imaging.