Techniques and Systems for Image and Video Retrieval

Storage and retrieval of multimedia has become a requirement for many contemporary information systems. These systems need to provide browsing, querying, navigation, and, sometimes, composition capabilities involving various forms of media. In this survey, we review techniques and systems for image and video retrieval. We first look at visual features for image retrieval such as color, texture, shape, and spatial relationships. The indexing techniques are discussed for these features. Nonvisual features include captions, annotations, relational attributes, and structural descriptions. Temporal aspects of video retrieval and video segmentation are discussed next. We review several systems for image and video retrieval including research, commercial, and World Wide Web-based systems. We conclude with an overview of current challenges and future trends for image and video retrieval.

[1]  Jake K. Aggarwal,et al.  Segmentation through the detection of changes due to motion , 1979 .

[2]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[3]  Shi-Kuo Chang,et al.  Iconic Indexing by 2-D Strings , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Gilles Halin,et al.  Machine learning and vectorial matching for an image retrieval model: EXPRIM and the system RIVAGE , 1989, SIGIR '90.

[5]  Hanan Samet,et al.  The Design and Analysis of Spatial Data Structures , 1989 .

[6]  辻 三郎,et al.  Computer analysis of visual textures , 1990 .

[7]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[8]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[9]  Toshikazu Kato,et al.  A sketch retrieval method for full color image database-query by visual example , 1992, [1992] Proceedings. 11th IAPR International Conference on Pattern Recognition.

[10]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[11]  Max J. Egenhofer,et al.  What's special about spatial?: database requirements for vehicle navigation in geographic space , 1993, SIGMOD Conference.

[12]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[13]  P. Venkat Rangan,et al.  Efficient Storage Techniques for Digital Continuous Multimedia , 1993, IEEE Trans. Knowl. Data Eng..

[14]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[15]  Euripides G. M. Petrakis,et al.  Similarity Searching in Large Image DataBases , 1994 .

[16]  Euripides G. M. Petrakis,et al.  Similarity searching in large image database , 1994 .

[17]  Joshua R. Smith,et al.  Automatic Feature Extraction and Indexing for Content-Based Visual Query , 1995 .

[18]  Rohini K. Srihari,et al.  Automatic Indexing and Content-Based Retrieval of Captioned Images , 1995, Computer.

[19]  Clement T. Yu,et al.  Design, implementation and evaluation of SCORE (a system for content based retrieval of pictures) , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[20]  Gwang S. Jung,et al.  Adaptive Query Reformulation In Attribute Based Image Retrieval , 1995 .

[21]  Timos K. Sellis,et al.  Topological relations in the world of minimum bounding rectangles: a study with R-trees , 1995, SIGMOD '95.

[22]  Aidong Zhang,et al.  Texture-Based Image Retrieval Using Fractal Codes , 1995 .

[23]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[24]  Rama Chellappa,et al.  Human and machine recognition of faces: a survey , 1995, Proc. IEEE.

[25]  Michael Stonebraker,et al.  Chabot: Retrieval from a Relational Database of Images , 1995, Computer.

[26]  David K. Gifford,et al.  Composition and Search with a Video Algebra , 1995, IEEE Multim..

[27]  Tat-Seng Chua,et al.  An integrated color-spatial approach to content-based image retrieval , 1995, MULTIMEDIA '95.

[28]  TheodoridisYannis,et al.  Topological relations in the world of minimum bounding rectangles , 1995 .

[29]  Takeo Kanade,et al.  Human Face Detection in Visual Scenes , 1995, NIPS.

[30]  Alan F. Smeaton,et al.  Experiments on using semantic distances between words in image caption retrieval , 1996, SIGIR '96.

[31]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[32]  Takeo Kanade,et al.  Intelligent Access to Digital Video: Informedia Project , 1996, Computer.

[33]  David J. Abel What's Special about Spatial? , 1996, Australasian Database Conference.

[34]  Edoardo Ardizzone,et al.  JACOB: just a content-based query system for video databases , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[35]  David Doermann,et al.  Archiving, indexing, and retrieval of video in the compressed domain , 1996, Other Conferences.

[36]  Ricky K. Taira,et al.  A Knowledge-Based Approach for Retrieving Images by Content , 1996, IEEE Trans. Knowl. Data Eng..

[37]  Rakesh Mohan,et al.  Text-based search of TV news stories , 1996, Other Conferences.

[38]  Shih-Fu Chang,et al.  Efficient Techniques for Feature-Based Image/Video Access and Manipulation , 1996, Data Processing Clinic.

[39]  Anil K. Jain,et al.  Object Matching Using Deformable Templates , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Rainer Lienhart,et al.  Automatic text recognition for video indexing , 1997, MULTIMEDIA '96.

[41]  R. Jain,et al.  Visual Information Retrieval Technology A Virage Perspective , 1997 .

[42]  Satish K. Tripathi,et al.  Impact of video scheduling on bandwidth allocation for multiplexed MPEG streams , 1997, Multimedia Systems.

[43]  Th. Hermes,et al.  Video retrieval with IRIS , 1997, MULTIMEDIA '96.

[44]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[45]  Clement T. Yu,et al.  Similarity based retrieval of videos , 1997, Proceedings 13th International Conference on Data Engineering.

[46]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[47]  Clement T. Yu,et al.  Using semantic contents and WordNet in image retrieval , 1997, SIGIR '97.

[48]  K. Selçuk Candan,et al.  SEMCOG: an object-based image retrieval system and its visual query interface , 1997, SIGMOD '97.

[49]  Jonathan D. Courtney Automatic, object-based indexing for assisted analysis of video data , 1997, MULTIMEDIA '96.

[50]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[51]  Hans-Peter Kriegel,et al.  The pyramid-technique: towards breaking the curse of dimensionality , 1998, SIGMOD '98.

[52]  Alfio Lombardo,et al.  Control of perceived quality of service in multimedia retrieval services: prediction-based mechanism vs. compensation buffers , 1998, Multimedia Systems.

[53]  BerchtoldStefan,et al.  The pyramid-technique , 1998 .

[54]  John R. Smith,et al.  Searching for Images and Videos on the World-Wide Web , 1999 .