Techniques and data structures for efficient multimedia retrieval based on similarity

As more and more information is captured and stored in digital form, many techniques and systems have been developed for indexing and retrieving text documents, audio, images, and video. Retrieval is normally based on the similarity between feature vectors extracted from the query and from the stored items. Feature vectors are usually multidimensional. When the number of stored objects or the number of feature-vector dimensions is large, a linear search of all stored feature vectors is too slow to find those that satisfy the query criteria. Techniques and data structures are therefore required to organize feature vectors and manage the search process so that objects relevant to the query can be located quickly. This paper surveys these techniques and data structures.
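To make the baseline concrete, the following is a minimal sketch of the linear search that the surveyed index structures aim to avoid. It is an illustration only, not a method taken from the paper: the Euclidean distance, the function names `euclidean` and `linear_knn`, and the toy three-dimensional feature vectors are all assumptions introduced here. Every query visits every stored vector, so the cost grows with both the number of items and the number of dimensions.

```python
import heapq
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def linear_knn(query, items, k):
    """Return the k (distance, item_id) pairs nearest to `query`.

    `items` maps a hypothetical item id to its feature vector.
    Every stored vector is compared with the query, so one query
    costs O(N * d) for N items of d dimensions.
    """
    scored = ((euclidean(query, vec), item_id) for item_id, vec in items.items())
    return heapq.nsmallest(k, scored)


# Toy database: made-up 3-bin colour histograms, for illustration only.
database = {
    "img_a": (0.9, 0.1, 0.0),
    "img_b": (0.1, 0.8, 0.1),
    "img_c": (0.8, 0.2, 0.0),
    "img_d": (0.0, 0.1, 0.9),
}

result = linear_knn((1.0, 0.0, 0.0), database, k=2)
nearest_ids = [item_id for _, item_id in result]
print(nearest_ids)
```

An index structure replaces the full scan in `linear_knn` with a search that prunes most of the stored vectors while, ideally, returning the same nearest items.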
