High-Dimensional Indexing for Video Retrieval

The video retrieval task raises many fundamental questions in computer vision and information retrieval, such as how to represent video items, what information can be directly extracted from them, and how to explore such information in order to satisfy the user’s information need. Video items are intrinsically complex, and the analysis of their content requires heavy computing processes. Excepting the case of text, multimedia content analysis does not result in the high-level concepts required by the generality of the search tasks. This brings about the so-called “semantic gap”, clearly identified as the main issue in multimedia retrieval Gudivada & Raghavan (1995); Smith (2007). Text communication is based on concepts, expressed in the user’s language and close to the way humans think. Searching text items may require a number of processing techniques like pattern matching, stemming, finding synonyms, translating, natural language analysis. Supposing that the user expresses his information need through words, the set of retrieved documents includes those containing precise or imprecise word matches and can be continuously enlarged to more and more semantically related documents. In the case of a video repository, however, there is not such a clear semantic channel between the object’s content and the user information need. The automatic video analysis may produce many descriptors related to the contents, the so-called low-level descriptors Bober (Jun 2001); Manjunath et al. (2001); Mufit Ferman et al. (2000), but hardly produces accurate descriptions close enough to human concepts Snoek & Smeulders (2010); Tesic & Smith (2006). This is a problem for a wide range of video-based applications, other than search and retrieval, such as human-computer interface, security and surveillance, copyright protection, and personal entertainment.

[1]  Horst M. Eidenberger,et al.  Distance measures for MPEG-7-based retrieval , 2003, MIR '03.

[2]  Christian Böhm,et al.  Adaptable Similarity Search Using Vector Quantization , 2001, DaWaK.

[3]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[4]  Cristina Ribeiro,et al.  Multidimensional Descriptor Indexing: Exploring the BitMatrix , 2006, CIVR.

[5]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[6]  Simone Santini,et al.  Similarity is a Geometer , 1997, Multimedia Tools and Applications.

[7]  Joshua B. Tenenbaum,et al.  A Generative Theory of Similarity , 2005 .

[8]  Ben Shneiderman,et al.  Find that photo! , 2006, Commun. ACM.

[9]  Naphtali Rishe,et al.  Content-based image retrieval , 1995, Multimedia Tools and Applications.

[10]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[11]  Hermann Ney,et al.  Features for Image Retrieval: A Quantitative Comparison , 2004, DAGM-Symposium.

[12]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[13]  Nicu Sebe,et al.  A New Study on Distance Metrics as Similarity Measurement , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[14]  Beng Chin Ooi,et al.  Indexing the edges—a simple and yet efficient approach to high-dimensional indexing , 2000, PODS.

[15]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[16]  Usama M. Fayyad,et al.  On the Handling of Continuous-Valued Attributes in Decision Tree Generation , 1992, Machine Learning.

[17]  S. Robertson The probability ranking principle in IR , 1997 .

[18]  John R. Smith The Real Problem of Bridging the "Semantic Gap" , 2007, MCAM.

[19]  Craig MacDonald,et al.  Searching for expertise using the terrier platform , 2006, SIGIR '06.

[20]  Anthony K. H. Tung,et al.  LDC: enabling search by partial distance in a hyper-dimensional space , 2004, Proceedings. 20th International Conference on Data Engineering.

[21]  Djemel Ziou,et al.  Image Retrieval from the World Wide Web: Issues, Techniques, and Systems , 2004, CSUR.

[22]  Chong-Wah Ngo,et al.  Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.

[23]  Guang-Ho Cha,et al.  Bitmap indexing method for complex similarity queries with relevance feedback , 2003, MMDB '03.

[24]  Christos Faloutsos,et al.  A survey of information retrieval and filtering methods , 1995 .

[25]  B. S. Manjunath,et al.  Nearest neighbor search for relevance feedback , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[26]  G. Medioni,et al.  Content-based image retrieval: an overview , 2004 .

[27]  Takeo Kanade,et al.  Intelligent Access to Digital Video: Informedia Project , 1996, Computer.

[28]  Nuno Vasconcelos,et al.  Query by Semantic Example , 2006, CIVR.

[29]  Nevenka Dimitrova Context and Memory in Multimedia Content Analysis , 2004, IEEE Multim..

[30]  B. S. Manjunath,et al.  Managing large-scale multimedia repositories , 2004 .

[31]  Mario A. López,et al.  High dimensional similarity search with space filling curves , 2001, Proceedings 17th International Conference on Data Engineering.

[32]  Ricardo A. Baeza-Yates,et al.  Searching in metric spaces , 2001, CSUR.

[33]  Paul O'Leary,et al.  Cheshire II: Designing a Next-Generation Online Catalog , 1996, J. Am. Soc. Inf. Sci..

[34]  Jeffrey F. Naughton,et al.  Generalized Search Trees for Database Systems , 1995, VLDB.

[35]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[36]  Philip S. Yu,et al.  The IGrid index: reversing the dimensionality curse for similarity indexing in high dimensional space , 2000, KDD '00.

[37]  Stefan M. Rüger,et al.  Fractional Distance Measures for Content-Based Image Retrieval , 2005, ECIR.

[38]  Jarek Gryz,et al.  Maximal Vector Computation in Large Data Sets , 2005, VLDB.

[39]  R. Bayer,et al.  Organization and maintenance of large ordered indices , 1970, SIGFIDET '70.

[40]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[41]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[42]  Christos Faloutsos,et al.  MindReader: Querying Databases Through Multiple Examples , 1998, VLDB.

[43]  Jan Chomicki,et al.  Querying with Intrinsic Preferences , 2002, EDBT.

[44]  Sunil Arya,et al.  An optimal algorithm for approximate nearest neighbor searching fixed dimensions , 1998, JACM.

[45]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[46]  Wolf-Tilo Balke,et al.  Multi-objective Query Processing for Database Systems , 2004, VLDB.

[47]  Jitendra Malik,et al.  Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[48]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  Shin'ichi Satoh,et al.  Distinctiveness-sensitive nearest-neighbor search for efficient similarity retrieval of multimedia information , 2001, Proceedings 17th International Conference on Data Engineering.

[50]  Ramesh C. Jain,et al.  Similarity indexing with the SS-tree , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[51]  Qi Tian,et al.  Semantic retrieval of video - review of research on video retrieval in meetings, movies and broadcast news, and sports , 2006, IEEE Signal Processing Magazine.

[52]  Rong Yan,et al.  How many high-level concepts will fill the semantic gap in news video retrieval? , 2007, CIVR '07.

[53]  Charu C. Aggarwal,et al.  Towards meaningful high-dimensional nearest neighbor search by human-computer interaction , 2002, Proceedings 18th International Conference on Data Engineering.

[54]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[55]  Christian Böhm,et al.  ProVeR: Probabilistic Video Retrieval using the Gauss-Tree , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[56]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[57]  Angela Schwering,et al.  Hybrid Model for Semantic Similarity Measurement , 2005, OTM Conferences.

[58]  I. Jolliffe Principal Component Analysis , 2002 .

[59]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[60]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[61]  A. Murat Tekalp,et al.  Integrated semantic-syntactic video modeling for search and browsing , 2004, IEEE Transactions on Multimedia.

[62]  Òscar Celma,et al.  Foafing the Music: Bridging the Semantic Gap in Music Recommendation , 2006, SEMWEB.

[63]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[64]  Marco Patella,et al.  Searching in metric spaces with user-defined and approximate distances , 2002, TODS.

[65]  Gertjan J. Burghouts,et al.  Performance evaluation of local colour invariants , 2009, Comput. Vis. Image Underst..

[66]  Ronald Fagin,et al.  Efficient similarity search and classification via rank aggregation , 2003, SIGMOD '03.

[67]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[68]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[69]  Christian Böhm,et al.  Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[70]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[71]  Horst M. Eidenberger,et al.  An experimental study on the performance of visual information retrieval similarity models , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[72]  Adnan Yazici,et al.  Slim-tree and BitMatrix index structures in image retrieval system using MPEG-7 Descriptors , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[73]  Cristina Ribeiro,et al.  An Evaluation Framework for Multidimensional Multimedia Descriptor Indexing , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[74]  M. Raubal Formalizing Conceptual Spaces , 2004 .

[75]  Stefan Rüger,et al.  Image and Video Retrieval , 2002 .

[76]  Li Wu,et al.  VP-EMD Tree: An Efficient Indexing Strategy for Image Retrieval , 2004, CISST.

[77]  Wenbin Chen,et al.  Image Tangent Space for Image Retrieval , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[78]  Werner Kießling,et al.  Personalized Keyword Search with Partial-Order Preferences , 2002, SBBD.

[79]  Qing Liu,et al.  Efficient Computation of the Skyline Cube , 2005, VLDB.

[80]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[81]  KanadeTakeo,et al.  Intelligent Access to Digital Video , 1996 .

[82]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[83]  Rong Yan,et al.  A review of text and image retrieval approaches for broadcast news video , 2007, Information Retrieval.

[84]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[85]  Martin L. Kersten,et al.  Efficient k-NN search on vertically decomposed data , 2002, SIGMOD '02.

[86]  Werner Bailer,et al.  An innovative system for formulating complex, combined content-based and keyword-based queries , 2003, IS&T/SPIE Electronic Imaging.

[87]  Jun Yang,et al.  Annotating News Video with Locations , 2006, CIVR.

[88]  Stefan M. Rüger,et al.  Trading Precision for Speed: Localised Similarity Functions , 2005, CIVR.

[89]  A. Murat Tekalp,et al.  Group-of-frames/pictures color histogram descriptors for multimedia applications , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[90]  Beng Chin Ooi,et al.  Toward efficient multifeature query processing , 2006, IEEE Transactions on Knowledge and Data Engineering.

[91]  Miroslaw Bober,et al.  MPEG-7 visual shape descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[92]  Dedre Gentner,et al.  Structure-Mapping: A Theoretical Framework for Analogy , 1983, Cogn. Sci..

[93]  Ilaria Bartolini,et al.  Optimal Incremental Evaluation of Preference Queries Based on Ranked Sub-queries , 2005, SEBD.

[94]  Yihong Gong,et al.  Lessons Learned from Building a Terabyte Digital Video Library , 1999, Computer.

[95]  Gerhard Weikum,et al.  Integrating DB and IR Technologies: What is the Sound of One Hand Clapping? , 2005, CIDR.

[96]  Jonathan Goldstein,et al.  Redundant Bit Vectors for Quickly Searching High-Dimensional Regions , 2004, Deterministic and Statistical Methods in Machine Learning.

[97]  Horst M. Eidenberger,et al.  Visual similarity measurement with the feature contrast model , 2003, IS&T/SPIE Electronic Imaging.

[98]  Bernhard Seeger,et al.  XXL - A Library Approach to Supporting Efficient Implementations of Advanced Database Queries , 2001, VLDB.

[99]  M. Tamer Özsu,et al.  Integrating the Results of Multimedia Sub-Queries Using Qualitative Preferences , 2004, Multimedia Information Systems.

[100]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[101]  Z. Meral Özsoyoglu,et al.  Indexing large metric spaces for similarity search queries , 1999, TODS.

[102]  Mingjing Li,et al.  Mapping low-level features to high-level semantic concepts in region-based image retrieval , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[103]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[104]  Wessel Kraaij,et al.  Variations on language modeling for information retrieval , 2005, SIGF.

[105]  A. Hanjalic,et al.  Extracting moods from pictures and sounds: towards truly personalized TV , 2006, IEEE Signal Processing Magazine.

[106]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[107]  László Böszörményi,et al.  The Life Cycle of Multimedia Metadata , 2005, IEEE Multim..

[108]  John R. Smith,et al.  Semantic Labeling of Multimedia Content Clusters , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[109]  Dick C. A. Bulterman,et al.  Is It Time for a Moratorium on Metadata? , 2004, IEEE Multim..

[110]  Jian Pei,et al.  Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces , 2005, VLDB.

[111]  Mario A. Nascimento,et al.  High-Dimensional Similarity Searches Using A Metric Pseudo-Grid , 2005, 21st International Conference on Data Engineering Workshops (ICDEW'05).

[112]  Beng Chin Ooi,et al.  Querying high-dimensional data in single-dimensional space , 2004, The VLDB Journal.

[113]  Marcel Worring,et al.  The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[114]  Li Yujian,et al.  A Normalized Levenshtein Distance Metric , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[115]  Beng Chin Ooi,et al.  iDistance: An adaptive B+-tree based indexing method for nearest neighbor search , 2005, TODS.

[116]  Wei-Ying Ma,et al.  Image and Video Retrieval , 2003, Lecture Notes in Computer Science.

[117]  Marcel Worring,et al.  Multimedia event-based video indexing using time intervals , 2005, IEEE Transactions on Multimedia.

[118]  Thomas S. Huang,et al.  CBIR: from low-level features to high-level semantics , 2000, Electronic Imaging.

[119]  Dan Corbett,et al.  A basic mathematical framework for conceptual graphs , 2006, IEEE Transactions on Knowledge and Data Engineering.

[120]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[121]  Thijs Westerveld,et al.  Using generative probabilistic models for multimedia retrieval , 2005, SIGF.

[122]  Franciska de Jong,et al.  Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition , 2007, SAMT.

[123]  Shih-Fu Chang,et al.  Multimedia Knowledge Integration, Summarization And Evaluation , 2002, MDM/KDD.

[124]  Jan Chomicki,et al.  Preference formulas in relational queries , 2003, TODS.

[125]  Aoying Zhou,et al.  An adaptive and dynamic dimensionality reduction method for high-dimensional indexing , 2007, The VLDB Journal.

[126]  A. Tversky Features of Similarity , 1977 .

[127]  Setsuo Ohsuga,et al.  INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES , 1977 .

[128]  Hanan Samet,et al.  Index-driven similarity search in metric spaces (Survey Article) , 2003, TODS.

[129]  Quan Wang,et al.  Fast Similarity Search for High-Dimensional Dataset , 2006, Eighth IEEE International Symposium on Multimedia (ISM'06).

[130]  Philip S. Yu,et al.  A new method for similarity indexing of market basket data , 1999, SIGMOD '99.

[131]  T.H.W. Westerveld,et al.  RECVID as a Re-Usable Test-Collection for Video Retrieval , 2003 .

[132]  Rudolf Bayer,et al.  Organization and maintenance of large ordered indexes , 1972, Acta Informatica.

[133]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[134]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[135]  Walid G. Aref,et al.  SP-GiST: An Extensible Database Index for Supporting Space Partitioning Trees , 2001, Journal of Intelligent Information Systems.

[136]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[137]  Arnold W. M. Smeulders,et al.  Visual-Concept Search Solved? , 2010, Computer.

[138]  Charu C. Aggarwal,et al.  On the Surprising Behavior of Distance Metrics in High Dimensional Spaces , 2001, ICDT.