GeM-Tree: Towards a Generalized Multidimensional Index Structure Supporting Image and Video Retrieval

In this paper, we propose a tree-based multidimensional index structure, GeM-Tree, which indexes both images and videos within a single general framework using the Earth Mover's Distance. It can support different content-based image and video retrieval approaches, and can accommodate applications where the cross-similarity between images and videos needs to be considered during content-based retrieval. Furthermore, it is flexible enough to index different video classification units and can maintain the hierarchical relationship between them. In addition, it uses a construct called the hierarchical Markov model mediator to introduce high-level semantic relationships among images and different levels of video units. The experimental results indicate that GeM-Tree is a promising generalized index structure for multimedia data: it has low computational overhead, is flexible enough to support different retrieval approaches, and generates query results with high relevance.
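To make the distance measure concrete, the following is a minimal sketch of the Earth Mover's Distance in its simplest setting: two one-dimensional histograms over the same bins, with a ground distance of 1 between adjacent bins. In that special case the distance reduces to the sum of absolute differences between the two cumulative distributions. The function name `emd_1d` and the choice of histograms as input are assumptions made for illustration; the abstract does not specify the signatures or ground distance that GeM-Tree actually uses, and the general case requires solving a transportation problem.

```python
from itertools import accumulate


def emd_1d(p, q):
    """Earth Mover's Distance between two 1D histograms on the same bins.

    Illustrative assumption: adjacent bins are one unit apart, so the
    distance is the sum of absolute differences of the two CDFs.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("histograms must have positive total mass")

    # Normalize so both histograms carry the same total mass (1.0).
    p_norm = [x / total_p for x in p]
    q_norm = [x / total_q for x in q]

    # Mass that must cross each bin boundary equals the CDF difference there.
    cdf_p = accumulate(p_norm)
    cdf_q = accumulate(q_norm)
    return sum(abs(a - b) for a, b in zip(cdf_p, cdf_q))


if __name__ == "__main__":
    # All mass moves two bins to the right.
    print(emd_1d([1, 0, 0], [0, 0, 1]))
```

Because this quantity is a metric when both histograms have equal total mass, it can serve as the distance function in a metric-space index; how GeM-Tree combines it with its tree structure is described in the paper, not in this sketch.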
