An efficient index structure for large-scale geo-tagged video databases

An unprecedented number of user-generated videos (UGVs) are currently being collected by mobile devices, however, such unstructured data are very hard to index and search. Due to recent development, UGVs can be geo-tagged, e.g., GPS locations and compass directions, at the acquisition time at a very fine spatial granularity. Ideally, each video frame can be tagged by the spatial extent of its coverage area, termed Field-Of-View (FOV). In this paper, we focus on the challenges of spatial indexing and querying of FOVs in a large repository. Since FOVs contain both location and orientation information, and their distribution is non-uniform, conventional spatial indexes (e.g., R-tree, Grid) cannot index them efficiently. We propose a class of new R-tree-based index structures that effectively harness FOVs' camera locations, orientations and view-distances, in tandem, for both filtering and optimization. In addition, we present novel search strategies and algorithms for efficient range and directional queries on FOVs utilizing our indexes. Our experiments with a real-world dataset and a large synthetic video dataset (over 30 years worth of videos) demonstrate the scalability and efficiency of our proposed indexes and search algorithms and their superiority over the competitors.

[1]  Cyrus Shahabi,et al.  MediaQ: mobile multimedia management system , 2014, MMSys '14.

[2]  Yannis Theodoridis,et al.  Supporting Direction Relations in Spatial Database systems , 1996 .

[3]  Jimeng Sun,et al.  The TPR*-Tree: An Optimized Spatio-Temporal Access Method for Predictive Queries , 2003, VLDB.

[4]  Shashi Shekhar,et al.  An Object Model of Direction and Its Implications , 1999, GeoInformatica.

[5]  J. Blat,et al.  VideoGIS: Segmenting and indexing video based on geographic information , 2002 .

[6]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[7]  Ken C. K. Lee,et al.  Nearest Surrounder Queries , 2006, IEEE Transactions on Knowledge and Data Engineering.

[8]  Roger Zimmermann,et al.  Relevance ranking in georeferenced video search , 2009, Multimedia Systems.

[9]  Allen R. Hanson,et al.  An efficient method for geo-referenced video mosaicing for environmental monitoring , 2005, Machine Vision and Applications.

[10]  Jing Xu,et al.  DESKS: Direction-Aware Spatial Keyword Search , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[11]  Roger Zimmermann,et al.  Automatic tag generation and ranking for sensor-rich outdoor videos , 2011, MM '11.

[12]  Tao Chen,et al.  Cardinal directions between complex regions , 2012, TODS.

[13]  Roger Zimmermann,et al.  Generating synthetic meta-data for georeferenced video management , 2010, GIS '10.

[14]  Jinha Kim,et al.  GeoTree: Using spatial information for georeferenced video search , 2014, Knowl. Based Syst..

[15]  Roger Zimmermann,et al.  Viewable scene modeling for geospatial video search , 2008, ACM Multimedia.

[16]  Ben Shneiderman,et al.  Data structures for dynamic queries: an analytical and experimental evaluation , 1994, AVI '94.

[17]  Kentaro Toyama,et al.  Geographic location tags on digital images , 2003, ACM Multimedia.

[18]  He Ma,et al.  Large-scale geo-tagged video indexing and queries , 2014, GeoInformatica.

[19]  Shashi Shekhar,et al.  Object-Based Directional Query Processing in Spatial Databases , 2003, IEEE Trans. Knowl. Data Eng..

[20]  Ravi Krishnamurthy,et al.  The Multilevel Grid File - A Dynamic Hierarchical Multidimensional File Structure , 1991, DASFAA.