Efficient indexing and retrieval of large-scale geo-tagged video databases

We are witnessing a significant growth in the number of smartphone users and advances in phone hardware and sensor technology. In conjunction with the popularity of video applications such as YouTube, an unprecedented number of user-generated videos (UGVs) are being generated and consumed by the public, which leads to a Big Data challenge in social media. In a very large video repository, it is difficult to index and search videos in their unstructured form. However, due to recent development, videos can be geo-tagged (e.g., locations from GPS receiver and viewing directions from digital compass) at the acquisition time, which can provide potential for efficient management of video data. Ideally, each video frame can be tagged by the spatial extent of its coverage area, termed Field-Of-View (FOV). This effectively converts a challenging video management problem into a spatial database problem. This paper attacks the challenges of large-scale video data management using spatial indexing and querying of FOVs, especially maximally harnessing the geographical properties of FOVs. Since FOVs are shaped similar to slices of pie and contain both location and orientation information, conventional spatial indexes, such as R-tree, cannot index them efficiently. The distribution of UGVs’ locations is non-uniform (e.g., more FOVs in popular locations). Consequently, even multilevel grid-based indexes, which can handle both location and orientation, have limitations in managing the skewed distribution. Additionally, since UGVs are usually captured in a casual way with diverse setups and movements, no a priori assumption can be made to condense them in an index structure. To overcome the challenges, we propose a class of new R-tree-based index structures that effectively harness FOVs’ camera locations, orientations and view-distances, in tandem, for both filtering and optimization. We also present novel search strategies and algorithms for efficient range and directional queries on our indexes. Our experiments using both real-world and large synthetic video datasets (over 30 years’ worth of videos) demonstrate the scalability and efficiency of our proposed indexes and search algorithms.

[1]  Ken C. K. Lee,et al.  Nearest Surrounder Queries , 2010, IEEE Trans. Knowl. Data Eng..

[2]  Jimeng Sun,et al.  The TPR*-Tree: An Optimized Spatio-Temporal Access Method for Predictive Queries , 2003, VLDB.

[3]  Cyrus Shahabi,et al.  An efficient index structure for large-scale geo-tagged video databases , 2014, SIGSPATIAL/GIS.

[4]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[5]  Roger Zimmermann,et al.  Relevance ranking in georeferenced video search , 2009, Multimedia Systems.

[6]  Shashi Shekhar,et al.  Object-Based Directional Query Processing in Spatial Databases , 2003, IEEE Trans. Knowl. Data Eng..

[7]  Cyrus Shahabi,et al.  MediaQ: mobile multimedia management system , 2014, MMSys '14.

[8]  Shashi Shekhar,et al.  An Object Model of Direction and Its Implications , 1999, GeoInformatica.

[9]  He Ma,et al.  Large-scale geo-tagged video indexing and queries , 2014, GeoInformatica.

[10]  Ben Shneiderman,et al.  Data structures for dynamic queries: an analytical and experimental evaluation , 1994, AVI '94.

[11]  Roger Zimmermann,et al.  Generating synthetic meta-data for georeferenced video management , 2010, GIS '10.

[12]  Ken C. K. Lee,et al.  Nearest Surrounder Queries , 2006, IEEE Transactions on Knowledge and Data Engineering.

[13]  Yannis Theodoridis,et al.  Supporting Direction Relations in Spatial Database systems , 1996 .

[14]  Roger Zimmermann,et al.  Viewable scene modeling for geospatial video search , 2008, ACM Multimedia.

[15]  Jinha Kim,et al.  GeoTree: Using spatial information for georeferenced video search , 2014, Knowl. Based Syst..

[16]  Roger Zimmermann,et al.  Automatic tag generation and ranking for sensor-rich outdoor videos , 2011, MM '11.

[17]  Ravi Krishnamurthy,et al.  The Multilevel Grid File - A Dynamic Hierarchical Multidimensional File Structure , 1991, DASFAA.

[18]  J. Blat,et al.  VideoGIS: Segmenting and indexing video based on geographic information , 2002 .

[19]  Tao Chen,et al.  Cardinal directions between complex regions , 2012, TODS.

[20]  Jing Xu,et al.  DESKS: Direction-Aware Spatial Keyword Search , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[21]  Gal Ashour,et al.  Efficient storage and retrieval of geo-referenced video from moving sensors , 2013, SIGSPATIAL/GIS.

[22]  Kentaro Toyama,et al.  Geographic location tags on digital images , 2003, ACM Multimedia.

[23]  Allen R. Hanson,et al.  An efficient method for geo-referenced video mosaicing for environmental monitoring , 2005, Machine Vision and Applications.