Large-scale geo-tagged video indexing and queries

With the wide spread of smartphones, a large number of user-generated videos are produced everyday. The embedded sensors, e.g., GPS and the digital compass, make it possible that videos are accessed based on their geo-properties. In our previous work, we have created a framework for integrated, sensor-rich video acquisition (with one instantiation implemented in the form of smartphone applications) which associates a continuous stream of location and viewing direction information with the collected videos, hence allowing them to be expressed and manipulated as spatio-temporal objects. These sensor meta-data are considerably smaller in size compared to the visual content and are helpful in effectively and efficiently searching for geo-tagged videos in large-scale repositories. In this study, we propose a novel three-level grid-based index structure and introduce a number of related query types, including typical spatial queries and ones based on bounded radius and viewing direction restriction. These two criteria are important in many video applications and we demonstrate the importance with a real-world dataset. Moreover, experimental results on a large-scale synthetic dataset show that our approach can provide a significant speed improvements of at least 30 %, considering a mix of queries, compared to a multi-dimensional R-tree implementation.

[1]  C. H. Graham,et al.  Vision and visual perception , 1965 .

[2]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[3]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[4]  Atsuyuki Okabe,et al.  Spatial Tessellations: Concepts and Applications of Voronoi Diagrams , 1992, Wiley Series in Probability and Mathematical Statistics.

[5]  Hari Balakrishnan,et al.  6th ACM/IEEE International Conference on on Mobile Computing and Networking (ACM MOBICOM ’00) The Cricket Location-Support System , 2022 .

[6]  Agnès Voisard,et al.  Spatial Databases: With Application to GIS , 2001 .

[7]  H. Buchner The Grid File : An Adaptable , Symmetric Multikey File Structure , 2001 .

[8]  J. Blat,et al.  VideoGIS: Segmenting and indexing video based on geographic information , 2002 .

[9]  Jong-Hun Lee,et al.  MPEG-7 metadata for video-based GIS applications , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[10]  Jong-Hyun Park,et al.  The interactive geographic video , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[11]  Divyakant Agrawal,et al.  Range and kNN Query Processing for Moving Objects in Grid Model , 2003, Mob. Networks Appl..

[12]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[13]  Allen R. Hanson,et al.  An efficient method for geo-referenced video mosaicing for environmental monitoring , 2005, Machine Vision and Applications.

[14]  David Eppstein,et al.  The skip quadtree: a simple dynamic data structure for multidimensional data , 2005, SCG.

[15]  Lars Kulik,et al.  The V*-Diagram: a query-dependent approach to moving KNN queries , 2008, Proc. VLDB Endow..

[16]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[17]  Roger Zimmermann,et al.  Viewable scene modeling for geospatial video search , 2008, ACM Multimedia.

[18]  Prashant J. Shenoy,et al.  SEVA: Sensor-enhanced video annotation , 2009, TOMCCAP.

[19]  Byunggu Yu,et al.  Vector model in support of versatile georeferenced video search , 2010, MMSys '10.

[20]  Roger Zimmermann,et al.  Generating synthetic meta-data for georeferenced video management , 2010, GIS '10.

[21]  Rongrong Ji,et al.  Active query sensing for mobile location search , 2011, ACM Multimedia.

[22]  He Ma,et al.  A Grid-Based Index and Queries for Large-Scale Geo-tagged Video Collections , 2012, DASFAA Workshops.