论文信息 - Vector model in support of versatile georeferenced video search

Vector model in support of versatile georeferenced video search

Increasingly geographic properties are being associated with videos, especially those captured from mobile cameras. The meta data from camera-attached sensors can be used to model the coverage area of the scene as a spatial object such that videos can be organized, indexed and searched based on their field of views (FOV). The most accurate representation of an FOV is through the geometric shape of a circular sector. However, spatial search and indexing methods are traditionally optimized for rectilinear shapes because of their simplicity. Established methods often use an approximation shape, such as a minimum bounding rectangle (MBR), to efficiently filter a large archive for possibly matching candidates. A second, refinement step is then applied to perform the time-consuming, precise matching function. MBR estimation has been successful for general spatial overlap queries, however it provides limited flexibility for georeferenced video search. In this study we propose a novel vector-based model for FOV estimation which provides a more versatile basis for georeferenced video search while providing competitive performance for the filter step. We demonstrate how the vector model can provide a unified method to perform traditional overlap queries while also enabling searches that, for example, concentrate on the vicinity of the camera's position or harness its view direction. To the best of our knowledge no comparable technique exists today.

[1] Kentaro Toyama,et al. Geographic location tags on digital images , 2003, ACM Multimedia.

[2] Mor Naaman,et al. Generating diverse and representative image search results for landmarks , 2008, WWW.

[3] Jack A. Orenstein. Spatial query processing in an object-oriented database system , 1986, SIGMOD '86.

[4] H. Garcia-Molina,et al. Automatic organization for digital photographs with geographic coordinates , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[5] Carlo Torniai,et al. Sharing, Discovering and Browsing Photo Collections through RDF geo-metadata , 2006, SWAP.

[6] Marc Gelgon,et al. Building and tracking hierarchical geographical & temporal partitions for image collection management on mobile devices , 2005, MULTIMEDIA '05.

[7] C. H. Graham,et al. Vision and visual perception , 1965 .

[8] Carlo Torniai,et al. Sharing, Discovering and Browsing Geotagged Pictures on the World Wide Web , 2007, The Geospatial Web.

[9] Yonatan Wexler,et al. Hierarchical photo organization using geo-relevance , 2007, GIS.

[10] Edward J. Delp,et al. Multimedia for mobile environment: image enhanced navigation , 2006, Electronic Imaging.

[11] Kerry Rodden,et al. How do people manage their digital photographs? , 2003, CHI '03.

[12] Roger Zimmermann,et al. Viewable scene modeling for geospatial video search , 2008, ACM Multimedia.

[13] Prashant J. Shenoy,et al. SEVA: Sensor-enhanced video annotation , 2009, TOMCCAP.

[14] Steven M. Seitz,et al. Scene Segmentation Using the Wisdom of Crowds , 2008, ECCV.

[15] Katsumi Tanaka,et al. 3D viewpoint-based photo search and information browsing , 2005, SIGIR '05.

[16] Hans-Peter Kriegel,et al. The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.