Viewable scene modeling for geospatial video search

Video sensors are becoming ubiquitous and the volume of captured video material is very large. Therefore, tools for searching video databases are indispensable. Current techniques that extract features purely based on the visual signals of a video are struggling to achieve good results. By considering video related meta-information, more relevant and precisely delimited search results can be obtained. In this study we propose a novel approach for querying videos based on the notion that the geographical location of the captured scene in addition to the location of a camera can provide valuable information and may be used as a search criterion in many applications. This study provides an estimation model of the viewable area of a scene for indexing and searching and reports on a prototype implementation. Among our objectives is to stimulate a discussion of these topics in the research community as information fusion of different georeferenced data sources is becoming increasingly important. Initial results illustrate the feasibility of the proposed approach.

[1]  C. H. Graham,et al.  Vision and visual perception , 1965 .

[2]  Jong-Hyun Park,et al.  The interactive geographic video , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[3]  Timos K. Sellis,et al.  Spatio-temporal indexing for large multimedia applications , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[4]  Carlo Torniai,et al.  Sharing, Discovering and Browsing Photo Collections through RDF geo-metadata , 2006, SWAP.

[5]  Marc Gelgon,et al.  Building and tracking hierarchical geographical & temporal partitions for image collection management on mobile devices , 2005, MULTIMEDIA '05.

[6]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[7]  H. Garcia-Molina,et al.  Automatic organization for digital photographs with geographic coordinates , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[8]  Kentaro Toyama,et al.  Geographic location tags on digital images , 2003, ACM Multimedia.

[9]  Peter Fröhlich,et al.  A mobile application framework for the geospatial web , 2007, WWW '07.

[10]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[11]  Prashant J. Shenoy,et al.  SEVA: Sensor-enhanced video annotation , 2009, TOMCCAP.

[12]  Timos K. Sellis,et al.  Spatio-temporal composition and indexing for large multimedia applications , 1998, Multimedia Systems.

[13]  Carlo Torniai,et al.  Sharing, Discovering and Browsing Geotagged Pictures on the World Wide Web , 2007, The Geospatial Web.

[14]  Katsumi Tanaka,et al.  3D viewpoint-based photo search and information browsing , 2005, SIGIR '05.

[15]  Jong-Hun Lee,et al.  MPEG-7 metadata for video-based GIS applications , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[16]  Kerry Rodden,et al.  How do people manage their digital photographs? , 2003, CHI '03.

[17]  Yonatan Wexler,et al.  Hierarchical photo organization using geo-relevance , 2007, GIS.