Gift: A geospatial image and video filtering tool for computer vision applications with geo-tagged mobile videos

In this paper, we propose a novel geospatial image and video filtering tool (GIFT) to select the most relevant input images and videos for computer vision applications with geotagged mobile videos. GIFT tightly couples mobile media content and their geospatial metadata for fine granularity video manipulation and intelligently indexes FOVs (Field of View) to deal with large volumes of data. To demonstrate the effectiveness of GIFT, we introduce an end-to-end application that utilizes mobile videos to achieve persistent target tracking over large space and time. Our experimental results show promising performance of vision applications with GIFT in terms of lower communication load, improved efficiency, accuracy and scalability.

[1]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[2]  Jiebo Luo,et al.  Estimating the camera direction of a geotagged image using reference images , 2014, Pattern Recognit..

[3]  Roger Zimmermann,et al.  Generating synthetic meta-data for georeferenced video management , 2010, GIS '10.

[4]  David S. Munro,et al.  Topology Estimation for Thousand-Camera Surveillance Networks , 2007, 2007 First ACM/IEEE International Conference on Distributed Smart Cameras.

[5]  Roger Zimmermann,et al.  Viewable scene modeling for geospatial video search , 2008, ACM Multimedia.

[6]  Changhu Wang,et al.  Photo2Trip: generating travel routes from geo-tagged photos for trip planning , 2010, ACM Multimedia.

[7]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  Cyrus Shahabi,et al.  MediaQ: mobile multimedia management system , 2014, MMSys '14.

[9]  Mubarak Shah,et al.  Tracking across multiple cameras with disjoint views , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[10]  Gérard G. Medioni,et al.  Exploring context information for inter-camera multiple target tracking , 2014, IEEE Winter Conference on Applications of Computer Vision.

[11]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.