论文信息 - Satellite image-based localization via learned embeddings

Satellite image-based localization via learned embeddings

We propose a vision-based method that localizes a ground vehicle using publicly available satellite imagery as the only prior knowledge of the environment. Our approach takes as input a sequence of ground-level images acquired by the vehicle as it navigates, and outputs an estimate of the vehicle's pose relative to a georeferenced satellite image. We overcome the significant viewpoint and appearance variations between the images through a neural multi-view model that learns location-discriminative embeddings in which ground-level images are matched with their corresponding satellite view of the scene. We use this learned function as an observation model in a filtering framework to maintain a distribution over the vehicle's pose. We evaluate our method on different benchmark datasets and demonstrate its ability localize ground-level images in environments novel relative to training, despite the challenges of significant viewpoint and appearance variations.

Matthew R. Walter | Dong-Ki Kim | Dong-Ki Kim

[1] P. Fearnhead,et al. Improved particle filter for nonlinear problems , 1999 .

[2] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3] Wolfram Burgard,et al. Robust vision-based localization by combining an image-retrieval system with Monte Carlo localization , 2005, IEEE Transactions on Robotics.

[4] Yann LeCun,et al. Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6] Wei Zhang,et al. Image Based Localization in Urban Environments , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[7] Jana Kosecka,et al. Probabilistic location recognition using reduced feature set , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[8] David Filliat,et al. A visual bag of words method for interactive qualitative localization and mapping , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[9] Achim J. Lilienthal,et al. SIFT, SURF and Seasons: Long-term Outdoor Localization Using Local Features , 2007, EMCR.

[10] Robert Pless,et al. Geolocating Static Cameras , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11] Richard Szeliski,et al. City-Scale Location Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Alexei A. Efros,et al. IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13] Paul Newman,et al. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[14] Jon M. Kleinberg,et al. Mapping the world's photos , 2009, WWW '09.

[15] Christopher Hunt,et al. Notes on the OpenSURF Library , 2009 .

[16] Michael Warren,et al. Unaided stereo vision based pose estimation , 2010, ICRA 2010.

[17] Albert S. Huang,et al. Ground robot navigation using uncalibrated cameras , 2010, 2010 IEEE International Conference on Robotics and Automation.

[18] Mubarak Shah,et al. Accurate Image Localization Based on Google Maps Street View , 2010, ECCV.

[19] Gordon Wyeth,et al. FAB-MAP + RatSLAM: Appearance-based SLAM for multiple times of day , 2010, 2010 IEEE International Conference on Robotics and Automation.

[20] Hui Cheng,et al. Geo-localization of street views with aerial image databases , 2011, ACM Multimedia.

[21] Xin Chen,et al. City-scale landmark identification on mobile devices , 2011, CVPR 2011.

[22] Paul Newman,et al. Practice makes perfect? Managing and leveraging visual experiences for lifelong navigation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[23] Takeo Kanade,et al. Real-time topometric localization , 2012, 2012 IEEE International Conference on Robotics and Automation.

[24] Gordon Wyeth,et al. SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[25] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[26] Davide Scaramuzza,et al. MAV urban localization from Google street view data , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[27] Serge J. Belongie,et al. Cross-View Image Geolocalization , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Andreas Geiger,et al. Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[29] Niko Sünderhauf,et al. Are We There Yet? Challenging SeqSLAM on a 3000 km Journey Across All Four Seasons , 2013 .

[30] Guang-Zhong Yang,et al. Feature Co-occurrence Maps: Appearance-based localisation throughout the day , 2013, 2013 IEEE International Conference on Robotics and Automation.

[31] Niko Sünderhauf,et al. Appearance change prediction for long-term navigation across seasons , 2013, 2013 European Conference on Mobile Robots.

[32] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[33] Wolfram Burgard,et al. Robust Visual Robot Localization Across Seasons Using Network Flows , 2014, AAAI.

[34] Paul Newman,et al. Shady dealings: Robust, long-term visual localisation using illumination invariance , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[35] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[36] Ming Yang,et al. DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[37] Paul Newman,et al. Scene Signatures: Localised and Point-less Features for Localisation , 2014, Robotics: Science and Systems.

[38] Gordon Wyeth,et al. Transforming morning to afternoon using linear regression techniques , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[39] Daniel Huber,et al. Vision based robot localization by ground to satellite matching in GPS-denied situations , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40] Michael Milford,et al. Place Recognition with ConvNet Landmarks: Viewpoint-Robust, Condition-Robust, Training-Free , 2015, Robotics: Science and Systems.

[41] Niko Sünderhauf,et al. On the performance of ConvNet features for place recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[42] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[43] Scott Workman,et al. Wide-Area Image Geolocalization with Aerial Reference Imagery , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[44] Serge J. Belongie,et al. Learning deep representations for ground-to-aerial geolocalization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[46] Matthew R. Walter,et al. Accurate Vision-based Vehicle Localization using Satellite Imagery , 2015, ArXiv.

[47] Daniel Huber,et al. Vision-based robot localization across seasons and in remote locations , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).