Sequential appearance-based Global Localization using an ensemble of kNN-DTW classifiers

The human Episodic Memory system stores sequences of events (episodes encoded in time and in space) experienced or imagined by an individual for later access to the episodes in whole or in part. Such ability provided by our Episodic Memory system is important, among other functions, for our localization in space throughout time. Inspired by that, in this paper, it is presented a new Sequential Appearance-Based approach to the Global Localization problem, dubbed SABGL. SABGL is based on an ensemble of kNN classifiers, where each classifier uses the Dynamic Time Warping (DTW) and the Hamming distance to compare binary features extracted from sequences of images. SABGL is designed to solve the global localization problem in two phases: mapping and localization. During mapping, it is trained with a sequence of images and associated locations that represents episodes experienced by an autonomous robot. During localization, it receives subsequences of images of the same environment and compares them to its previous experienced episodes, trying to recollect the most similar “experience” in time and space at once. Then, the system outputs the positions where it “believes” these images were captured. SABGL is compared with a previous single-image approach, named VibGL. Experimental results show that SABGL consistently outperforms VibGL in classification accuracy with higher precision in all maximum distance tolerance analyzed. For instance, given a maximum tolerance of 10m, SABGL is able to correctly localize an autonomous car 98% of the time.

[1]  Gordon Wyeth,et al.  Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System , 2008, IEEE Transactions on Robotics.

[2]  Gordon Wyeth,et al.  FAB-MAP + RatSLAM: Appearance-based SLAM for multiple times of day , 2010, 2010 IEEE International Conference on Robotics and Automation.

[3]  Sanjiv Singh,et al.  The DARPA Urban Challenge: Autonomous Vehicles in City Traffic, George Air Force Base, Victorville, California, USA , 2009, The DARPA Urban Challenge.

[4]  E. Maguire,et al.  The Human Hippocampus and Spatial and Episodic Memory , 2002, Neuron.

[5]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[6]  Gordon Wyeth,et al.  OpenFABMAP: An open source toolbox for appearance-based loop closure detection , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7]  E. Tulving,et al.  Episodic and semantic memory , 1972 .

[8]  J. O'Keefe,et al.  The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. , 1971, Brain research.

[9]  Eamonn J. Keogh,et al.  Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs , 2010, 2010 IEEE International Conference on Data Mining.

[10]  Michael Milford Visual Route Recognition with a Handful of Bits , 2012, Robotics: Science and Systems.

[11]  Gordon Wyeth,et al.  SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  Paul Newman,et al.  FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[13]  Mohamed A. Aly,et al.  Novel Methods for the Feature Subset Ensemble Approach , 2006 .

[14]  Paul Newman,et al.  Highly scalable appearance-only SLAM - FAB-MAP 2.0 , 2009, Robotics: Science and Systems.

[15]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[16]  Artur S. d'Avila Garcez,et al.  Fat-Fast VG-RAM WNN: A high performance approach , 2016, Neurocomputing.

[17]  Alberto Ferreira de Souza,et al.  Image-based mapping, global localization and position tracking using VG-RAM weightless neural networks , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[18]  E. Tulving Episodic memory: from mind to brain. , 2002, Annual review of psychology.

[19]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[20]  Juan J. Rodríguez,et al.  Multivariate Time Series Classification via Stacking of Univariate Classifiers , 2008 .

[21]  Edilson de Aguiar,et al.  Compressing VG-RAM WNN memory for lightweight applications , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[22]  Peter I. Corke,et al.  Visual Place Recognition: A Survey , 2016, IEEE Transactions on Robotics.

[23]  Alberto Ferreira de Souza,et al.  Image-based global localization using VG-RAM Weightless Neural Networks , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[24]  Gordon Wyeth,et al.  RatSLAM: a hippocampal model for simultaneous localization and mapping , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[25]  Alberto Ferreira de Souza,et al.  Face Recognition with VG-RAM Weightless Neural Networks , 2008, ICANN.

[26]  Michael Milford,et al.  Vision-based place recognition: how low can you go? , 2013, Int. J. Robotics Res..

[27]  Michael Milford,et al.  Vision-based Simultaneous Localization and Mapping in Changing Outdoor Environments , 2014, J. Field Robotics.

[28]  Andreas Geiger,et al.  Motion-without-structure: Real-time multipose optimization for accurate visual odometry , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[29]  Thomas P. Trappenberg,et al.  Continuous Attractor Neural Networks , 2005 .

[30]  Paul Newman,et al.  Appearance-only SLAM at large scale with FAB-MAP 2.0 , 2011, Int. J. Robotics Res..