EchoVPR: Echo State Networks for Visual Place Recognition

Recognising previously visited locations is an important, but unsolved, task in autonomous navigation. Current visual place recognition (VPR) benchmarks typically challenge models to recover the position of a query image (or images) from sequential datasets that include both spatial and temporal components. Recently, Echo State Network (ESN) varieties have proven particularly powerful at solving machine learning tasks that require spatio-temporal modelling. These networks are simple, yet powerful neural architectures that—exhibiting memory over multiple time-scales and non-linear high-dimensional representations—can discover temporal relations in the data while still maintaining linearity in the learning. In this paper, we present a series of ESNs and analyse their applicability to the VPR problem. We report that the addition of ESNs to preprocessed convolutional neural networks led to a dramatic boost in performance in comparison to non-recurrent networks in five out of six standard benchmarks (GardensPoint, SPEDTest, ESSEX3IN1, Oxford RobotCar, and Nordland) demonstrating that ESNs are able to capture the temporal structure inherent in VPR problems. Moreover, we show that models that include ESNs can outperform class-leading VPR models which also exploit the sequential dynamics of the data. Finally, our results demonstrate that ESNs also improve generalisation abilities, robustness, and accuracy further supporting their suitability to VPR applications.

[1]  Yubin Kuang,et al.  Mapillary Street-Level Sequences: A Dataset for Lifelong Place Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Eric T. Trautman,et al.  A Complete Electron Microscopy Volume of the Brain of Adult Drosophila melanogaster , 2017, Cell.

[3]  Niko Sünderhauf,et al.  Are We There Yet? Challenging SeqSLAM on a 3000 km Journey Across All Four Seasons , 2013 .

[4]  Torsten Sattler,et al.  Long-Term Visual Localization Revisited , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Lei Wang,et al.  Visual place recognition: A survey from deep learning perspective , 2020, Pattern Recognit..

[6]  Claudio Gallicchio,et al.  Design of deep echo state networks , 2018, Neural Networks.

[7]  Herbert Jaeger,et al.  Discovering multiscale dynamical features with hierarchical Echo State Networks , 2008 .

[8]  Michael Milford,et al.  Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Paolo Del Giudice,et al.  Exploiting Multiple Timescales in Hierarchical Echo State Networks , 2021, Frontiers in Applied Mathematics and Statistics.

[10]  Gordon Wyeth,et al.  SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[11]  M. Giurfa,et al.  Neural substrate for higher-order learning in an insect: Mushroom bodies are necessary for configural discriminations , 2015, Proceedings of the National Academy of Sciences.

[12]  Cédric Hartland,et al.  Using echo state networks for robot navigation behavior acquisition , 2007, 2007 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[13]  Michael Milford,et al.  VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change , 2020, International Journal of Computer Vision.

[14]  Paul Newman,et al.  1 year, 1000 km: The Oxford RobotCar dataset , 2017, Int. J. Robotics Res..

[15]  T. van der Zant,et al.  Identification of motion with echo state network , 2004, Oceans '04 MTS/IEEE Techno-Ocean '04 (IEEE Cat. No.04CH37600).

[16]  Herbert Jaeger,et al.  Optimization and applications of echo state networks with leaky- integrator neurons , 2007, Neural Networks.

[17]  Michael Milford,et al.  Vision-based place recognition: how low can you go? , 2013, Int. J. Robotics Res..

[18]  Michael Milford,et al.  Deep learning features at scale for visual place recognition , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[19]  M. Giurfa,et al.  GABAergic feedback signaling into the calyces of the mushroom bodies enables olfactory reversal learning in honey bees , 2015, Front. Behav. Neurosci..

[20]  Herbert Jaeger,et al.  The''echo state''approach to analysing and training recurrent neural networks , 2001 .

[21]  Masatoshi Okutomi,et al.  24/7 Place Recognition by View Synthesis , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Klaus D. McDonald-Maier,et al.  Memorable Maps: A Framework for Re-Defining Places in Visual Place Recognition , 2018, IEEE Transactions on Intelligent Transportation Systems.

[23]  Ajay Narendra,et al.  A Hybrid Compact Neural Architecture for Visual Place Recognition , 2020, IEEE Robotics and Automation Letters.

[24]  Michael Milford,et al.  A Hierarchical Dual Model of Environment- and Place-Specific Utility for Visual Place Recognition , 2021, IEEE Robotics and Automation Letters.

[25]  S. Fahrbach Structure of the mushroom bodies of the insect brain. , 2006, Annual review of entomology.

[26]  Ali Deihimi,et al.  Application of echo state networks in short-term electric load forecasting , 2012 .

[27]  Lingqiao Liu,et al.  Learning Context Flexible Attention Model for Long-Term Visual Place Recognition , 2018, IEEE Robotics and Automation Letters.

[28]  Barbara Caputo,et al.  A Survey on Deep Visual Place Recognition , 2021, IEEE Access.

[29]  Eleni Vasilaki,et al.  Abstract concept learning in a simple neural network inspired by the insect brain , 2018, bioRxiv.

[30]  Louis K. Scheffer,et al.  A connectome of a learning and memory center in the adult Drosophila brain , 2017, eLife.

[31]  Tomasz Malisiewicz,et al.  SuperGlue: Learning Feature Matching With Graph Neural Networks , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[33]  Michael Milford,et al.  Where is your place, Visual Place Recognition? , 2021, IJCAI.

[34]  Michael Milford,et al.  Multi-Process Fusion: Visual Place Recognition Using Multiple Image Processing Methods , 2019, IEEE Robotics and Automation Letters.

[35]  Andrew Philippides,et al.  Insect Inspired View Based Navigation Exploiting Temporal Information , 2020, Living Machines.

[36]  M. Heisenberg,et al.  An engram found? Evaluating the evidence from fruit flies , 2004, Current Opinion in Neurobiology.

[37]  R. Menzel,et al.  Cognitive architecture of a mini-brain: the honeybee , 2001, Trends in Cognitive Sciences.

[38]  Paola Cognigni,et al.  Do the right thing: neural network mechanisms of memory formation, expression and update in Drosophila , 2018, Current Opinion in Neurobiology.

[39]  Michael Milford,et al.  Event-Based Visual Place Recognition With Ensembles of Temporal Windows , 2020, IEEE Robotics and Automation Letters.

[40]  Brett Browning,et al.  Visual place recognition using HMM sequence matching , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[41]  Benjamin Schrauwen,et al.  Recurrent Kernel Machines: Computing with Infinite Echo State Networks , 2012, Neural Computation.

[42]  Barbara Webb,et al.  Spatio-temporal Memory for Navigation in a Mushroom Body Model , 2020, bioRxiv.

[43]  Paul-Gerhard Plöger,et al.  Echo State Networks for Mobile Robot Modeling and Control , 2003, RoboCup.

[44]  Mantas Lukosevicius,et al.  A Practical Guide to Applying Echo State Networks , 2012, Neural Networks: Tricks of the Trade.

[45]  Meng Wang,et al.  Gap junction networks in mushroom bodies participate in visual learning and memory in Drosophila , 2016, eLife.

[46]  Eleni Vasilaki,et al.  SpaRCe: Improved Learning of Reservoir Computing Systems Through Sparse Representations , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[47]  Bingyi Cao,et al.  Unifying Deep Local and Global Features for Image Search , 2020, ECCV.

[48]  Jun Wang,et al.  Chaotic Time Series Prediction Based on a Novel Robust Echo State Network , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[49]  Peter I. Corke,et al.  Visual Place Recognition: A Survey , 2016, IEEE Transactions on Robotics.

[50]  Torsten Sattler,et al.  Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[51]  Michael Milford,et al.  SeqNet: Learning Descriptors for Sequence-Based Hierarchical Place Recognition , 2021, IEEE Robotics and Automation Letters.

[52]  Masatoshi Okutomi,et al.  Visual Place Recognition with Repetitive Structures , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.