论文信息 - HM$^4$: Hidden Markov Model With Memory Management for Visual Place Recognition

HM$^4$: Hidden Markov Model With Memory Management for Visual Place Recognition

Visual placerecognition needs to be robust against appearance variability due to natural and man-made causes. Training data collection should thus be an ongoing process to allow continuous appearance changes to be recorded. However, this creates an unboundedly-growing database that poses time and memory scalability challenges for place recognition methods. To tackle the scalability issue for visual place recognition in autonomous driving, we develop a Hidden Markov Model approach with a two-tiered memory management. Our algorithm, dubbed HM$^4$, exploits temporal look-ahead to transfer promising candidate images between passive storage and active memory when needed. The inference process takes into account both promising images and a coarse representations of the full database. We show that this allows constant time and space inference for a fixed coverage area. The coarse representations can also be updated incrementally to absorb new data. To further reduce the memory requirements, we derive a compact image representation inspired by Locality Sensitive Hashing (LSH). Through experiments on real world data, we demonstrate the excellent scalability and accuracy of the approach under appearance changes and provide comparisons against state-of-the-art techniques.

[1] Masatoshi Okutomi,et al. 24/7 Place Recognition by View Synthesis , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] Tat-Jun Chin,et al. SPRINT: Subgraph Place Recognition for INtelligent Transportation , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[3] Sven Kosub,et al. A note on the triangle inequality for the Jaccard distance , 2016, Pattern Recognit. Lett..

[4] Michael Bosse,et al. The gist of maps - summarizing experience for lifelong localization , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[5] Paul Newman,et al. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[6] Alexandr Andoni,et al. Practical and Optimal LSH for Angular Distance , 2015, NIPS.

[7] Tat-Jun Chin,et al. Scalable Place Recognition Under Appearance Change for Autonomous Driving , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8] Joshua Zhexue Huang,et al. Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[9] Andrew Zisserman,et al. All About VLAD , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Svetlana Lazebnik,et al. Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.

[11] Winston Churchill,et al. Experience-based navigation for long-term localisation , 2013, Int. J. Robotics Res..

[12] Dorian Gálvez-López,et al. Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[13] Michael Milford,et al. Addressing Challenging Place Recognition Tasks Using Generative Adversarial Networks , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[14] Paul Newman,et al. 1 year, 1000 km: The Oxford RobotCar dataset , 2017, Int. J. Robotics Res..

[15] Jan Kautz,et al. Geometry-Aware Learning of Maps for Camera Localization , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16] Henrik Andreasson,et al. Lightweight, Viewpoint-Invariant Visual Place Recognition in Changing Environments , 2018, IEEE Robotics and Automation Letters.

[17] Gordon Wyeth,et al. SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[18] F. Michaud,et al. Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation , 2013, IEEE Transactions on Robotics.

[19] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20] Michael Milford,et al. Fast, Compact and Highly Scalable Visual Place Recognition through Sequence-based Matching of Overloaded Representations , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[21] Antonio Criminisi,et al. Epitomic Location Recognition , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] Gordon Wyeth,et al. FAB-MAP + RatSLAM: Appearance-based SLAM for multiple times of day , 2010, 2010 IEEE International Conference on Robotics and Automation.

[23] David G. Lowe,et al. Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.