Unsupervised Scene Categorization, Path Segmentation and Landmark Extraction while Traveling Path

Segmenting the movement path is an essential requirement of intelligent mobile robots. It assists intelligent systems in gaining a better understanding of the scene and identifying re-visited spaces. Moreover, it helps intelligent robots quantize the wide scene into sub-spaces that visually represent the same content-for instance, distinguishing rooms and kitchen in an indoor environment. This paper proposes an unsupervised approach to understand the transition of the scene while a robot is moving and helps to extract sub-spaces that visually represent a similar environment. The proposed approach benefits from a pre-trained deep network architecture to extract a description (feature representation) for the mobile robot's visual information at each time step. Then, based on the pairwise distance of the feature representations, sub-spaces of the scene and transition points are extracted.

[1]  Mubarak Shah,et al.  Image Geo-Localization Based on MultipleNearest Neighbor Feature Matching UsingGeneralized Graphs , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Dorian Gálvez-López,et al.  Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[3]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Han Wang,et al.  A New Approach to Train Convolutional Neural Networks for Real-Time 6-DOF Camera Relocalization , 2018, 2018 IEEE 14th International Conference on Control and Automation (ICCA).

[5]  Han Wang,et al.  OriNet: Robust 3-D Orientation Estimation With a Single Particular IMU , 2020, IEEE Robotics and Automation Letters.

[6]  Mubarak Shah,et al.  Accurate Image Localization Based on Google Maps Street View , 2010, ECCV.

[7]  Michael Milford,et al.  Convolutional Neural Network-based Place Recognition , 2014, ICRA 2014.

[8]  Pascal Fua,et al.  Worldwide Pose Estimation Using 3D Point Clouds , 2012, ECCV.

[9]  Noah Snavely,et al.  Graph-Based Discriminative Learning for Location Recognition , 2013, International Journal of Computer Vision.

[10]  Roberto Cipolla,et al.  Modelling uncertainty in deep learning for camera relocalization , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Keyu Wu,et al.  DeepDSAIR: Deep 6-DOF camera relocalization using deblurred semantic-aware image representation for large-scale outdoor environments , 2019, Image Vis. Comput..

[12]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[13]  C. V. Jawahar,et al.  Improved Visual Relocalization by Discovering Anchor Points , 2018, BMVC.

[14]  Tomás Pajdla,et al.  Learning and Calibrating Per-Location Classifiers for Visual Place Recognition , 2013, CVPR.

[15]  Torsten Sattler,et al.  Large-Scale Location Recognition and the Geometric Burstiness Problem , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Daniel Cremers,et al.  Image-Based Localization Using LSTMs for Structured Feature Correlation , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[17]  Torsten Sattler,et al.  Efficient & Effective Prioritized Matching for Large-Scale Image-Based Localization , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[19]  Keyu Wu,et al.  AbolDeepIO: A Novel Deep Inertial Odometry Network for Autonomous Vehicles , 2020, IEEE Transactions on Intelligent Transportation Systems.

[20]  Keyu Wu,et al.  From Local Understanding to Global Regression in Monocular Visual Odometry , 2020, Int. J. Pattern Recognit. Artif. Intell..

[21]  Han Wang,et al.  Towards Utilizing Deep Uncertainty In Traditional SLAM , 2019, 2019 IEEE 15th International Conference on Control and Automation (ICCA).

[22]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[23]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Torsten Sattler,et al.  Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25]  Zuzana Kukelova,et al.  Real-Time Solution to the Absolute Pose Problem with Unknown Radial Distortion and Focal Length , 2013, 2013 IEEE International Conference on Computer Vision.