Dynamic SLAM: The Need For Speed

The static world assumption is standard in most simultaneous localisation and mapping (SLAM) algorithms. Increased deployment of autonomous systems to unstructured dynamic environments is driving a need to identify moving objects and estimate their velocity in real-time. Most existing SLAM based approaches rely on a database of 3D models of objects or impose significant motion constraints. In this paper, we propose a new feature-based, model-free, object-aware dynamic SLAM algorithm that exploits semantic segmentation to allow estimation of motion of rigid objects in a scene without the need to estimate the object poses or have any prior knowledge of their 3D models. The algorithm generates a map of dynamic and static structure and has the ability to extract velocities of rigid moving objects in the scene. Its performance is demonstrated on simulated, synthetic and real-world datasets.

[1]  Hugh F. Durrant-Whyte,et al.  Simultaneous Localization, Mapping and Moving Object Tracking , 2007, Int. J. Robotics Res..

[2]  K. Madhava Krishna,et al.  Realtime multibody visual SLAM with a smoothly moving monocular camera , 2011, 2011 International Conference on Computer Vision.

[3]  Wolfram Burgard,et al.  Motion-based detection and tracking in 3D LiDAR scans , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Qiao Wang,et al.  VirtualWorlds as Proxy for Multi-object Tracking Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Juan Andrade-Cetto,et al.  Information-Based Compact Pose SLAM , 2010, IEEE Transactions on Robotics.

[6]  Henrik I. Christensen,et al.  SLAM with Expectation Maximization for moveable object tracking , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Pavel Zemcík,et al.  Efficient implementation for block matrix operations for nonlinear least squares problems in robotic applications , 2013, 2013 IEEE International Conference on Robotics and Automation.

[8]  Jungwon Lee,et al.  Deep Robust Single Image Depth Estimation Neural Network Using Scene Understanding , 2019, CVPR Workshops.

[9]  Ryosuke Shibasaki,et al.  SLAM in a dynamic large outdoor environment using a laser scanner , 2008, 2008 IEEE International Conference on Robotics and Automation.

[10]  Shichao Yang,et al.  CubeSLAM: Monocular 3-D Object SLAM , 2018, IEEE Transactions on Robotics.

[11]  Paul Newman,et al.  Multimotion Visual Odometry (MVO): Simultaneous Estimation of Camera and Third-Party Motions , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Wolfram Burgard,et al.  Map building with mobile robots in populated environments , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Sebastian Thrun,et al.  Online simultaneous localization and mapping with detection and tracking of moving objects: theory and results from a ground vehicle in crowded urban areas , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[15]  Paul H. J. Kelly,et al.  SLAM++: Simultaneous Localisation and Mapping at the Level of Objects , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Dorian Gálvez-López,et al.  Real-time Monocular Object SLAM , 2015, Robotics Auton. Syst..

[17]  Mark E. Campbell,et al.  Rao-Blackwellized Particle Filtering for Mapping Dynamic Environments , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[18]  Hermann Winner,et al.  Adaptive Cruise Control , 2015, Handbuch Fahrerassistenzsysteme.

[19]  Frank Dellaert,et al.  iSAM2: Incremental smoothing and mapping using the Bayes tree , 2012, Int. J. Robotics Res..

[20]  Vincent Lepetit,et al.  Learning descriptors for object recognition and 3D pose estimation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Viorela Ila,et al.  SLAM++ 1 -A highly efficient and temporally scalable incremental SLAM framework , 2017, Int. J. Robotics Res..

[22]  Stefan Roth,et al.  MOT16: A Benchmark for Multi-Object Tracking , 2016, ArXiv.

[23]  Javier Civera,et al.  DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes , 2018, IEEE Robotics and Automation Letters.

[24]  Frank Dellaert,et al.  Square Root SAM: Simultaneous Localization and Mapping via Square Root Information Smoothing , 2006, Int. J. Robotics Res..

[25]  Wolfram Burgard,et al.  Simultaneous Localisation and Mapping in Dynamic Environments (SLAMIDE) with Reversible Data Association , 2008 .

[26]  Gaurav S. Sukhatme,et al.  Mobile Robot Simultaneous Localization and Mapping in Dynamic Environments , 2005, Auton. Robots.

[27]  John J. Leonard,et al.  Dynamic pose graph SLAM: Long-term mapping in low dynamic environments , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Pavel Zemcík,et al.  Incremental Block Cholesky Factorization for Nonlinear Least Squares in Robotics , 2013, Robotics: Science and Systems.

[29]  Jan Kautz,et al.  PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[31]  Charles E. Thorpe,et al.  Perception for collision avoidance and autonomous driving , 2003 .

[32]  Moritz Kampelmühler,et al.  Camera-based vehicle velocity estimation from monocular video , 2018, ArXiv.

[33]  F. Dellaert Factor Graphs and GTSAM: A Hands-on Introduction , 2012 .

[34]  Raquel Urtasun,et al.  Efficient Joint Segmentation, Occlusion Labeling, Stereo and Flow Estimation , 2014, ECCV.

[35]  K. Madhava Krishna,et al.  Dynamic body VSLAM with semantic constraints , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[36]  Gregory S. Chirikjian,et al.  Pose Changes From a Different Point of View , 2017 .

[37]  Dieter Fox,et al.  SE3-nets: Learning rigid body motion using deep neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[38]  Wolfram Burgard,et al.  Map building with mobile robots in dynamic environments , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[39]  Kurt Konolige,et al.  Double window optimisation for constant time visual SLAM , 2011, 2011 International Conference on Computer Vision.

[40]  Ian D. Reid,et al.  Simultaneous Localisation and Mapping in Dynamic Environments (SLAMIDE) with Reversible Data Associa , 2007, Robotics: Science and Systems.