Improving RGB-D SLAM in dynamic environments: A motion removal approach

Abstract Visual Simultaneous Localization and Mapping (SLAM) based on RGB-D data has developed as a fundamental approach for robot perception over the past decades. There is an extensive literature regarding RGB-D SLAM and its applications. However, most of existing RGB-D SLAM methods assume that the traversed environments are static during the SLAM process. This is because moving objects in dynamic environments can severely degrade the SLAM performance. The static world assumption limits the applications of RGB-D SLAM in dynamic environments. In order to address this problem, we proposed a novel RGB-D data-based motion removal approach and integrated it into the front end of RGB-D SLAM. The motion removal approach acted as a pre-processing stage to filter out data that were associated with moving objects. We conducted experiments using a public RGB-D dataset. The results demonstrated that the proposed motion removal approach was able to effectively improve RGB-D SLAM in various challenging dynamic environments.

[1]  Sebastian Thrun,et al.  Learning to Segment and Track in RGBD , 2012, WAFR.

[2]  Gamini Dissanayake,et al.  A robust RGB-D SLAM algorithm , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Wolfram Burgard,et al.  3-D Mapping With an RGB-D Camera , 2014, IEEE Transactions on Robotics.

[4]  Yuxiang Sun,et al.  Motion removal from moving platforms: An RGB-D data-based motion detection, tracking and segmentation approach , 2015, 2015 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[5]  Antoine Vacavant,et al.  A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos , 2014, Comput. Vis. Image Underst..

[6]  Wolfram Burgard,et al.  Probabilistic Robotics (Intelligent Robotics and Autonomous Agents) , 2005 .

[7]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Cyrill Stachniss,et al.  Simultaneous Localization and Mapping , 2016, Springer Handbook of Robotics, 2nd Ed..

[9]  Roland Siegwart,et al.  Robust visual inertial odometry using a direct EKF-based approach , 2015, IROS 2015.

[10]  James W. Davis,et al.  Exploiting Temporal Geometry for Moving Camera Background Subtraction , 2014, 2014 22nd International Conference on Pattern Recognition.

[11]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[12]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Jitendra Malik,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis , 2022 .

[14]  Dieter Fox,et al.  RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments , 2012, Int. J. Robotics Res..

[15]  Mubarak Shah,et al.  Detection of Independently Moving Objects in Non-planar Scenes via Multi-Frame Monocular Epipolar Constraint , 2012, ECCV.

[16]  Wolfram Burgard,et al.  A Tutorial on Graph-Based SLAM , 2010, IEEE Intelligent Transportation Systems Magazine.

[17]  John J. Leonard,et al.  Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age , 2016, IEEE Transactions on Robotics.

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[19]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[20]  Bohyung Han,et al.  Modeling and segmentation of floating foreground and background in videos , 2012, Pattern Recognit..

[21]  Shoudong Huang,et al.  Towards dense moving object segmentation based robust dense RGB-D SLAM in dynamic scenarios , 2014, 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV).

[22]  François Michaud,et al.  Online global loop closure detection for large-scale multi-session graph-based SLAM , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Hyung Jin Chang,et al.  Detection of Moving Objects with Non-stationary Cameras in 5.8ms: Bringing Motion Detection to Your Mobile Device , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[24]  Petar M. Djuric,et al.  Resampling algorithms and architectures for distributed particle filters , 2005, IEEE Transactions on Signal Processing.

[25]  Daniel Cremers,et al.  Robust odometry estimation for RGB-D cameras , 2013, 2013 IEEE International Conference on Robotics and Automation.

[26]  Silvio Savarese,et al.  Visually bootstrapped generalized ICP , 2011, 2011 IEEE International Conference on Robotics and Automation.

[27]  Andrea Fusiello,et al.  Segmentation and tracking of multiple video objects , 2007, Pattern Recognit..

[28]  Gurjit Singh Walia,et al.  Recent advances on multicue object tracking: a survey , 2016, Artificial Intelligence Review.

[29]  Feng Liu,et al.  Learning color and locality cues for moving object detection and segmentation , 2009, CVPR.

[30]  Wolfram Burgard,et al.  G2o: A general framework for graph optimization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[31]  K. Madhava Krishna,et al.  Motion segmentation of multiple objects from a freely moving monocular camera , 2012, 2012 IEEE International Conference on Robotics and Automation.

[32]  Francesca Murabito,et al.  Superpixel-based video object segmentation using perceptual organization and location prior , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Jing Li Wang,et al.  Color image segmentation: advances and prospects , 2001, Pattern Recognit..

[34]  Elise Lachat,et al.  Assessment and Calibration of a RGB-D Camera (Kinect v2 Sensor) Towards a Potential Use for Close-Range 3D Modeling , 2015, Remote. Sens..

[35]  Daniel Cremers,et al.  Dense visual SLAM for RGB-D cameras , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[36]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[37]  Niko Sünderhauf,et al.  Switchable constraints for robust pose graph SLAM , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[38]  Richard M. Karp,et al.  On-Line Algorithms Versus Off-Line Algorithms: How Much is it Worth to Know the Future? , 1992, IFIP Congress.

[39]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[40]  Thomas Moore,et al.  A Generalized Extended Kalman Filter Implementation for the Robot Operating System , 2014, IAS.