Improving RGBD Indoor Mapping with IMU data

With the release of RGBD cameras (cameras that provide both RGB and depth information), researchers have started evaluating how these devices can be used in the fields of localization, mapping, and ubiquitous computing. Intel Seattle Research proposed an indoor mapping algorithm that makes use of such a camera. The algorithm is vulnerable to violations of the static-environment assumption and to image-based localization failures, which can be caused by, for example, featureless environments. The goal of this master's thesis is to augment the indoor mapping algorithm with additional Inertial Measurement Unit (IMU) data to make it more robust to these vulnerabilities. To this end, the characteristics and limitations of the Microsoft Kinect are investigated and an enhanced mapping algorithm is proposed. IMU orientation estimates are fused with pose estimates based on image data, and the fused estimate serves as the initial guess for the Iterative Closest Point (ICP) algorithm, which aligns point cloud data to create the final map. When visual localization fails, Intel's algorithm falls back on a constant-velocity assumption, whereas the IMU data provide more accurate orientation estimates than that assumption can. Under ideal mapping conditions, the IMU-enhanced algorithm shows mapping quality similar to that of the plain mapping algorithm. Although a series of corner-case tests shows that the IMU-enhanced algorithm was unable to improve on the results of the plain mapping algorithm, it potentially improves mapping quality in non-static environments.
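The difference between the constant-velocity fallback and an IMU-based orientation guess can be illustrated with a minimal sketch. It covers orientation only and is not the thesis implementation: the function names, the `(w, x, y, z)` quaternion layout, the fixed `imu_weight`, and the use of normalized linear interpolation as the fusion step are all assumptions made for illustration, since the abstract does not specify the fusion method.

```python
import math


def q_mul(a, b):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )


def q_conj(q):
    """Conjugate, which is the inverse for a unit quaternion."""
    w, x, y, z = q
    return (w, -x, -y, -z)


def q_normalize(q):
    """Scale a quaternion to unit length."""
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)


def q_nlerp(a, b, t):
    """Normalized linear interpolation from a (t=0) to b (t=1)."""
    # q and -q encode the same rotation; flip b so the short path is taken.
    if sum(x * y for x, y in zip(a, b)) < 0.0:
        b = tuple(-c for c in b)
    return q_normalize(tuple((1.0 - t) * x + t * y for x, y in zip(a, b)))


def rot_z(degrees):
    """Unit quaternion for a rotation about the z axis (helper for examples)."""
    half = math.radians(degrees) / 2.0
    return (math.cos(half), 0.0, 0.0, math.sin(half))


def constant_velocity_orientation(q_prev2, q_prev1):
    """Fallback guess: repeat the last frame-to-frame rotation once more."""
    delta = q_mul(q_prev1, q_conj(q_prev2))
    return q_normalize(q_mul(delta, q_prev1))


def initial_orientation(q_visual, q_imu, imu_weight=0.5):
    """Orientation part of the initial guess handed to ICP (hypothetical).

    If visual localization failed (q_visual is None), use the IMU
    orientation alone; otherwise blend both estimates. The blend weight
    is an assumed tuning parameter, not a value from the thesis.
    """
    if q_visual is None:
        return q_normalize(q_imu)
    return q_nlerp(q_visual, q_imu, imu_weight)


# Example: the camera turned 10 degrees between the last two frames and then
# stopped. The constant-velocity guess keeps turning; the IMU guess does not.
cv_guess = constant_velocity_orientation(rot_z(0.0), rot_z(10.0))
imu_guess = initial_orientation(None, rot_z(10.0))
```

In this sketch the constant-velocity guess overshoots whenever the motion changes between frames, which is the situation in which a measured IMU orientation would be expected to give ICP a better starting point. Translation is left out because the abstract only describes IMU orientation estimates.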
