Autonomous 3D Modeling of Unknown Objects for Active Scene Exploration

The thesis Autonomous 3D Modeling of Unknown Objects for Active Scene Exploration presents an approach for efficient model generation of small-scale objects applying a robot-sensor system. Active scene exploration incorporates object recognition methods for analyzing a scene of partially known objects as well as exploration approaches for autonomous modeling of unknown parts. Here, recognition, exploration, and planning methods are extended and combined in a single scene exploration system, enabling advanced techniques such as multi-view recognition from planned view positions and iterative recognition by integration of new objects from a scene. In household or industrial environments, novel and unknown objects appear regularly and need to be modeled in order for a robot to be able to recognize the object and manipulate it. Nowadays, 3D models of hand-sized objects are usually obtained by manual scanning which represents a tedious and time consuming task for the human operator. For an autonomous system to take over this task, the robot needs to autonomously obtain the model within the object scene and thereby cope with challenges such as bad incidence angle, sensor noise, reflections, collisions or occlusions. In this thesis, sensor paths denoted as Next-Best-Scan are iteratively determined by a boundary search and surface trend estimation of the acquired model. In each iteration, 3D measurements are merged into a probabilistic voxel space, which considers sensor uncertainties. It is used for scene exploration, planning collision-free paths, avoiding occlusions, and verifying the poses of the recognized objects against all previous information. In order to account for both a fast acquisition rate and a high model quality, a Next-Best-Scan is selected that maximizes a utility function integrating an exploration and a mesh-quality component. The mesh-quality component allows for the algorithm to terminate once the quality required by the application is reached. The Next-Best-Scan algorithm is verified in simulation by comparison with state-of-the-art approaches concerning processing time and final model quality and in real scenes. The versatile applicability of the method is shown by several experiments with different cultural heritage, household, and industrial objects. Modeling of single objects is evaluated on an industrial and a mobile robot. On the industrial robot, the robot moves around the object, whereas on the mobile robot, the object is moved in front of an external range sensor using the same method. For modeling of larger workspaces, the mobile platform moves around the scene. The active scene exploration approach is demonstrated using several scenes with different levels of complexity. Here, Next-Best-Scan planning is performed for improving both recognition and modeling. Concluding, the developed methods enable the robot to learn object models of unknown objects, to directly apply these models to the individual application and therefore to become more autonomous. Here, the autonomously acquired object models are successively inserted into an object database and utilized by an object recognition module.

[1]  Gregory D. Hager,et al.  Sequential scene parsing using range and intensity information , 2012, 2012 IEEE International Conference on Robotics and Automation.

[2]  Benjamin Loriot,et al.  Non-Model Based Method for an Automation of 3D Acquisition and Post-Processing , 2009 .

[3]  Nicola D'Apuzzo,et al.  Overview of 3D surface digitization technologies in Europe , 2006, Electronic Imaging.

[4]  Sven Molkenstruck,et al.  Low-Cost Laser Range Scanner and Fast Surface Registration Approach , 2006, DAGM-Symposium.

[5]  Klaus H. Strobl A Flexible Approach to Close-Range 3-D Modeling , 2014 .

[6]  G. Roth,et al.  View planning for automated three-dimensional object reconstruction and inspection , 2003, CSUR.

[7]  Christophe Dumont,et al.  A next-best-view system for autonomous 3-D object reconstruction , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[8]  Sebastian Thrun,et al.  FastSLAM: a factored solution to the simultaneous localization and mapping problem , 2002, AAAI/IAAI.

[9]  Steven M. LaValle,et al.  Rapidly-Exploring Random Trees: Progress and Prospects , 2000 .

[10]  Heiko Hirschmüller,et al.  Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Youfu Li,et al.  A Novel View Planning Method for Automatic Reconstruction of Unknown 3-D Objects Based on the Limit Visual Surface , 2009, 2009 Fifth International Conference on Image and Graphics.

[12]  William H. Press,et al.  Numerical recipes in C (2nd ed.): the art of scientific computing , 1992 .

[13]  Dieter Fox,et al.  DART: Dense Articulated Real-Time Tracking , 2014, Robotics: Science and Systems.

[14]  Tanneguy Redarce,et al.  A CAD-based 3D data acquisition strategy for inspection , 2003, Machine Vision and Applications.

[15]  Rafael Murrieta-Cid,et al.  Volumetric Next-best-view Planning for 3D Object Reconstruction with Positioning Error , 2014 .

[16]  Andrew W. Fitzgibbon,et al.  Automatic 3D Model Construction for Turn-Table Sequences , 1998, SMILE.

[17]  Alin Albu-Schäffer,et al.  The KUKA-DLR Lightweight Robot arm - a new reference platform for robotics research and manufacturing , 2010, ISR/ROBOTIK.

[18]  Zoltan-Csaba Marton,et al.  Combining object modeling and recognition for active scene exploration , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Kurt Konolige,et al.  Improved Occupancy Grids for Map Building , 1997, Auton. Robots.

[20]  Colin Bradley,et al.  Automated laser scanning based on orthogonal cross sections , 1996 .

[21]  Héctor H. González-Baños,et al.  Navigation Strategies for Exploring Indoor Environments , 2002, Int. J. Robotics Res..

[22]  Christoph Munkelt Aktive daten- und modellbasierte Sensorpositionierung zur 3-D-Vermessung , 2011 .

[23]  Ruzena Bajcsy,et al.  Occlusions as a Guide for Planning the Next View , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Peter Liepa,et al.  Filling Holes in Meshes , 2003, Symposium on Geometry Processing.

[25]  Lucian Cosmin Goron,et al.  Reconstruction and Verification of 3D Object Models for Grasping , 2009, ISRR.

[26]  Michael Suppa,et al.  Efficient next-best-scan planning for autonomous 3D surface reconstruction of unknown objects , 2013, Journal of Real-Time Image Processing.

[27]  Soon Ki Jung,et al.  Calibration-free approach to 3D reconstruction using light stripe projections on a cube frame , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[28]  Marc Levoy,et al.  The digital Michelangelo project: 3D scanning of large statues , 2000, SIGGRAPH.

[29]  Michael Suppa,et al.  Autonomous robot work cell exploration using multisensory eye-in-hand systems , 2008 .

[30]  Sören Larsson,et al.  Path planning for laser scanning with an industrial robot , 2008, Robotics Auton. Syst..

[31]  Jason Geng,et al.  Structured-light 3D surface imaging: a tutorial , 2011 .

[32]  David Fofi,et al.  An efficient method for fully automatic 3D digitization of unknown objects , 2013, Comput. Ind..

[33]  Stefano Caselli,et al.  Global registration of mid-range 3D observations and short range next best views , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[34]  T. Hales The Kepler conjecture , 1998, math/9811078.

[35]  B. Faverjon,et al.  Probabilistic Roadmaps for Path Planning in High-Dimensional Con(cid:12)guration Spaces , 1996 .

[36]  Olivier Stasse,et al.  A two-steps next-best-view algorithm for autonomous 3D object modeling by a humanoid robot , 2009, 2009 IEEE International Conference on Robotics and Automation.

[37]  Shengyong Chen,et al.  Active vision in robotic systems: A survey of recent developments , 2011, Int. J. Robotics Res..

[38]  Darius Burschka,et al.  The self-referenced DLR 3D-modeler , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Peter K. Allen,et al.  Data acquisition and view planning for 3-D modeling tasks , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40]  Björn Stenger,et al.  A new distance for scale-invariant 3D shape recognition and registration , 2011, 2011 International Conference on Computer Vision.

[41]  Gerd Hirzinger,et al.  Next-best-scan planning for autonomous 3D modeling , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  Marc Levoy,et al.  Real-time 3D model acquisition , 2002, ACM Trans. Graph..

[43]  Wolfram Burgard,et al.  OctoMap : A Probabilistic , Flexible , and Compact 3 D Map Representation for Robotic Systems , 2010 .

[44]  Michel Devy,et al.  Perception planning for an exploration task of a 3D environment , 2002, Object recognition supported by user interaction for service robots.

[45]  Shengyong Chen,et al.  Active Sensor Planning for Multiview Vision Tasks , 2008 .

[46]  Stefano Caselli,et al.  Perception and Grasping of Object Parts from Active Robot Exploration , 2014, J. Intell. Robotic Syst..

[47]  B. Siciliano,et al.  Visual Grasp Planning for Unknown Objects Using a Multifingered Robotic Hand , 2013, IEEE/ASME Transactions on Mechatronics.

[48]  Brian Yamauchi,et al.  A frontier-based approach for autonomous exploration , 1997, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation'.

[49]  Gino van den Bergen A Fast and Robust GJK Implementation for Collision Detection of Convex Objects , 1999, J. Graphics, GPU, & Game Tools.

[50]  Manuel Brucker,et al.  Virtual reality support for teleoperation using online grasp planning , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[51]  Stefan Berchtold,et al.  A scalable optimizer for automatically generated manipulator motions , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[52]  Michael Beetz,et al.  Generality and legibility in mobile manipulation , 2010, Auton. Robots.

[53]  Wolfram Burgard,et al.  Probabilistic Robotics (Intelligent Robotics and Autonomous Agents) , 2005 .

[54]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[55]  Eric Wahl,et al.  Surflet-pair-relation histograms: a statistical 3D-shape representation for rapid classification , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[56]  Carme Torras,et al.  Information-Gain View Planning for Free-Form Object Reconstruction with a 3D ToF Camera , 2012, ACIVS.

[57]  Miguel Ángel García,et al.  A Two-Stage Algorithm for Planning the Next View From Range Images , 1998, BMVC.

[58]  Oliver Kroemer,et al.  Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments , 2014, IEEE Transactions on Robotics.

[59]  Mongi A. Abidi,et al.  Next-best-view algorithm for object reconstruction , 1998, Other Conferences.

[60]  François Blais Review of 20 years of range sensor development , 2004, J. Electronic Imaging.

[61]  Gregory D. Hager,et al.  Scene parsing using a prior world model , 2011, Int. J. Robotics Res..

[62]  Andreas Nüchter,et al.  3D Robotic Mapping - The Simultaneous Localization and Mapping Problem with Six Degrees of Freedom , 2009, Springer Tracts in Advanced Robotics.

[63]  Alexander Narr Automated 3D Reconstruction of Unknown Objects by Planning the Next-Best-View based on a Spatial Data Structure , 2012 .

[64]  Rafael Murrieta-Cid,et al.  Hierarchical Ray Tracing for Fast Volumetric Next-Best-View Planning , 2013, 2013 International Conference on Computer and Robot Vision.

[65]  Anselmo Lastra,et al.  Efficient Constraint Evaluation Algorithms for Hierarchical Next-Best-View Planning , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[66]  Tim Bodenmüller,et al.  Streaming surface reconstruction from real time 3D-measurements , 2009 .

[67]  Robert Sitnik,et al.  On-line, collision-free positioning of a scanner during fully automated three-dimensional measurement of cultural heritage objects , 2012, Robotics Auton. Syst..

[68]  B. Bhanu,et al.  Estimating 3D Egomotion from Perspective Image Sequence , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[69]  Gerd Hirzinger,et al.  View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter , 2012, J. Intell. Robotic Syst..

[70]  Rainer Christian Konietschke,et al.  Planning of Workplaces with Multiple Kinematically Redundant Robots , 2007 .

[71]  Gerd Hirzinger,et al.  The 3D-Modeller: A Multi-Purpose Vision Platform , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[72]  Anis Sahbani,et al.  An overview of 3D object grasp synthesis algorithms , 2012, Robotics Auton. Syst..

[73]  S. Foix,et al.  Lock-in Time-of-Flight (ToF) Cameras: A Survey , 2011, IEEE Sensors Journal.

[74]  Eric Wahl,et al.  The DLR multisensory Hand-Guided Device: the Laser Stripe Profiler , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[75]  C. Ian Connolly,et al.  The determination of next best views , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[76]  David Fofi,et al.  View Planning Approach for Automatic 3D Digitization of Unknown Objects , 2012, ECCV Workshops.

[77]  Joachim Denzler,et al.  Benchmarking 3D Reconstructions from Next Best View Planning , 2007, MVA.

[78]  Gaurav S. Sukhatme,et al.  Next Best View Estimation With Eye In Hand Camera , 2011, IROS 2011.

[79]  Stefan K. Gehrig,et al.  A Real-Time Low-Power Stereo Vision Engine Using Semi-Global Matching , 2009, ICVS.

[80]  Dieter Fox,et al.  Autonomous generation of complete 3D object models using next best view manipulation planning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[81]  Gerd Hirzinger,et al.  Multi-scale 3D-Modeling , 2006, PSIVT.

[82]  Hugh F. Durrant-Whyte,et al.  A solution to the simultaneous localization and map building (SLAM) problem , 2001, IEEE Trans. Robotics Autom..

[83]  Li Zhang,et al.  Rapid shape acquisition using color structured light and multi-pass dynamic programming , 2002, Proceedings. First International Symposium on 3D Data Processing Visualization and Transmission.

[84]  Rahul Nair,et al.  A State of the Art Report on Kinect Sensor Setups in Computer Vision , 2013, Time-of-Flight and Depth Imaging.

[85]  Wolfram Burgard,et al.  On the position accuracy of mobile robot localization based on particle filters combined with scan matching , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[86]  Stefan Fuchs Calibration and Multipath Mitigation for Increased Accuracy of Time-of-Flight Camera Measurements in Robotic Applications , 2012 .

[87]  TREND SURFACE ANALYSIS-A REVIEW , 2004 .

[88]  Matei T. Ciocarlie,et al.  Contact-reactive grasping of objects with partial shape information , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[89]  Rüdiger Dillmann,et al.  The KIT object models database: An object model database for object recognition, localization and manipulation in service robotics , 2012, Int. J. Robotics Res..

[90]  Hugh F. Durrant-Whyte,et al.  Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[91]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[92]  Kari Pulli,et al.  Multiview registration for large data sets , 1999, Second International Conference on 3-D Digital Imaging and Modeling (Cat. No.PR00062).

[93]  Paolo Cignoni,et al.  RoboScan: an automatic system for accurate and unattended 3D scanning , 2004, Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004..

[94]  Nico Blodow,et al.  Autonomous semantic mapping for robots performing everyday manipulation tasks in kitchen environments , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[95]  Michael Suppa,et al.  DLR VR-SCAN: A versatile and robust miniaturized laser scanner for short range 3D-modelling and exploration in robotics , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[96]  Pieter Abbeel,et al.  BigBIRD: A large-scale 3D database of object instances , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[97]  Michael Suppa,et al.  Fusing Color and Geometry Information for Understanding Cluttered Scenes , 2014 .

[98]  Edwin Olson,et al.  AprilTag: A robust and flexible visual fiducial system , 2011, 2011 IEEE International Conference on Robotics and Automation.

[99]  Martial Hebert,et al.  A system for semi-automatic modeling of complex environments , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[100]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[101]  Hanan Samet,et al.  Applications of spatial data structures - computer graphics, image processing, and GIS , 1990 .

[102]  Joachim Denzler,et al.  Online Next-Best-View Planning for Accuracy Optimization Using an Extended E-Criterion , 2010, 2010 20th International Conference on Pattern Recognition.

[103]  Otmar Loffeld,et al.  Depth Camera Technology Comparison and Performance Evaluation , 2012, ICPRAM.

[104]  Christopher Schwartz,et al.  A Multi-camera, Multi-projector Super-Resolution Framework for Structured Light , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[105]  Richard Pito,et al.  A Solution to the Next Best View Problem for Automated Surface Acquisition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[106]  Pushmeet Kohli,et al.  When Can We Use KinectFusion for Ground Truth Acquisition , 2012 .

[107]  Mohammed Bennamoun,et al.  Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[108]  Kamal K. Gupta,et al.  An autonomous six-DOF eye-in-hand system for in situ 3D object modeling , 2012, Int. J. Robotics Res..

[109]  Giuseppe Oriolo,et al.  Frontier-Based Probabilistic Strategies for Sensor-Based Exploration , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[110]  Nico Blodow,et al.  Functional object mapping of kitchen environments , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[111]  Klaus H. Strobl Loop Closing for Visual Pose Tracking during Close-Range 3-D Modeling , 2014, ISVC.

[112]  Marilena Vendittelli,et al.  The SRT method: randomized strategies for exploration , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[113]  Achim J. Lilienthal,et al.  Comparative Evaluation of Range Sensor Accuracy in Indoor Environments , 2011, ECMR.

[114]  Subhashis Banerjee,et al.  Active recognition through next view planning: a survey , 2004, Pattern Recognit..

[115]  Florian Schmidt,et al.  Rollin' Justin - Mobile platform with variable base , 2009, 2009 IEEE International Conference on Robotics and Automation.

[116]  Tomás Pajdla,et al.  3D with Kinect , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[117]  Gerd Hirzinger,et al.  Optimal Hand-Eye Calibration , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[118]  Luis Enrique Sucar,et al.  View planning for 3D object reconstruction , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[119]  Nico Blodow,et al.  Partial view modeling and validation in 3D laser scans for grasping , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[120]  Gerd Hirzinger,et al.  A surface-based Next-Best-View approach for automated 3D model completion of unknown objects , 2011, 2011 IEEE International Conference on Robotics and Automation.

[121]  S. Sathiya Keerthi,et al.  A fast procedure for computing the distance between complex objects in three-dimensional space , 1988, IEEE J. Robotics Autom..

[122]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[123]  Claire Lartigue,et al.  Scan Planning Strategy for a General Digitized Surface , 2006, J. Comput. Inf. Sci. Eng..

[124]  Shengyong Chen,et al.  Vision sensor planning for 3-D model acquisition , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[125]  Xiaobu Yuan,et al.  A Mechanism of Automatic 3D Object Modeling , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[126]  Markus Wiedemann Watertight Surface Reconstruction for Uncertain Data , 2014 .

[127]  Luis Enrique Sucar,et al.  View planning for 3D object reconstruction with a mobile manipulator robot , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[128]  Gino van den Bergen Collision Detection in Interactive 3D Environments , 2003 .

[129]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[130]  Kamal K. Gupta,et al.  An autonomous 9-DOF mobile-manipulator system for in situ 3D object modeling , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[131]  Zoltan-Csaba Marton,et al.  Tracking-based interactive segmentation of textureless objects , 2013, 2013 IEEE International Conference on Robotics and Automation.

[132]  Joachim Hertzberg,et al.  Towards semantic maps for mobile robots , 2008, Robotics Auton. Syst..

[133]  Wolfram Burgard,et al.  OctoMap: an efficient probabilistic 3D mapping framework based on octrees , 2013, Autonomous Robots.

[134]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[135]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.