Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments

Environment models serve as important resources for an autonomous robot by providing it with the necessary task-relevant information about its habitat. Their use enables robots to perform their tasks more reliably, flexibly, and efficiently. As autonomous robotic platforms acquire more sophisticated manipulation capabilities, they also need more expressive and comprehensive environment models: for manipulation purposes, their models have to include the objects present in the world, together with their position, form, and other aspects, as well as an interpretation of these objects with respect to the robot's tasks. The dissertation presented in this article (Rusu, PhD thesis, 2009) proposes Semantic 3D Object Models as a novel representation of the robot’s operating environment that satisfies these requirements and shows how these models can be automatically acquired from dense 3D range data.
