3D model selection from an internet database for robotic vision

We propose a new method for automatically accessing an internet database of 3D models that are searchable only by their user-annotated labels, for using them for vision and robotic manipulation purposes. Instead of having only a local database containing already seen objects, we want to use shared databases available over the internet. This approach while having the potential to dramatically increase the visual recognition capability of robots, also poses certain problems, like wrong annotation due to the open nature of the database, or overwhelming amounts of data (many 3D models) or the lack of relevant data (no models matching a specified label). To solve those problems we propose the following: First, we present an outlier/inlier classification method for reducing the number of results and discarding invalid 3D models that do not match our query. Second, we utilize an approach from computer graphics, the so called ‘morphing’, to this application to specialize the models, in order to describe more objects. Third, we search for 3D models using a restricted search space, as obtained from our knowledge of the environment. We show our classification and matching results and finally show how we can recover the correct scaling with the stereo setup of our robot.

[1]  Nico Blodow,et al.  Towards 3D Point cloud based object maps for household environments , 2008, Robotics Auton. Syst..

[2]  Bernard Chazelle,et al.  Shape distributions , 2002, TOGS.

[3]  Markus Ulrich,et al.  Recognition and Tracking of 3D Objects , 2008, DAGM-Symposium.

[4]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Gordon Cheng,et al.  Making Object Learning and Recognition an Active Process , 2008, Int. J. Humanoid Robotics.

[6]  Ashutosh Saxena,et al.  Robotic Grasping of Novel Objects using Vision , 2008, Int. J. Robotics Res..

[7]  Eric Wahl,et al.  Surflet-pair-relation histograms: a statistical 3D-shape representation for rapid classification , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[8]  Masayuki Inaba,et al.  Multi-cue 3D object recognition in knowledge-based vision-guided humanoid robot system , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Nico Blodow,et al.  The Assistive Kitchen — A demonstration scenario for cognitive technical systems , 2007, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[10]  Gert Kootstra,et al.  Active exploration and keypoint clustering for object recognition , 2008, 2008 IEEE International Conference on Robotics and Automation.

[11]  James J. Little,et al.  Informed visual search: Combining attention and object recognition , 2008, 2008 IEEE International Conference on Robotics and Automation.

[12]  Anthony A. Maciejewski,et al.  Pose detection of 3-D objects using S2-correlated images and discrete spherical harmonic transforms , 2008, 2008 IEEE International Conference on Robotics and Automation.

[13]  Henrik I. Christensen,et al.  Automatic grasp planning using shape primitives , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[14]  Tamim Asfour,et al.  Integrated Grasp Planning and Visual Object Localization For a Humanoid Robot with Five-Fingered Hands , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Michael Beetz,et al.  Leaving Flatland: Realtime 3D Stereo Semantic Reconstruction , 2008, ICIRA.

[16]  E. Forgy Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[17]  Anne Verroust-Blondet,et al.  Three-dimensional metamorphosis: a survey , 1998, The Visual Computer.

[18]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.