Multi-view object recognition using view-point invariant shape relations and appearance information

We present an object recognition system coding shape by view-point invariant geometric relations and appearance. In our intelligent work-cell, the system can observe the work space of the robot by 3 pairs of Kinect and stereo cameras allowing for reliable and complete object information. We show that in such a set-up we can achieve high performance already with a low number of training samples. We show this by training the system to classify 56 objects using Random Forest algorithm. This indicates that our approach can be used in contexts such as assembly manipulation which require high reliability of object recognition.

[1]  Dirk Kraft,et al.  Real-time extraction of surface patches with associated uncertainties by means of Kinect cameras , 2012, Journal of Real-Time Image Processing.

[2]  Florentin Wörgötter,et al.  Visual Primitives: Local, Condensed, Semantically Rich Visual Descriptors and their Applications in Robotics , 2010, Int. J. Humanoid Robotics.

[3]  Yali Amit,et al.  Shape Quantization and Recognition with Randomized Trees , 1997, Neural Computation.

[4]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[5]  Dieter Fox,et al.  A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[6]  Jimmy A. Jørgensen,et al.  Enabling grasping of unknown objects through a synergistic use of edge and surface information , 2012, Int. J. Robotics Res..

[7]  Don Ray Murray,et al.  Patchlets: Representing Stereo Vision Data with Surface Elements , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[8]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[12]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[13]  Roberto Cipolla,et al.  Semantic texton forests for image categorization and segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[15]  Cordelia Schmid,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[16]  A. Paulo Moreira,et al.  Object recognition using laser range finder and machine learning techniques , 2013 .

[17]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[18]  Thomas Serre,et al.  Object recognition with features inspired by visual cortex , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Nicolas Pinto,et al.  How far can you get with a modern face recognition test set using only simple features? , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[21]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.