Object Recognition and Tracking for Indoor Robots Using an RGB-D Sensor

In this paper, we extend and generalize our previously published RGB-D based fruit recognition approach so that it recognizes different kinds of objects in front of our mobile system. To this end, we first extend our segmentation with depth filtering and watershed-based clustering on the depth data to detect the target object. From the processed data we then extract RGB-D descriptors, which capture complementary object information for the classification and recognition task. Once the object has been detected, we apply a simple tracking method that reduces the object search space and the computational load caused by frequent detection queries. The proposed method is evaluated using a random forest (RF) classifier. Experimental results on real RGB-D data highlight the effectiveness and real-time suitability of the proposed extensions for our mobile system.
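The detect-once-then-track idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the tracking method, so the version below simply assumes that the next detection is restricted to a padded window around the last bounding box. The watershed clustering step is omitted, and all function names, the depth band (`near`, `far`, in metres) and the `margin` parameter are hypothetical choices made for illustration.

```python
import numpy as np


def depth_mask(depth, near=0.4, far=1.5):
    """Depth filtering: keep pixels whose depth (assumed metres) lies in [near, far]."""
    return (depth >= near) & (depth <= far)


def bounding_box(mask):
    """Return (row0, col0, row1, col1) enclosing all True pixels, or None if there are none."""
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    if rows.size == 0:
        return None
    return int(rows[0]), int(cols[0]), int(rows[-1]) + 1, int(cols[-1]) + 1


def search_window(box, shape, margin=10):
    """Pad the last bounding box by `margin` pixels, clipped to the frame size."""
    r0, c0, r1, c1 = box
    return (max(r0 - margin, 0), max(c0 - margin, 0),
            min(r1 + margin, shape[0]), min(c1 + margin, shape[1]))


def detect(depth, window=None, near=0.4, far=1.5):
    """Detect the object in the whole frame, or only inside `window` if one is given.

    The returned box is always expressed in full-frame coordinates.
    """
    if window is None:
        window = (0, 0, depth.shape[0], depth.shape[1])
    r0, c0, r1, c1 = window
    box = bounding_box(depth_mask(depth[r0:r1, c0:c1], near, far))
    if box is None:
        return None  # object lost: the caller can fall back to a full-frame search
    return box[0] + r0, box[1] + c0, box[2] + r0, box[3] + c0
```

In use, the first frame would be processed with `detect(depth)`, and each later frame with `detect(depth, search_window(previous_box, depth.shape))`, so that only a small region is filtered per frame. If the windowed call returns `None`, a full-frame detection can be run again.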
