Active Multi-view Object Recognition and Online Feature Selection

Adaptive action selection is crucial for object recognition on robots with limited operating capabilities. It offers much desired flexibility in trading off between the costs of acquiring information and making robust and reliable inference under uncertainty. In this paper, we describe an information-theoretic framework that combines and unifies two common techniques: view planning for resolving ambiguities and occlusions and online feature selection for reducing computational costs. Concretely, our algorithm adaptively chooses two strategies: utilize simple-to-compute features that are the most informative for the recognition task or move to new viewpoints that optimally reduce the expected uncertainties on the identity of the object. Extensive empirical studies have validated the effectiveness of the proposed framework. On a large RGB-D dataset, dynamic feature selection alone reduces the computation time at runtime by five folds, and when combining it with viewpoint selection, we significantly increase the recognition accuracy on average by 8–15% absolute, compared to systems that do not use these two strategies. Lastly, we have also demonstrated successfully the effectiveness of the framework on a quadcopter platform with limited operating time.

[1]  Lucas Paletta,et al.  Active object recognition by view integration and reinforcement learning , 2000, Robotics Auton. Syst..

[2]  Joachim Denzler,et al.  Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[4]  Gaurav S. Sukhatme,et al.  A probabilistic framework for next best view estimation in a cluttered environment , 2014, J. Vis. Commun. Image Represent..

[5]  R. Bajcsy Active perception , 1988 .

[6]  George J. Pappas,et al.  Nonmyopic View Planning for Active Object Classification and Pose Estimation , 2014, IEEE Transactions on Robotics.

[7]  Horst Bischof,et al.  On-line Boosting and Vision , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  Dieter Fox,et al.  Unsupervised Feature Learning for RGB-D Based Object Recognition , 2012, ISER.

[9]  Zoltan-Csaba Marton,et al.  Evaluation of feature selection and model training strategies for object category recognition , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Gary R. Bradski,et al.  Fast 3D recognition and pose using the Viewpoint Feature Histogram , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Michel Verleysen,et al.  Advances in Feature Selection with Mutual Information , 2009, Similarity-Based Clustering.

[12]  Geoffrey A. Hollinger,et al.  Active Classification: Theory and Application to Underwater Inspection , 2011, ISRR.

[13]  Alberto Del Bimbo,et al.  Non-myopic information theoretic sensor management of a single pan-tilt-zoom camera for multiple object detection and tracking , 2015, Comput. Vis. Image Underst..

[14]  Matthew A. Brown,et al.  Unsupervised 3D object recognition and reconstruction in unordered datasets , 2005, Fifth International Conference on 3-D Digital Imaging and Modeling (3DIM'05).

[15]  Lucas Paletta,et al.  Active Object Recognition in Parametric Eigenspace , 1998, BMVC.

[16]  Dieter Fox,et al.  Depth kernel descriptors for object recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Kai Oliver Arras,et al.  People tracking in RGB-D data with on-line boosted target models , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Dorin Comaniciu,et al.  Conditional feature sensitivity: a unifying view on active recognition and feature selection , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[19]  Dieter Fox,et al.  A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[20]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[21]  John J. Leonard,et al.  Toward lifelong object segmentation from change detection in dense RGB-D maps , 2013, 2013 European Conference on Mobile Robots.

[22]  Tal Arbel,et al.  Efficient Discriminant Viewpoint Selection for Active Bayesian Recognition , 2006, International Journal of Computer Vision.