Nonmyopic View Planning for Active Object Detection

One of the central problems in computer vision is the detection of semantically important objects and the estimation of their pose. Most work in object detection has been based on single-image processing, and its performance is limited by occlusions and by ambiguity in appearance and geometry. This paper proposes an active approach to object detection that controls the point of view of a mobile depth camera. When an initial static detection phase identifies an object of interest, several hypotheses are made about its class and orientation. The sensor then plans a sequence of views that balances the energy spent on movement against the chance of identifying the correct hypothesis. We formulate an active hypothesis testing problem that includes sensor mobility, and solve it using a point-based approximate POMDP algorithm. We validate the approach in simulation and in real-world experiments with the PR2 robot. The results suggest that our approach outperforms the widely used greedy viewpoint selection and provides a significant improvement over static object detection.
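As a rough illustration of the trade-off described in the abstract, the sketch below plans over a belief on object hypotheses with a small depth-limited lookahead. It is not the point-based POMDP solver used in the paper: exhaustive lookahead is swapped in only because it fits in a few lines. The names `bayes_update` and `plan`, the array layouts `obs_model[view, hypothesis, observation]` and `move_cost[from_view, to_view]`, and the scalar penalty `wrong_cost` are all assumptions made for this example and do not come from the paper.

```python
import numpy as np


def bayes_update(belief, likelihood):
    """Posterior over hypotheses after one observation (Bayes rule)."""
    posterior = belief * likelihood
    return posterior / posterior.sum()


def plan(belief, view, obs_model, move_cost, wrong_cost, depth):
    """Depth-limited lookahead over views (illustrative, hypothetical API).

    belief     : shape (n_hyp,), current probability of each hypothesis
    view       : index of the current viewpoint
    obs_model  : shape (n_views, n_hyp, n_obs), P(observation | hypothesis, view)
    move_cost  : shape (n_views, n_views), cost of moving between views
    wrong_cost : penalty for declaring an incorrect hypothesis
    depth      : number of further views the planner may consider

    Returns (expected_cost, action), where action is ("stop", hypothesis)
    or ("move", next_view).
    """
    # Option 1: stop now and declare the most likely hypothesis.
    guess = int(np.argmax(belief))
    best_cost = wrong_cost * (1.0 - belief[guess])
    best_action = ("stop", guess)
    if depth == 0:
        return best_cost, best_action

    # Option 2: move to some view, observe, and continue planning.
    for next_view in range(obs_model.shape[0]):
        cost = move_cost[view, next_view]
        # Predicted probability of each observation from next_view.
        p_obs = belief @ obs_model[next_view]
        for z, pz in enumerate(p_obs):
            if pz > 0.0:
                posterior = bayes_update(belief, obs_model[next_view][:, z])
                future, _ = plan(posterior, next_view, obs_model,
                                 move_cost, wrong_cost, depth - 1)
                cost += pz * future
        if cost < best_cost:
            best_cost, best_action = cost, ("move", next_view)
    return best_cost, best_action
```

With `depth=1` this reduces to a greedy one-step rule of the kind the abstract compares against; larger depths look further ahead at exponential cost, which is the kind of blow-up that approximate POMDP solvers are designed to avoid.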
