Information Theoretic Approaches for Next Best View Planning in Active Computer Vision

This paper describes an information theoretic approach for next best view planning in active state estimation, and its application to three computer vision tasks. In active state estimation, the state estimation process contains sensor actions which affect the state observation, and therefore the final state estimate. We use the information theoretic measure of mutual information to quantify the information content in this estimate. The optimal sensor actions are those that are expected to maximally increase the information content of the estimate. This action selection process is then applied to three seperate computer vision tasks: object recognition, object tracking and object reconstruction. Each task is formulated as an active state estimation problem. In these tasks, a given sensor action describes a camera position, or view. The information theoretic framework allows us to determine the next best view, i.e. the view that best supports the computer vision task. We show the benefits of next best view planning in several experiments, in which we compare the estimation error produced by planned views with the error produced by regularly sampled or unchanging views.

[1]  John J. Craig Zhu,et al.  Introduction to robotics mechanics and control , 1991 .

[2]  Heinrich Niemann,et al.  Vector Quantization Based Data Selection for Hand-Eye Calibration , 2004, VMV.

[3]  Christophe Dumont,et al.  A next-best-view system for autonomous 3-D object reconstruction , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[4]  Éric Marchand,et al.  Active Vision for Complete Scene Reconstruction and Exploration , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Tal Arbel,et al.  A fast discriminant approach to active object recognition and pose estimation , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[6]  David Casasent,et al.  Feature Space Trajectory Methods for Active Computer Vision , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Heinrich Niemann,et al.  Multi-step Multi-camera View Planning for Real-Time Visual Object Tracking , 2006, DAGM-Symposium.

[8]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[9]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[10]  Richard Pito,et al.  A Solution to the Next Best View Problem for Automated Surface Acquisition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Sven J. Dickinson,et al.  Active Object Recognition Integrating Attention and Viewpoint Control , 1997, Comput. Vis. Image Underst..

[12]  Wolfgang Niem Automatische Rekonstruktion starrer dreidimensionaler Objekte aus Kamerabildern , 1999 .

[13]  David W. Murray,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Reactive Control of Zoom While Fixating Using Perspective and Affine Cameras , 2022 .

[14]  David W. Murray,et al.  A method of reactive zoom control from uncertainty in tracking , 2007, Comput. Vis. Image Underst..

[15]  Roger Y. Tsai,et al.  A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses , 1987, IEEE J. Robotics Autom..

[16]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[17]  G. Roth,et al.  View planning for automated three-dimensional object reconstruction and inspection , 2003, CSUR.

[18]  J. Denzler,et al.  An Information Theoretic Approach to Optimal Sensor Data Selection for State Estimation , 2002 .

[19]  Joachim Denzler,et al.  Aspects of Optimal Viewpoint Selection and Viewpoint Fusion , 2006, ACCV.

[20]  Frank P. Ferrie,et al.  Entropy-based gaze planning , 2001, Image Vis. Comput..

[21]  Dorin Comaniciu,et al.  Conditional feature sensitivity: a unifying view on active recognition and feature selection , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[22]  Heinrich Niemann,et al.  Efficient Feature Tracking for Long Video Sequences , 2004, DAGM-Symposium.

[23]  Kiriakos N. Kutulakos,et al.  Recovering shape by purposive viewpoint adjustment , 1992, International Journal of Computer Vision.

[24]  Andrew J. Davison,et al.  Active search for real-time vision , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[25]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.