Integrated Viewpoint Fusion and Viewpoint Selection for Optimal Object Recognition

In the past decades, most object recognition systems were based on passive approaches. But in the last few years a lot of research was done in the eld of active object recognition, that is selectively moving a sensor/camera around a considered object in order to acquire as much information about it as possible. In this paper we present an active object recognition approach that solves the problem of choosing optimal views (viewpoint selection) and iteratively fuses the gained information for an optimal 3D object recognition (viewpoint fusion) in an integrated manner. Therefore, we apply a method for the fusion of multiple views with respect to the knowledge about the assumed camera movement between them. For viewpoint selection we formally dene the choice of additional views as an optimization problem. We show how to use reinforcement learning for this purpose and perform a training without user interaction. In this context we focus on the modeling of continuous states, continuous, one-dimensional actions and supporting rewards for an optimized recognition of real objects. The experimental results show that our combined viewpoint selection and viewpoint fusion approach is able to signicantly improve the recognition rates compared to passive object recognition with randomly chosen views.

[1]  Claus B. Madsen,et al.  A Viewpoint Planning Strategy for Determining True Angles on Polyhedral Objects by Camera Alignment , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Joachim Denzler,et al.  Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Joachim Denzler,et al.  Aspects of Optimal Viewpoint Selection and Viewpoint Fusion , 2006, ACCV.

[4]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[5]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[6]  Joachim Denzler,et al.  Viewpoint Selection - Planning Optimal Sequences of Views for Object Recognition , 2003, CAIP.

[7]  Joachim Denzler,et al.  How Fusion of Multiple Views Can Improve Object Recognition in Real-World Environments , 2003, VMV.

[8]  Heinrich Niemann,et al.  Cost Integration in Multi-step Viewpoint Selection for Object Recognition , 2005, MLDM.

[9]  Subhashis Banerjee,et al.  Recognizing large 3-D objects through next view planning using an uncalibrated camera , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[10]  Richard S. Sutton,et al.  Dimensions of Reinforcement Learning , 1998 .

[11]  Joachim Denzler,et al.  Classifier Independent Viewpoint Selection for 3-D Object Recognition , 2000, DAGM-Symposium.

[12]  Aimo A. Törn,et al.  Global Optimization , 1999, Science.

[13]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[14]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[15]  Aly A. Farag,et al.  Sensor planning for a trinocular active vision system , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[16]  Bernt Schiele,et al.  Transinformation for active object recognition , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).