Modality control of an active camera for an object recognition task

In this paper, we show an active object recognition system. This system uses a mutual information framework in order to choose the optimal parameters of an active camera for recognizing an unknown object. In a learning step, our system builds a database of all objects by means of a controlled acquisition process over a set of actions. These actions are taken from the set of different feasible configurations for our active sensor. Actions include pan, tilt and zoom values for an active camera. For every action, we compute the conditional probability density of observing some features of interest in the objects to recognize. Using a sequential decision making process, our system determines an optimal action that increases discrimination between objects in our database. This procedure iterates until a decision about the class of the unknown object can be done. We use the color patch mean over a region of interest in our image as the discrimination feature. We have used a set 8 different soda bottles as our test objects and we have obtained a recognition rate of about 99%. The system needs to iterate about 4 times (that is, to perform 4 actions) before being capable of making a decision.

[1]  Avinash C. Kak,et al.  Vision for Mobile Robot Navigation: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Joachim Denzler,et al.  Optimal Selection of Camera Parameters for State Estimation of Static Systems: An Information Theoretic Approach , 2000 .

[3]  Antonio Marín-Hernández,et al.  Collaborative tracking for robotic tasks , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).