Interactive Visual Dialog

In this paper we propose a paradigm called the Interactive Visual Dialog (IVD) as a means of facilitating a system’s ability to recognize objects presented to it by a human. The presentation centers around a supermarket checkout scenario in which an operator presents an item to be tallied to a stationary television camera. An active vision approach is used to provide feedback to the operator in the form of an image (or images) depicting what the system thinks the operator is most likely holding, shown in a viewpoint that suggests how the object should next be presented to improve the certainty of interpretation. Interaction proceeds iteratively until the system converges on the correct interpretation. We show how the IVD can be implemented using an entropy-based gaze planning strategy and a sequential Bayes recognition system using optical flow as input. Experimental results show that the system does, in practice, improve recognition accuracy, leading to convergence to a correct solution in a minimal number of iterations.

[1]  James W. Davis,et al.  An appearance-based representation of action , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[2]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[3]  Frank P. Ferrie,et al.  Viewpoint selection by navigation through entropy maps , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Frank P. Ferrie,et al.  Monocular optical flow for real-time vision systems , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[5]  Alex Pentland,et al.  Recognition of Space-Time Gestures using a Distributed Representation , 1993 .

[6]  Dmitry B. Goldgof,et al.  The Scale Space Aspect Graph , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[8]  Kevin W. Bowyer,et al.  Aspect graphs: An introduction and survey of recent results , 1990, Int. J. Imaging Syst. Technol..

[9]  Mutsumi Watanabe,et al.  A moving object recognition method by optical flow analysis , 1996, Proceedings of 13th International Conference on Pattern Recognition.