A Smart Camera for Multimodal Human Computer Interaction

A smart camera is an embedded vision system which, in addition to image capture, performs image analysis and pattern recognition to provide as output a high-level understanding of the imaged scene. Smart cameras are essential components to build active and automated control systems for many applications, such as surveillance, machine vision, and interactive visualization systems. The heart of smart camera is the intelligent image processing algorithms that turn raw data into knowledge. The design of smart camera is challenging because on one hand video processing has insatiable demand for performance and power, and on the other hand embedded systems place considerable constraints on the design. In this paper we firstly present an overview of smart camera technologies and the process to design smart cameras as embedded systems. We then present the design and implementation of a smart camera, called GestureCam, which can recognize simple hand and head gestures. The camera uses a CMOS image sensor as capture front-end, and the image processing and gesture recognition is completely built on a single FPGA device. The experimental results have shown it to be robust with enough performance to meet real-time constraints. We plan to use the GestureCam to build next generation of natural multimodal human computer interfaces

[1]  Wayne H. Wolf,et al.  Smart Cameras as Embedded Systems , 2002, Computer.

[2]  Paul A. Beardsley,et al.  Computer Vision for Interactive Computer Graphics , 1998, IEEE Computer Graphics and Applications.

[3]  C. W. La,et al.  Boundary extraction of moving objects from image sequence , 1999, Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030).

[4]  João M. P. Cardoso,et al.  A Real Time Gesture Recognition System for Mobile Robots , 2004, ICINCO.

[5]  Nuria Oliver,et al.  GWindows: robust stereo vision for gesture-based control of windows , 2003, ICMI '03.