Real-time gesture recognition system based on Camshift algorithm and Haar-like feature

This paper propose a real-time combined method of Camshift [1] algorithm and Haar-like feature detection [2] for tracking and recognizing hand gesture in images acquired by a possibly moving camera. A Haar-like classifier is used during the initializing of the system to acquire the user's hand color. Camshift algorithm is applied with the acquired color to track the position of the hand, accompanied with a two-dimensional Kalman filter to prevent target occlusion. An offline training with Adaboost learning algorithm [3] of various hand gestures is employed to allow the classifier to recognize static hand gestures. With the information provided by Haar-like classifier, Tracking with Camshift will have more robust performance and less likely to lost targets. As the experimental result shows, the method we proposed have better performance than simply applying any of the two methods, especially in some complicate background conditions where skin-color disturbances or occlusion exist.

[1]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[2]  BlakeAndrew,et al.  C ONDENSATION Conditional Density Propagation forVisual Tracking , 1998 .

[3]  Luc Van Gool,et al.  An adaptive color-based particle filter , 2003, Image Vis. Comput..

[4]  N.D. Georganas,et al.  Real-time Vision-based Hand Gesture Recognition Using Haar-like Features , 2007, 2007 IEEE Instrumentation & Measurement Technology Conference IMTC 2007.

[5]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[6]  Gary R. Bradski,et al.  Real time face and object tracking as a component of a perceptual user interface , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[7]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[8]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.