Two Hand Tracking Using Colour Statistical Model with the K-means Embedded Particle Filter for Hand Gesture Recognition

Particle filtering is an efficient and successful technique for tracking 2D and 3D motion through an image. We present the enhanced tracking of two hands based on a statistical model using only a skin colour feature with particle filtering for gesture recognition. Our framework employs one particle filter per hand individually with the pixel-wise classification of the likelihood of the skin in the window search. The skin classifier decision was trained from a set of skin samples in YCrCb space using an elliptical model. The tracking scheme employs a reliability measurement derived from the particle distribution which is used to adaptively weight the colour classification. The K-means algorithm is used to discriminate the split and merge between left and right hand. Experiments with a set of videos including the movement of two hands in cluttered backgrounds show that adaptive use of our scheme provides improvement compared to use with other techniques such as mean-shift tracking.

[1]  Stephen J. McKenna,et al.  Hand tracking for behaviour understanding , 2002, Image Vis. Comput..

[2]  M. Jaward,et al.  Multiple object tracking using particle filters , 2006, 2006 IEEE Aerospace Conference.

[3]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Emilio Maggio,et al.  Particle PHD Filtering for Multi-Target Visual Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[5]  Greg Hamerly,et al.  Alternatives to the k-means algorithm that find better clusterings , 2002, CIKM '02.

[6]  Shaogang Gong,et al.  Tracking and segmenting people in varying lighting conditions using colour , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[7]  King Ngi Ngan,et al.  Face segmentation using skin-color map in videophone applications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[8]  J. Y. Lee,et al.  An Elliptical Boundary Model for Skin Color Detection , 2002 .

[9]  Yoichi Sato,et al.  Real-Time Fingertip Tracking and Gesture Recognition , 2002, IEEE Computer Graphics and Applications.

[10]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[11]  Cheng-Chew Lim,et al.  Segmentation of the face and hands in sign language video sequences using color and motion cues , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Luc Van Gool,et al.  An adaptive color-based particle filter , 2003, Image Vis. Comput..

[13]  Krystian Ignasiak,et al.  Architecture of Information System for Intelligent Cash Machine , 2007, SIGMAP.

[14]  Lixin Fan,et al.  Face detection and pose alignment using colour, shape and texture information , 2000, Proceedings Third IEEE International Workshop on Visual Surveillance.

[15]  Michael Isard,et al.  ICONDENSATION: Unifying Low-Level and High-Level Tracking in a Stochastic Framework , 1998, ECCV.

[16]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[17]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.