Robust Tracking for Processing of Videos of Communication's Gestures

This paper presents a method of image processing used in a mono-vision system in order to study semiotic gestures. We present a robust method to track the hands and face of a person performing gestural communication and the Signs' language communication. A model of skin is used to compute the observation density as a skin colour distribution in the image. Three particle filter trackers are implemented, with re-sampling and annealed update steps to increase their robustness to occultation and high acceleration variations of body parts'. Evaluations of the trackers with and without these enhancements, show the improvement that they bring.

[1]  C. Cadoz,et al.  Le geste canal de communication homme/machine: la communication "instrumentale" , 1994 .

[2]  Larry S. Davis,et al.  Ghost: a human body part labeling system using silhouettes , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[3]  C. Creider Hand and Mind: What Gestures Reveal about Thought , 1994 .

[4]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[5]  Patrice Dalle,et al.  Using Signing Space as a Representation for Sign Language Processing , 2005, Gesture Workshop.

[6]  Andrew Blake,et al.  A Probabilistic Exclusion Principle for Tracking Multiple Objects , 2000, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  Y. Bar-Shalom Tracking and data association , 1988 .

[8]  Claude Cadoz Le geste canal de communication homme / machine. Models for gestual interactions. , 1992 .

[9]  Christophe Collet,et al.  Toward an Annotation Software for Video of Sign Language, Including Image Processing Tools and Signing Space Modelling , 2004, LREC.

[10]  Hans-Peter Seidel,et al.  Interacting and Annealing Particle Filters: Mathematics and a Recipe for Applications , 2007, Journal of Mathematical Imaging and Vision.

[11]  Jean-Emmanuel Viallet,et al.  Tracking Body Parts of Multiple People for Multi-person Multimodal Interface , 2005, ICCV-HCI.

[12]  Real-Time Acrobatic Gesture Analysis , 2005, Gesture Workshop.

[13]  Christopher M. Bishop,et al.  Non-linear Bayesian Image Modelling , 2000, ECCV.

[14]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[15]  Andrew Blake,et al.  Articulated body motion capture by annealed particle filtering , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[16]  Nicolas Courty,et al.  Gesture in Human-Computer Interaction and Simulation , 2006 .