Head nods analysis: interpretation of non verbal communication gestures

This paper proposes a real-time, frequency-domain method to detect 2D rigid rotations (pan or tilt) of a moving head. We aim to interpret the head nods involved in nonverbal communication the way a human observer does: the direction of the rotation is estimated, but not its precise amplitude. The method analyzes the image spectrum in the log-polar domain, where global 2D head rotations become simple energy translations. To make the log-polar spectrum easy to interpret, a prefiltering stage inspired by a biological model of the human retina is applied: moving contours are enhanced and static contours are attenuated, high-frequency noise is removed, and illumination variations are cancelled. The estimated rotations are fed into a data fusion process that detects and interprets, in real time, head nods of approval or negation.
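The log-polar property the method relies on can be illustrated with a minimal NumPy sketch. It is not the authors' pipeline: there is no retina-like prefilter and no data fusion, and the input is a synthetic oriented grating rather than a head image. It only shows the general fact that turning the dominant orientation of an image shifts the energy of its log-polar magnitude spectrum along the angle axis, and that the sign of the shift gives the direction. All function names and parameters (`oriented_grating`, `log_polar_spectrum`, `orientation_profile`, `estimate_shift_degrees`, the padding factor, the grid sizes) are hypothetical choices made for this example.

```python
import numpy as np


def oriented_grating(size, theta, freq=0.1):
    """Gaussian-windowed cosine grating whose wave vector points along theta (radians)."""
    y, x = np.mgrid[:size, :size] - size / 2.0
    window = np.exp(-(x**2 + y**2) / (2.0 * (size / 6.0) ** 2))
    return window * np.cos(2.0 * np.pi * freq * (x * np.cos(theta) + y * np.sin(theta)))


def log_polar_spectrum(image, n_radii=64, n_angles=180, pad=4):
    """Magnitude spectrum sampled on a (log-radius, angle) grid by nearest neighbour."""
    n = pad * max(image.shape)  # zero-padding gives a finer frequency grid
    spectrum = np.abs(np.fft.fftshift(np.fft.fft2(image, s=(n, n))))
    centre = n // 2
    radii = np.exp(np.linspace(0.0, np.log(centre - 1), n_radii))
    # A real image has a point-symmetric magnitude spectrum, so [0, pi) is enough.
    angles = np.arange(n_angles) * np.pi / n_angles
    rr, aa = np.meshgrid(radii, angles, indexing="ij")
    rows = np.rint(centre + rr * np.sin(aa)).astype(int)
    cols = np.rint(centre + rr * np.cos(aa)).astype(int)
    return spectrum[rows, cols]


def orientation_profile(image):
    """Spectral energy per orientation: the log-polar spectrum summed over radius."""
    return log_polar_spectrum(image).sum(axis=0)


def estimate_shift_degrees(profile_a, profile_b):
    """Signed circular shift (degrees, modulo 180) that best maps profile_a onto profile_b."""
    n = len(profile_a)
    scores = [np.dot(profile_a, np.roll(profile_b, -s)) for s in range(n)]
    shift = int(np.argmax(scores)) * 180.0 / n
    # Orientation is 180-periodic; report the shift in (-90, 90] so its sign is a direction.
    return shift - 180.0 if shift > 90.0 else shift


if __name__ == "__main__":
    before = orientation_profile(oriented_grating(128, np.deg2rad(20.0)))
    after = orientation_profile(oriented_grating(128, np.deg2rad(50.0)))
    print("estimated shift (degrees):", estimate_shift_degrees(before, after))
```

With the gratings oriented at 20 and 50 degrees, the estimate should land near +30 degrees, up to the one-degree angular sampling and the nearest-neighbour resampling error. Swapping the two profiles flips the sign, which matches the goal stated above: the direction of the rotation matters, not its exact amplitude.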
