Neural Network-based Head Pose Estimation and Multiview Fusion – Draft Version –

In this paper, we present two systems that were used for head pose estimation during the CLEAR06 Evaluation. We participated in two tasks: (1) estimating both pan and tilt orientation on synthetic, high-resolution head captures, and (2) estimating horizontal head orientation only, on real seminar recordings captured with multiple cameras from different viewing angles. In both systems, we used a neural network to estimate the person's head orientation. For the seminar recordings, a Bayes filter framework additionally provides a statistical fusion scheme that integrates every camera view into one joint hypothesis. In the monocular, high-resolution task, we achieved a mean error of 12.3° for horizontal head orientation and 12.77° for vertical head orientation. On the multi-view seminar recordings, our system correctly identified the head orientation class (one of eight classes) in 34.9% of the frames. When neighbouring classes were also counted as correct, 72.9% of the frames were classified correctly.
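The fusion step described above can be illustrated with a minimal sketch of a discrete Bayes filter over the eight horizontal orientation classes. This is not the evaluated system. The abstract does not give the filter's form, so the circular motion model, the `stay` probability, the conditional-independence assumption across cameras, and all function names are assumptions made for illustration. The sketch also assumes that each camera's network output has already been mapped into a common room-centred class ordering.

```python
NUM_CLASSES = 8  # eight horizontal orientation classes, as in the seminar task


def normalize(p):
    """Scale a list of non-negative weights so that it sums to 1."""
    total = sum(p)
    if total <= 0:
        raise ValueError("distribution has no probability mass")
    return [x / total for x in p]


def predict(belief, stay=0.8):
    """Motion step (assumed model): most mass stays in place, the rest
    leaks equally to the two circular neighbour classes."""
    move = (1.0 - stay) / 2.0
    n = len(belief)
    return [
        stay * belief[i]
        + move * belief[(i - 1) % n]
        + move * belief[(i + 1) % n]
        for i in range(n)
    ]


def update(belief, camera_likelihoods, floor=1e-6):
    """Measurement step: multiply in each camera's per-class score,
    treating the cameras as conditionally independent (an assumption)."""
    posterior = list(belief)
    for likelihood in camera_likelihoods:
        # The floor keeps one overconfident camera from zeroing out a class.
        posterior = [p * max(l, floor) for p, l in zip(posterior, likelihood)]
    return normalize(posterior)


def fuse_frame(belief, camera_likelihoods, stay=0.8):
    """One filter iteration: predict, then fuse all camera views."""
    return update(predict(belief, stay), camera_likelihoods)


def circular_class_distance(a, b, n=NUM_CLASSES):
    """Distance between two classes on the circle of n classes."""
    d = abs(a - b) % n
    return min(d, n - d)


# Hypothetical per-class scores from two camera views for one frame.
camera_a = [0.05, 0.05, 0.60, 0.10, 0.05, 0.05, 0.05, 0.05]
camera_b = [0.05, 0.05, 0.40, 0.30, 0.05, 0.05, 0.05, 0.05]

belief = [1.0 / NUM_CLASSES] * NUM_CLASSES  # uniform prior
belief = fuse_frame(belief, [camera_a, camera_b])
estimate = max(range(NUM_CLASSES), key=lambda i: belief[i])
```

Under this sketch, a frame counts as correct when `circular_class_distance(estimate, truth) == 0`, and as correct with neighbouring classes allowed when the distance is at most 1, which mirrors the two accuracy figures reported above.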
