Multi-camera head pose estimation using an ensemble of exemplars

We present a method for head pose estimation for moving targets in multi-camera environments. Our approach utilizes an ensemble of exemplar classifiers for joint head detection and pose estimation and provides finer-grained predictions than previous approaches. We incorporate dynamic camera selection, which allows a variable number of cameras to be selected at each time step and provides a tunable trade-off between accuracy and speed. On a benchmark dataset for multi-camera head pose estimation, our method predicts head pan angle with a mean absolute error of ~ 8° for different moving targets.

[1]  James W. Davis,et al.  Minimal-latency human action recognition using reliable-inference , 2006, Image Vis. Comput..

[2]  Alexei A. Efros,et al.  What makes Paris look like Paris? , 2015, Commun. ACM.

[3]  Subramanian Ramanathan,et al.  An Adaptation Framework for Head-Pose Classification in Dynamic Multi-view Scenarios , 2012, ACCV.

[4]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Antonis A. Argyros,et al.  3D Head Pose Estimation from Multiple Distant Views , 2009, BMVC.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Vittorio Murino,et al.  Multi-class Classification on Riemannian Manifolds for Video Surveillance , 2010, ECCV.

[9]  Qiang Ji,et al.  In the Eye of the Beholder: A Survey of Models for Eyes and Gaze , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Subramanian Ramanathan,et al.  No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion , 2013, 2013 IEEE International Conference on Computer Vision.

[11]  Alexei A. Efros,et al.  Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[12]  Montse Pardàs,et al.  Head Pose Detection Based on Fusion of Multiple Viewpoint Information , 2006, CLEAR.

[13]  Subramanian Ramanathan,et al.  Exploring Transfer Learning Approaches for Head Pose Classification from Multi-view Surveillance Images , 2013, International Journal of Computer Vision.

[14]  Shaogang Gong,et al.  Head Pose Classification in Crowded Scenes , 2009, BMVC.

[15]  Murphy-ChutorianErik,et al.  Head Pose Estimation in Computer Vision , 2009 .

[16]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Subramanian Ramanathan,et al.  Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments , 2014, 2014 22nd International Conference on Pattern Recognition.

[18]  Rafael Muñoz-Salinas,et al.  Multi-camera head pose estimation , 2012, Machine Vision and Applications.

[19]  Richard Souvenir,et al.  Pedestrian Verification for Multi-Camera Detection , 2014, ACCV.