SASE: RGB-Depth Database for Human Head Pose Estimation

Head pose estimation has become very important in relation to facial and emotional recognition, as well as in human-computer interaction. There is an ultimate need for a 3D head pose database in order to develop head pose estimation methods using RGB and depth information. There are a few available datasets, such as Biwi Kinect head pose database, which is composed using Kinect 1, but it offers low-quality depth information. In this paper, a new 3D head database, SASE, is introduced. The data in SASE is acquired with Microsoft Kinect 2 camera, including RGB and depth information. The SASE database is composed by a total of 30000 frames with annotated markers. The samples include 32 male and 18 female subjects. For each person a large sample of head poses are included, within the bounds of yaw from \(-45\) to 45, pitch \(-75\) to 75 and roll \(-45\) to \(45^\circ \) of rotation around each axis. The details of acquiring the database and its characteristics are explained in detail.

[1]  Einar Meister,et al.  Real-time mimicking of estonian speaker's mouth movements on a 3D avatar using Kinect 2 , 2015, 2015 International Conference on Information and Communication Technology Convergence (ICTC).

[2]  Petros Maragos,et al.  Estimation of eye gaze direction angles based on active appearance models , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[3]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[4]  Hao Li,et al.  Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[5]  Fernando De la Torre,et al.  Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Xiangjian He,et al.  Shape-appearance-correlated active appearance model , 2016, Pattern Recognit..

[7]  Sergio Escalera,et al.  Accurate 3D measurement using optical depth information , 2015 .

[8]  Josephine Sullivan,et al.  One millisecond face alignment with an ensemble of regression trees , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Xiaoming Liu,et al.  Generic Face Alignment using Boosted Appearance Model , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Wang Chengzhang,et al.  BJUT-3D Large Scale 3D Face Database and Information Processing , 2009 .

[11]  Luc Van Gool,et al.  Real Time Head Pose Estimation from Consumer Depth Cameras , 2011, DAGM-Symposium.

[12]  Raimondo Schettini,et al.  UMB-DB: A database of partially occluded 3D faces , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[13]  Harry Shum,et al.  Face alignment using texture-constrained active shape models , 2003, Image Vis. Comput..

[14]  Timothy F. Cootes,et al.  Automatic model matching using part based model constrained active appearance models for skeletal maturity , 2015, 2015 Twelve International Conference on Electronics Computer and Computation (ICECCO).

[15]  Yiying Tong,et al.  FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.

[16]  Shiguang Shan,et al.  Coarse-to-Fine Auto-Encoder Networks (CFAN) for Real-Time Face Alignment , 2014, ECCV.

[17]  Chieh-Chih Wang,et al.  3D AAM based face alignment under wide angular variations using 2D and 3D data , 2012, 2012 IEEE International Conference on Robotics and Automation.

[18]  Richard S. Palais,et al.  Euler’s fixed point theorem: The axis of a rotation , 2007 .

[19]  Xiaoou Tang,et al.  Learning Deep Representation for Face Alignment with Auxiliary Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Luc Van Gool,et al.  Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[21]  Alan C. Bovik,et al.  Texas 3D Face Recognition Database , 2010, 2010 IEEE Southwest Symposium on Image Analysis & Interpretation (SSIAI).

[22]  Jun Wang,et al.  A 3D facial expression database for facial behavior research , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[23]  Jian Sun,et al.  Face Alignment by Explicit Shape Regression , 2012, International Journal of Computer Vision.

[24]  K.W. Bowyer,et al.  Using a Multi-Instance Enrollment Representation to Improve 3D Face Recognition , 2007, 2007 First IEEE International Conference on Biometrics: Theory, Applications, and Systems.

[25]  Li Zhang,et al.  Spacetime faces: high resolution capture for modeling and animation , 2004, SIGGRAPH 2004.

[26]  Luc Van Gool,et al.  Real time 3D face alignment with Random Forests-based Active Appearance Models , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[27]  Gholamreza Anbarjafari,et al.  Data Fusion Boosted Face Recognition Based on Probability Distribution Functions in Different Colour Channels , 2009, EURASIP J. Adv. Signal Process..

[28]  Tomás Pajdla,et al.  3D with Kinect , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[29]  Jean-Luc Dugelay,et al.  KinectFaceDB: A Kinect Database for Face Recognition , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[30]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[31]  L. Trefethen,et al.  Numerical linear algebra , 1997 .

[32]  Arman Savran,et al.  Bosphorus Database for 3D Face Analysis , 2008, BIOID.

[33]  Pavel Krsek,et al.  The Trimmed Iterative Closest Point algorithm , 2002, Object recognition supported by user interaction for service robots.

[34]  Xiaogang Wang,et al.  Deep Convolutional Network Cascade for Facial Point Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Jongmoo Choi,et al.  Real-time 3D face identification from a depth camera , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[36]  Fei Yang,et al.  Robust face tracking with a consumer depth camera , 2012, 2012 19th IEEE International Conference on Image Processing.

[37]  Peter Robinson,et al.  Face Alignment Assisted by Head Pose Estimation , 2015, BMVC.

[38]  Hanspeter Pfister,et al.  Face transfer with multilinear models , 2005, SIGGRAPH 2005.

[39]  Kun Zhou,et al.  Real-time facial animation with image-based dynamic avatars , 2016, ACM Trans. Graph..