Face Recognition with RGB-D Images Using Kinect

Face Recognition is one of the most extensively researched problems in biometrics, and many techniques have been proposed in the literature. While the performance of automated algorithms is close to perfect in constrained environments with controlled illumination, pose, and expression variations, recognition in unconstrained environments is still difficult. To mitigate the effect of some of these challenges, researchers have proposed to utilize 3D images which can encode much more information about the face than 2D images. However, due to sensor cost, 3D face images are expensive to capture. On the other hand, RGB-D images obtained using consumer-level devices such as the Kinect , which provide pseudo-depth data in addition to a visible spectrum color image, have a trade-off between quality and cost. In this chapter, we discuss existing RGB-D face recognition algorithms and present a state-of-the-art algorithm based on extracting discriminatory features using entropy and saliency from RGB-D images. We also present an overview of available RGB-D face datasets along with experimental results and analysis to understand the various facets of RGB-D face recognition.

[1]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[2]  Yaniv Taigman,et al.  Descriptor Based Methods in the Wild , 2008 .

[3]  Yiying Tong,et al.  FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.

[4]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Maurício Pamplona Segundo,et al.  Continuous 3D Face Authentication Using RGB-D Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[6]  Guodong Guo,et al.  Face recognition robust to head pose changes based on the RGB-D sensor , 2013, 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[7]  Sven Behnke,et al.  Real-Time Plane Segmentation Using RGB-D Cameras , 2012, RoboCup.

[8]  Arun Ross,et al.  Score normalization in multimodal biometric systems , 2005, Pattern Recognit..

[9]  A. Rényi On Measures of Entropy and Information , 1961 .

[10]  King Ngi Ngan,et al.  A Head Pose Tracking System Using RGB-D Camera , 2013, ICVS.

[11]  Andrea F. Abate,et al.  2D and 3D face recognition: A survey , 2007, Pattern Recognit. Lett..

[12]  Chengjun Liu,et al.  Color Image Discriminant Models and Algorithms for Face Recognition , 2008, IEEE Transactions on Neural Networks.

[13]  Alberto Del Bimbo,et al.  Sparse Matching of Salient Facial Curves for Recognition of 3-D Faces With Missing Parts , 2013, IEEE Transactions on Information Forensics and Security.

[14]  Wolfram Burgard,et al.  Real-time 3D visual SLAM with a hand-held camera , 2011 .

[15]  Anil K. Jain,et al.  Handbook of Face Recognition, 2nd Edition , 2011 .

[16]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[17]  Ajmal S. Mian,et al.  Using Kinect for face recognition under varying poses, expressions, illumination and disguise , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[18]  R I Hg,et al.  An RGB-D Database Using Microsoft's Kinect for Windows for Face Detection , 2012, 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems.

[19]  Alberto Del Bimbo,et al.  Face Recognition by Super-Resolved 3D Models From Consumer Depth Cameras , 2014, IEEE Transactions on Information Forensics and Security.

[20]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[21]  Jieping Ye,et al.  Two-Dimensional Linear Discriminant Analysis , 2004, NIPS.

[22]  Avinash C. Kak,et al.  Estimating head pose with an RGBD sensor: A comparison of appearance-based and pose-based local subspace methods , 2013, 2013 IEEE International Conference on Image Processing.

[23]  Etienne Corvée,et al.  Body Parts Detection for People Tracking Using Trees of Histogram of Oriented Gradient Descriptors , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[24]  Samarth Bharadwaj,et al.  On RGB-D face recognition using Kinect , 2013, 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[25]  Lianwen Jin,et al.  A novel feature extraction method using Pyramid Histogram of Orientation Gradients for smile recognition , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[26]  Kun Duan,et al.  Discovering localized attributes for fine-grained recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Ramesh C. Jain,et al.  Invariant surface characteristics for 3D object recognition in range images , 1985, Comput. Vis. Graph. Image Process..

[28]  Yue Lu,et al.  Face Recognition Using Scale Invariant Feature Transform and Support Vector Machine , 2008, 2008 The 9th International Conference for Young Computer Scientists.

[29]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[31]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[32]  Tieniu Tan,et al.  Combining Statistics of Geometrical and Correlative Features for 3D Face Recognition , 2006, BMVC.

[33]  Matti Pietikäinen,et al.  Face Recognition with Local Binary Patterns , 2004, ECCV.

[34]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[36]  Dieter Fox,et al.  RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[37]  Ira Kemelmacher-Shlizerman,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 3d Face Reconstruction from a Single Image Using a Single Reference Face Shape , 2022 .

[38]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Richa Singh,et al.  RGB-D Face Recognition With Texture and Attribute Features , 2014, IEEE Transactions on Information Forensics and Security.

[40]  Chun-Hao Wang,et al.  Graph cut video object segmentation using histogram of oriented gradients , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[41]  Ajmal Mian Illumination invariant recognition and 3D reconstruction of faces using desktop optics. , 2011, Optics express.

[42]  Paul J. Besl,et al.  Method for registration of 3-D shapes , 1992, Other Conferences.

[43]  Alan C. Bovik,et al.  Texas 3D Face Recognition Database , 2010, 2010 IEEE Southwest Symposium on Image Analysis & Interpretation (SSIAI).

[44]  Jean-Luc Dugelay,et al.  An Efficient LBP-Based Descriptor for Facial Depth Images Applied to Gender Recognition Using RGB-D Face Data , 2012, ACCV Workshops.

[45]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[46]  Adriana Kovashka,et al.  WhittleSearch: Image search with relative attribute feedback , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Patrick J. Flynn,et al.  Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[48]  Gee-Sern Hsu,et al.  RGB-D-Based Face Reconstruction and Recognition , 2014, IEEE Trans. Inf. Forensics Secur..

[49]  Víctor González-Pacheco,et al.  Integration of a low-cost RGB-D sensor in a social robot for gesture recognition , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[50]  Patrick J. Flynn,et al.  A survey of approaches to three-dimensional face recognition , 2004, ICPR 2004.

[51]  Fatih Murat Porikli,et al.  Region Covariance: A Fast Descriptor for Detection and Classification , 2006, ECCV.

[52]  Michael Unser,et al.  B-spline signal processing. I. Theory , 1993, IEEE Trans. Signal Process..