Robust 3D Face Shape Reconstruction from Single Images via Two-Fold Coupled Structure Learning and Off-the-Shelf Landmark Detectors

In this paper, we propose a robust method for monocular face shape reconstruction (MFSR) using a sparse set of facial landmarks that are detected by most of the off-theshelf landmark detectors. Different from the classical shape-from-shading framework, we formulate the MFSR problem as a Two-Fold Coupled Structure Learning (2FCSL) process, which consists of learning a regression between two subspaces spanned by 3D sparse landmarks and 2D sparse landmarks, and a coupled dictionary learned on 3D sparse and dense shape using K-SVD. To handle variations in face pose, we explicitly incorporate pose estimation in our method. Extensive experiments on both synthetic and real data from two challenging datasets using manual and automatic landmarks indicate that our method achieves promising performance and is robust to pose variations and landmark localization noise.

[1]  Ruigang Yang,et al.  Learning 3D shape from a single facial image via non-linear manifold embedding and alignment , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  P. Hanrahan,et al.  On the relationship between radiance and irradiance: determining the illumination from images of a convex Lambertian object. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[3]  Aly A. Farag,et al.  Model-based 3D shape recovery from single images of unknown pose and illumination using a small number of feature points , 2011, 2011 International Joint Conference on Biometrics (IJCB).

[4]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[5]  Stefanos Zafeiriou,et al.  300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[6]  Ioannis A. Kakadiaris,et al.  Viewpoint invariant 3D landmark model inference from monocular 2D images using higher-order priors , 2011, 2011 International Conference on Computer Vision.

[7]  Ioannis A. Kakadiaris,et al.  Can we do better in unimodal biometric systems? A novel rank-based score normalization framework for multi-sample galleries , 2013, 2013 International Conference on Biometrics (ICB).

[8]  William A. P. Smith,et al.  A Linear Approach to Face Shape and Texture Recovery using a 3D Morphable Model , 2010, BMVC.

[9]  Hua Li,et al.  Automatic, Effective, and Efficient 3D Face Reconstruction from Arbitrary View Image , 2004, PCM.

[10]  Lei Zhang,et al.  Face recognition from a single training image under arbitrary unknown lighting using spherical harmonics , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Edwin R. Hancock,et al.  A Coupled Statistical Model for Face Shape Recovery From Brightness Images , 2007, IEEE Transactions on Image Processing.

[12]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Aly A. Farag,et al.  3D face recovery from intensities of general and unknown lighting using Partial Least Squares , 2010, 2010 IEEE International Conference on Image Processing.

[14]  William A. P. Smith,et al.  A Linear Approach of 3 D Face Shape and Texture Recovery using a 3 D Morphable Model , 2010 .

[15]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[16]  Ira Kemelmacher-Shlizerman,et al.  Molding Face Shapes by Example , 2006, ECCV.

[17]  Hossein Mobahi,et al.  Toward a Practical Face Recognition System: Robust Alignment and Illumination by Sparse Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Gee-Sern Hsu,et al.  Face Recognition across Poses Using a Single 3D Reference Model , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[19]  Ira Kemelmacher-Shlizerman,et al.  Face Reconstruction from a Single Image using a Single Reference Face Shape , 2009 .

[20]  Edwin R. Hancock,et al.  Recovering Facial Shape Using a Statistical Model of Surface Normal Direction , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Ioannis A. Kakadiaris,et al.  Three-Dimensional Face Recognition in the Presence of Facial Expressions: An Annotated Deformable Model Approach , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[23]  Aly A. Farag,et al.  Model-based shape recovery from single images of general and unknown lighting , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[24]  Anil K. Jain,et al.  Deformation Modeling for Robust 3D Face Matching , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Chun Chen,et al.  Three-Dimensional Face Reconstruction From a Single Image by a Coupled RBF Network , 2012, IEEE Transactions on Image Processing.

[26]  Ioannis A. Kakadiaris,et al.  UHDB11 Database for 3D-2D Face Recognition , 2013, PSIVT.

[27]  Yuxiao Hu,et al.  Automatic 3D reconstruction for face recognition , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[28]  Ronen Basri,et al.  Lambertian Reflectance and Linear Subspaces , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  David W. Jacobs,et al.  Generalized Multiview Analysis: A discriminative latent space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Hans-Peter Meinzer,et al.  Statistical shape models for 3D medical image segmentation: A review , 2009, Medical Image Anal..

[31]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Mario Castelán,et al.  Using Subspace Multiple Linear Regression for 3D Face Shape Prediction from a Single Image , 2009, ISVC.

[33]  Edwin R. Hancock,et al.  Acquiring height data from a single image of a face using local shape indicators , 2006, Comput. Vis. Image Underst..

[34]  Chunheng Wang,et al.  Regularized Latent Least Square Regression for Cross Pose Face Recognition , 2013, IJCAI.

[35]  Junzhou Huang,et al.  Sparse shape composition: A new framework for shape prior modeling , 2011, CVPR 2011.

[36]  Michael Lindenbaum,et al.  Shape Reconstruction of 3D Bilaterally Symmetric Surfaces , 2000, International Journal of Computer Vision.

[37]  Tal Hassner,et al.  Viewing Real-World Faces in 3D , 2013, 2013 IEEE International Conference on Computer Vision.

[38]  T. Vetter,et al.  A statistical method for robust 3D surface reconstruction from sparse data , 2004 .

[39]  Ioannis A. Kakadiaris,et al.  Bidirectional relighting for 3D-aided 2D face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[40]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[41]  Rama Chellappa,et al.  Symmetric Shape-from-Shading Using Self-ratio Image , 2001, International Journal of Computer Vision.

[42]  Patrick J. Flynn,et al.  Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[43]  R. Basri,et al.  Statistical Symmetric Shape from Shading for 3D Structure Recovery of Faces , 2004, eccv 2004.