Joint Albedo Estimation and Pose Tracking from Video

The albedo of a Lambertian object is a surface property that contributes to the object's appearance under changing illumination. As a signature independent of illumination, the albedo is useful for object recognition. Albedo estimation algorithms that rely on a single image suffer from shadows and non-Lambertian effects in the image. In this paper, we propose a sequential algorithm to estimate the albedo from a sequence of images of a known 3D object in varying poses and illumination conditions. We first show that, when the pose of the object is known or estimated at each frame of a sequence, the object's albedo can be efficiently estimated using a Kalman filter. We then extend this to the case of unknown pose by simultaneously tracking the pose and updating the albedo through a Rao-Blackwellized particle filter (RBPF). More specifically, the albedo is marginalized from the posterior distribution and estimated analytically using the Kalman filter, while the pose parameters are estimated using importance sampling and by minimizing the projection error of the face onto its spherical harmonic subspace, which results in an illumination-insensitive pose tracking algorithm. Illustrations and experiments on synthetic and real sequences validate the effectiveness of the approach, followed by applications to unconstrained, video-based face recognition.
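To illustrate the known-pose case, the following is a minimal sketch of a per-pixel Kalman update for a static albedo. It is not the paper's formulation: it assumes a simplified scalar observation model, image = albedo × shading + Gaussian noise, where the shading term (hypothetically precomputed from the known pose, shape, and illumination) is given for every frame. The function names, the prior values, and the noise variance `obs_var` are assumptions made for illustration.

```python
import numpy as np


def kalman_albedo_update(mean, var, image, shading, obs_var):
    """One per-pixel Kalman measurement update for a static albedo state.

    Assumed model (illustrative only):
        image = albedo * shading + noise,  noise ~ N(0, obs_var)
    All array arguments share the same shape; each pixel is treated
    independently.
    """
    innovation = image - shading * mean            # measurement residual
    innovation_var = shading ** 2 * var + obs_var  # residual variance
    gain = var * shading / innovation_var          # Kalman gain
    new_mean = mean + gain * innovation
    new_var = (1.0 - gain * shading) * var
    return new_mean, new_var


def estimate_albedo(images, shadings, obs_var=1e-2,
                    prior_mean=0.5, prior_var=1.0):
    """Sequentially fuse frames into a per-pixel albedo mean and variance."""
    first = np.asarray(images[0], dtype=float)
    mean = np.full(first.shape, prior_mean, dtype=float)
    var = np.full(first.shape, prior_var, dtype=float)
    for image, shading in zip(images, shadings):
        mean, var = kalman_albedo_update(
            mean, var,
            np.asarray(image, dtype=float),
            np.asarray(shading, dtype=float),
            obs_var,
        )
    return mean, var
```

Under this model, pixels with zero shading (for example, in attached shadow) produce a zero gain, so they leave the estimate unchanged, and the posterior variance gives a per-pixel measure of confidence. In an RBPF, an update of this kind would be run once per particle, conditioned on that particle's pose hypothesis.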
