论文信息 - 3D face reconstruction from video using a generic model

3D face reconstruction from video using a generic model

Reconstructing a 3D model of a human face from a video sequence is an important problem in computer vision, with applications to recognition, surveillance, multimedia etc. However, the quality of 3D reconstructions using structure from motion (SfM) algorithms is often not satisfactory. One common method of overcoming this problem is to use a generic model of a face. Existing work using this approach initializes the reconstruction algorithm with this generic model. The problem with this approach is that the algorithm can converge to a solution very close to this initial value, resulting in a reconstruction which resembles the generic model rather than the particular face in the video which needs to be modeled. We propose a method of 3D reconstruction of a human face from video in which the 3D reconstruction algorithm and the generic model are handled separately. A 3D estimate is obtained purely from the video sequence using SfM algorithms without use of the generic model. The final 3D model is obtained after combining the SfM estimate and the generic model using an energy function that corrects for the errors in the estimate by comparing local regions in the two models. The optimization is done using a Markov chain Monte Carlo (MCMC) sampling strategy. The main advantage of our algorithm over others is that it is able to retain the specific features of the face in the video sequence even when these features are different from those of the generic model. The evolution of the 3D model through the various stages of the algorithm is presented.

[1] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[2] Donald Geman,et al. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Stuart German,et al. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1988 .

[4] James J. Clark,et al. Data Fusion for Sensory Information Processing Systems , 1990 .

[5] Olivier Faugeras,et al. 3D Dynamic Scene Analysis , 1992 .

[6] Jun S. Liu,et al. Sequential Monte Carlo methods for dynamic systems , 1997 .

[7] Simon J. Godsill,et al. On sequential simulation-based methods for Bayesian filtering , 1998 .

[8] TopicsJun S. LiuDepartment. Markov Chain Monte Carlo and Related Topics , 1999 .

[9] John Oliensis,et al. A Critique of Structure-from-Motion Algorithms , 2000, Comput. Vis. Image Underst..

[10] Simon J. Godsill,et al. On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[11] Zicheng Liu,et al. Model-based bundle adjustment with application to face modeling , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[12] Pascal Fua,et al. Regularized Bundle-Adjustment to Model Heads from Image Sequences without Calibration Data , 2000, International Journal of Computer Vision.

[13] Sridhar Srinivasan,et al. Extracting Structure from Optical Flow Using the Fast Error Search Technique , 2000, International Journal of Computer Vision.

[14] Rama Chellappa,et al. Stochastic Approximation and Rate-Distortion Analysis for Robust Structure and Motion Estimation , 2003, International Journal of Computer Vision.