论文信息 - Exploring photobios

Exploring photobios

We present an approach for generating face animations from large image collections of the same person. Such collections, which we call photobios, sample the appearance of a person over changes in pose, facial expression, hairstyle, age, and other variations. By optimizing the order in which images are displayed and cross-dissolving between them, we control the motion through face space and create compelling animations (e.g., render a smooth transition from frowning to smiling). Used in this context, the cross dissolve produces a very strong motion effect; a key contribution of the paper is to explain this effect and analyze its operating range. The approach operates by creating a graph with faces as nodes, and similarities as edges, and solving for walks and shortest paths on this graph. The processing pipeline involves face detection, locating fiducials (eyes/nose/mouth), solving for pose, warping to frontal views, and image comparison based on Local Binary Patterns. We demonstrate results on a variety of datasets including time-lapse photography, personal photo collections, and images of celebrities downloaded from the Internet. Our approach is the basis for the Face Movies feature in Google's Picasa.

Ira Kemelmacher-Shlizerman | Eli Shechtman | Steven M. Seitz | Rahul Garg

[1] Lance Williams,et al. View Interpolation for Image Synthesis , 1993, SIGGRAPH.

[2] Jun Rekimoto,et al. Time-machine computing: a time-centric approach for the information environment , 1999, UIST '99.

[3] John Lasseter,et al. Principles of traditional animation applied to 3D computer animation , 1987, SIGGRAPH.

[4] Thomas O. Binford,et al. On Detecting Edges , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[6] Matti Pietikäinen,et al. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7] George Sperling,et al. Stereomotion is processed by the third-order motion system: reply to comment on “Three-systems theory of human visual motion perception: review and update” , 2002 .

[8] Matti Pietikäinen,et al. Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10] R. Basri,et al. Direct visibility of point sets , 2007, SIGGRAPH 2007.

[11] E. Adelson,et al. Phenomenal coherence of moving visual patterns , 1982, Nature.

[12] S. Ullman,et al. Geometry and photometry in three-dimensional visual recognition , 1993 .

[13] Patrick Baudisch,et al. Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections , 2005, CHI EA '05.

[14] Shree K. Nayar,et al. Face swapping: automatically replacing faces in photographs , 2008, SIGGRAPH 2008.

[15] Shree K. Nayar,et al. FaceTracer: A Search Engine for Large Collections of Images with Faces , 2008, ECCV.

[16] Okan Arikan,et al. Interactive motion generation from examples , 2002, ACM Trans. Graph..

[17] Edward H. Adelson,et al. Motion without movement , 1991, SIGGRAPH.

[18] Benjamin B. Bederson,et al. PhotoMesa: a zoomable image browser using quantum treemaps and bubblemaps , 2001, UIST '01.

[19] Li Zhang,et al. Spacetime faces: high resolution capture for modeling and animation , 2004, SIGGRAPH 2004.

[20] Thomas Vetter,et al. A morphable model for the synthesis of 3D faces , 1999, SIGGRAPH.

[21] Steven M. Seitz,et al. Finding paths through the world's photos , 2008, SIGGRAPH 2008.

[22] D Marr,et al. Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23] Alex Pentland,et al. Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[24] Thaddeus Beier,et al. Feature-based image metamorphosis , 1992, SIGGRAPH.

[25] Richard Szeliski,et al. PSF estimation using sharp edge prediction , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[26] Yee Whye Teh,et al. Names and faces in the news , 2004, CVPR 2004.

[27] Christoph Bregler,et al. Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[28] Steven M. Seitz,et al. View morphing , 1996, SIGGRAPH.

[29] Richard Szeliski,et al. Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[30] Andreas Paepcke,et al. Time as essence for photo browsing through personal digital libraries , 2002, JCDL '02.

[31] Andrew Zisserman,et al. Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video , 2006, BMVC.

[32] Jonathan Brandt,et al. Robust object detection via soft cascade , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[34] David Salesin,et al. Video object annotation, navigation, and composition , 2008, UIST '08.

[35] Marwan Mattar,et al. Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .