Unsupervised Joint Alignment of Complex Images

Many recognition algorithms depend on careful positioning of an object into a canonical pose, so the position of features relative to a fixed coordinate system can be examined. Currently, this positioning is done either manually or by training a class-specialized learning algorithm with samples of the class that have been hand-labeled with parts or poses. In this paper, we describe a novel method to achieve this positioning using poorly aligned examples of a class with no additional labeling. Given a set of unaligned examplars of a class, such as faces, we automatically build an alignment mechanism, without any additional labeling of parts or poses in the data set. Using this alignment mechanism, new members of the class, such as faces resulting from a face detector, can be precisely aligned for the recognition process. Our alignment method improves performance on a face recognition task, both over unaligned images and over images aligned with a face alignment algorithm specifically developed for and trained on hand-labeled face images. We also demonstrate its use on an entirely different class of objects (cars), again without providing any information about parts or pose to the learning algorithm.

[1]  F. Quimby What's in a picture? , 1993, Laboratory animal science.

[2]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[3]  Takeo Kanade,et al.  Rotation invariant neural network-based face detection , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[4]  Timothy F. Cootes,et al.  Active Appearance Models , 1998, ECCV.

[5]  Alex Pentland,et al.  Bayesian face recognition , 2000, Pattern Recognit..

[6]  Paul A. Viola,et al.  Learning from one example through shared densities on transforms , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[7]  Shuicheng Yan,et al.  Multi-view face alignment using direct appearance models , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[8]  Vladimir Kolmogorov,et al.  Visual correspondence using energy minimization and mutual information , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Paul A. Viola,et al.  Fast Multi-view Face Detection , 2003 .

[10]  Rogério Schmidt Feris,et al.  Real-time view-based face alignment using active wavelet networks , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[11]  Yi Zhou,et al.  Bayesian tangent shape model: estimating shape and pose parameters via Bayesian inference , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[12]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[13]  Yee Whye Teh,et al.  Names and faces in the news , 2004, CVPR 2004.

[14]  Stephen Lin,et al.  Face alignment under variable illumination , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[15]  W. Eric L. Grimson,et al.  Efficient Population Registration of 3D Data , 2005, CVBIA.

[16]  Harry Shum,et al.  A Bayesian mixture model for multi-view face alignment , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Erik G. Learned-Miller,et al.  Many Heads Are Better Than One: Jointly Removing Bias from Multiple MRIs Using Nonparametric Maximum Likelihood , 2005, IPMI.

[18]  Erik G. Learned-Miller,et al.  Data driven image models through continuous joint alignment , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Qiang Ji,et al.  Improving Face Recognition by Online Image Alignment , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[20]  Erik G. Learned-Miller,et al.  Discriminative Training of Hyper-feature Models for Object Identification , 2006, BMVC.