Spatialized epitome and its applications

Due to the lack of explicit spatial consideration, existing epitome model may fail for image recognition and target detection, which directly motivates us to propose the so-called spatialized epitome in this paper. Extended from the original graphical model of epitome, the spatialized epitome provides a general framework to integrate both appearance and spatial arrangement of patches in the image to achieve a more precise likelihood representation for image(s) and eliminate ambiguities in image reconstruction and recognition. From the extended graphical model of epitome, an EM learning procedure is derived under the framework of variational approximation. The learning procedure can generate an optimized summary of the image appearance with spatial distribution of the similar patches. From the spatialized epitome, we present a principled way of inferring the probability of a new input image under the learnt model and thereby enabling image recognition and target detection. We show how the incorporation of spatial information enhances the epitome's ability for discrimination on several vision tasks, e.g., misalignment/cross-pose face recognition and vehicle detection with a few training samples.

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Huamin Wang,et al.  Factoring repeated content within and among images , 2008, ACM Trans. Graph..

[3]  Brendan J. Frey,et al.  Video Epitomes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Shuicheng Yan,et al.  Misalignment-Robust Face Recognition , 2008, IEEE Transactions on Image Processing.

[5]  Ashish Kapoor,et al.  The audio epitome: a new representation for modeling and classifying auditory phenomena , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6]  Volker Tresp,et al.  Averaging, maximum penalized likelihood and Bayesian estimation for improving Gaussian mixture probability density estimates , 1998, IEEE Trans. Neural Networks.

[7]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[8]  Brendan J. Frey,et al.  Epitomic analysis of appearance and shape , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Rama Chellappa,et al.  Epitomic Representation of Human Activities , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Nebojsa Jojic,et al.  Capturing long-range correlations with patch models , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[12]  Simon J. D. Prince,et al.  Epitomized priors for multi-labeling problems , 2009, CVPR.

[13]  Antonio Criminisi,et al.  Epitomic location recognition , 2008, CVPR.

[14]  Denis Simakov,et al.  Summarizing visual data using bidirectional similarity , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Carsten Rother,et al.  Clustering appearance and shape by learning jigsaws , 2006, NIPS.