Variational Transform Invariant Mixture of Probabilistic PCA

In many video-based object recognition applications, the object appearances are acquired by visual tracking or detection and are inconsistent due to misalignments. We believe the misalignments can be removed if we can reduce the inconsistency in the object appearances caused by misalignments through clustering the objects in appearance, space and time domain simultaneously. We therefore propose to learn Transform Invariant Mixtures of Probabilistic PCA (TIMPPCA) model from the data while at the same time eliminating the misalignments. The model is formulated in a generative framework, and the misalignments are considered as hidden variables in the model. Variational EM update rules are then derived based on Variational Message Passing (VMP) techniques. The proposed TIMP-PCA is applied to improve head pose estimation performance and to detect the change of attention focus in meeting room video for meeting room video indexing/retrieval and achieves promising performance.

[1]  Brendan J. Frey,et al.  Fast Transformation-Invariant Factor Analysis , 2002, NIPS.

[2]  Brendan J. Frey,et al.  A comparison of algorithms for inference and learning in probabilistic graphical models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Brendan J. Frey,et al.  Transformed component analysis: joint estimation of spatial transformations and image components , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[5]  Erik G. Learned-Miller,et al.  Data driven image models through continuous joint alignment , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Mary P. Harper,et al.  VACE Multimodal Meeting Corpus , 2005, MLMI.

[7]  Christopher M. Bishop,et al.  Non-linear Bayesian Image Modelling , 2000, ECCV.

[8]  Charles M. Bishop,et al.  Variational Message Passing , 2005, J. Mach. Learn. Res..

[9]  Charles M. Bishop Variational principal components , 1999 .

[10]  Amnon Shashua,et al.  Manifold pursuit: a new approach to appearance based recognition , 2002, Object recognition supported by user interaction for service robots.