Analyzing Structured Deformable Shapes Via Mean Field Monte Carlo

This paper describes a novel approach to analyzing and tracking the motion of structured deformable shapes that consis t of multiple correlated deformable subparts. Due to the high dimensional nature of this problem, existing methods are plagued either by the inability of capturing detailed local deformation or the enormous complexity, induced by the curse of dimensionality. Taking advantage of the structure of the deformable shapes, the paper presents a new representation , i.e., dynamic Markov network model, to overcome the challenges induced by high dimensionality. Probabilistic vari ational analysis of this Markov network model reveals a set of fixed point equations, i.e., the mean field equations, which manifest the interactions among the posterior motions of these deformable subparts and suggest an efficient solution to such a high dimensional motion analysis problem. Combined with Monte Carlo strategies, the new algorithm, namely mean field Monte Carlo(MFMC), achieves very efficient Bayesian inference with close-to-linear com plexity. Experiments on tracking human faces and human lips demonstrate the effectiveness of the proposed method.

[1]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[2]  Michael Isard,et al.  ICONDENSATION: Unifying Low-Level and High-Level Tracking in a Stochastic Framework , 1998, ECCV.

[3]  William T. Freeman,et al.  Nonparametric belief propagation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[4]  Dorin Comaniciu,et al.  Nonparametric information fusion for motion estimation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[5]  Qiang Wang,et al.  Learning object intrinsic structure for robust visual tracking , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6]  Gang Hua,et al.  Tracking articulated body by dynamic Markov network , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[8]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[9]  Michael Isard,et al.  PAMPAS: real-valued graphical models for computer vision , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[10]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[11]  Tommi S. Jaakkola,et al.  Tutorial on variational approximation methods , 2000 .

[12]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[13]  Andrew Blake,et al.  Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications , 1996, ECCV.

[14]  Harry Shum,et al.  Hierarchical Shape Modeling for Automatic Face Localization , 2002, ECCV.

[15]  William T. Freeman,et al.  Learning low-level vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.