Multi-scale visual tracking by sequential belief propagation

A novel statistical method is proposed in this paper to overcome abrupt motion for robust visual tracking. Existing tracking methods that are based on the small motion assumption are vulnerable to abrupt motion, which may be induced by various factors, such as the unexpected dynamics changes of the target, frame dropping and camera motion, etc. Although with computational benefits, methods based on hierarchical search is inadequate to this problem because the propagation of the searching error may end up with bad estimates in fine scales. Since different scales contain different salient image features, we propose a new formulation in which searching and matching will be done collaboratively in different scales. The theoretical foundation of this new approach is based on dynamic Markov networks, where the bi-directional propagation of the belief of the target's posteriors on different scales reveals the collaboration among them. A nonparametric sequential belief propagation algorithm for the dynamic Markov network is developed by implementing the collaboration of a set of particle filters. Extensive experiments have demonstrated the effectiveness and efficiency of the proposed method to cope with various types of abrupt motions.

[1]  Michael Isard,et al.  A mixed-state condensation tracker with automatic model-switching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[2]  Gregory D. Hager,et al.  Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Andrew Blake,et al.  Learning dynamical models using expectation-maximisation , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[4]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[5]  William T. Freeman,et al.  Learning low-level vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[7]  Andrew Blake,et al.  Articulated body motion capture by annealed particle filtering , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[8]  Tommi S. Jaakkola,et al.  Tutorial on variational approximation methods , 2000 .

[9]  Michael I. Jordan,et al.  Graphical models: Probabilistic inference , 2002 .

[10]  Harry Shum,et al.  Hierarchical Shape Modeling for Automatic Face Localization , 2002, ECCV.

[11]  William T. Freeman,et al.  Nonparametric belief propagation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[12]  Song-Chun Zhu,et al.  Towards a mathematical theory of primal sketch and sketchability , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  Gang Hua,et al.  Tracking articulated body by dynamic Markov network , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  Michael Isard,et al.  PAMPAS: real-valued graphical models for computer vision , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[15]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.