Context-driven model switching for visual tracking

Abstract A major challenge for real-world object tracking is the dynamic nature of the environmental conditions with respect to illumination, motion, visibility, etc. For such an environment which may experience drastic changes at any time, integration of multiple and complementary cues promises to increase robustness of visual tracking. Nevertheless, one has to expect that false positive tracking will occur. In order to be able to recover from such tracking failure this paper introduces a novel method for automatically choosing the object model which best fits the current context based on information-theoretic concepts. In order to validate the effectiveness of the proposed model switching, it is integrated into a multi-cue face tracking system and experimentally evaluated.

[1]  Michael Isard,et al.  Active Contours: The Application of Techniques from Graphics, Vision, Control Theory and Statistics to Visual Tracking of Shapes in Motion , 2000 .

[2]  Bernt Schiele,et al.  Using Mutual Information To Combine Object Models , 2000 .

[3]  Michael Isard,et al.  A mixed-state condensation tracker with automatic model-switching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[4]  C. Malsburg,et al.  Self-organized integration of adaptive visual cues for face tracking , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[5]  E. Granum,et al.  Skin colour detection under changing lighting conditions , 1999 .

[6]  Michael Isard,et al.  ICONDENSATION: Unifying Low-Level and High-Level Tracking in a Stochastic Framework , 1998, ECCV.

[7]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Atsuto Maki,et al.  A computational model of depth-based attention , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[9]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[10]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[11]  Atsuto Maki,et al.  Towards an active visual observer , 1995, Proceedings of IEEE International Conference on Computer Vision.

[12]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1995, Proceedings of IEEE International Conference on Computer Vision.

[13]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Bernt Schiele,et al.  Towards Robust Multi-cue Integration for Visual Tracking , 2001, ICVS.