Tracking of Multi-state Hand Models Using Particle Filtering and a Hierarchy of Multi-scale Image Features

This paper presents an approach for simultaneous tracking and recognition of hierarchical object representations in terms of multiscale image features. A scale-invariant dissimilarity measure is proposed for comparing scale-space features at different positions and scales. Based on this measure, the likelihood of hierarchical, parameterized models can be evaluated in such a way that maximization of the measure over different models and their parameters allows for both model selection and parameter estimation. Then, within the framework of particle filtering, we consider the area of hand gesture analysis, and present a method for simultaneous tracking and recognition of hand models under variations in the position, orientation, size and posture of the hand. In this way, qualitative hand states and quantitative hand motions can be captured, and be used for controlling different types of computerised equipment.

[1]  Tony Lindeberg,et al.  Use your hand as a 3D mouse , 1998 .

[2]  Michael Isard,et al.  ICONDENSATION: Unifying Low-Level and High-Level Tracking in a Stochastic Framework , 1998, ECCV.

[3]  Ole Fogh Olsen,et al.  Multi-Scale Watershed Segmentation , 1997, Gaussian Scale-Space Theory.

[4]  Michael J. Black,et al.  A Probabilistic Framework for Matching Temporal Trajectories: CONDENSATION-Based Recognition of Gestures and Expressions , 1998, ECCV.

[5]  Arthur C. Sanderson,et al.  Multiple Resolution Representation and Probabilistic Matching of 2-D Gray-Scale Shape , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Lars Bretzner,et al.  Qualitative Multi-scale Feature Hierarchies for Object Tracking , 1999, Scale-Space.

[7]  Stephen M. Pizer,et al.  Object representation by cores: Identifying and representing primitive spatial regions , 1995, Vision Research.

[8]  Yuntao Cui,et al.  View-based hand segmentation and hand-sequence recognition with complex backgrounds , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[9]  Kazuo Kyuma,et al.  Computer vision for computer games , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[10]  Lars Bretzner,et al.  Use Your Hand as a 3-D Mouse, or, Relative Orientation from Extended Sequences of Sparse Point and Line Correspondences Using the Affine Trifocal Tensor , 1998, ECCV.

[11]  Michael Isard,et al.  A mixed-state condensation tracker with automatic model-switching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[12]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[13]  David J. Fleet,et al.  Stochastic Tracking of 3D Human Figures Using 2D Image Motion , 2000, ECCV.

[14]  David C. Hogg,et al.  Wormholes in shape space: tracking through discontinuous changes in shape , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[15]  David J. Fleet,et al.  Stochastic Tracking of 3 D Human Figures Using 2 D Image Motion , 2000 .

[16]  Takeo Kanade,et al.  Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[17]  Ali Shokoufandeh,et al.  View-based object recognition using saliency maps , 1999, Image Vis. Comput..

[18]  Michael Isard,et al.  Object localization by Bayesian correlation , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[19]  Andrew Blake,et al.  Articulated body motion capture by annealed particle filtering , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[20]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[21]  Heinrich H. Bülthoff View-based object recognition , 1997 .

[22]  Ali,et al.  View-Based Object Recognition Using Saliency Maps 1 , 1998 .

[23]  Stephen M. Pizer,et al.  A Multiresolution Hierarchical Approach to Image Segmentation Based on Intensity Extrema , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Max A. Viergever,et al.  Probabilistic Multiscale Image Segmentation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Roberto Cipolla,et al.  Robust structure from motion using motion parallax , 1993, 1993 (4th) International Conference on Computer Vision.

[26]  Jochen Triesch,et al.  Robust classification of hand postures against complex backgrounds , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[27]  Tony Lindeberg Detecting salient blob-like image structures and their scales , 1994 .

[28]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[29]  Hiromasa Nakatani,et al.  Tracking and Recognition of Hand Gesture in Three Dimensional Space , 2001 .

[30]  Andrew Blake,et al.  A Probabilistic Exclusion Principle for Tracking Multiple Objects , 2000, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[31]  Lars Bretzner,et al.  Scale selection in hybrid multi-scale representations , 2002 .