Coarse-to-fine particle filters for multi-object human computer interaction

Efficient motion tracking of faces is an important aspect for human computer interaction (HCI). In this paper we combine the condensation and the wavelet approximated reduced vector machine (W-RVM) approach. Both are joined by the core idea to spend only as much as necessary effort for easy to discriminate regions (Condensation) or vectors (W-RVM) of the feature space, but most for regions with high statistical likelihood to contain objects of interest. We adapt the W-RVM classifler for tracking by providing a probabilistic output. In this paper we utilize condensation for template based tracking of the three-dimensional camera scene. Moreover, we introduce a robust multi-object tracking by extensions to the condensation approach. The novel coarse-to-flne condensation yields a more than 10 times faster tracking than state-of-art detection methods. We demonstrate more natural HCI applications by high resolution face tracking within a large camera scene with an active dual camera system.

[1]  Sami Romdhani,et al.  Wavelet Frame Accelerated Reduced Support Vector Machines , 2008, IEEE Transactions on Image Processing.

[2]  Sami Romdhani,et al.  Over-Complete Wavelet Approximation of a Support Vector Machine for Efficient Classification , 2005, DAGM-Symposium.

[3]  Tsuhan Chen,et al.  Tracking of multiple faces for human-computer interfaces and virtual environments , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Lars Bretzner,et al.  Qualitative Multiscale Feature Hierarchies for Object Tracking , 2000, J. Vis. Commun. Image Represent..

[6]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[7]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[8]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[9]  Sami Romdhani,et al.  Efficient Face Detection by a Cascaded Support Vector Machine Using Haar-Like Features , 2004, DAGM-Symposium.

[10]  Quan Pan,et al.  Reliable and fast tracking of faces under varying pose , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[11]  Lars Bretzner,et al.  Qualitative Multi-scale Feature Hierarchies for Object Tracking , 1999, Scale-Space.

[12]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  B. Schölkopf,et al.  Efficient face detection by a cascaded support–vector machine expansion , 2004, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[14]  Matthew Turk,et al.  A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[15]  J. Elder,et al.  Towards Face Recognition at a Distance , 2006 .

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Daijin Kim,et al.  Real-time multiple people tracking using competitive condensation , 2002, Proceedings. International Conference on Image Processing.

[18]  Richard Szeliski,et al.  Tracking with Kalman snakes , 1993 .