A PCA/MDA scheme for hand posture recognition

Principal component analysis (PCA) and multiple discriminant analysis (MDA) have long been used for appearance-based hand posture recognition. In this paper, we propose a novel PCA/MDA scheme for hand posture recognition. The scheme is represented by two layers of nodes (classes). The first layer of nodes acts as a crude classification using PCA, and each input pattern is given a likelihood of being in the nodes of this layer. Then MDA is applied locally to the postures in each node of the first layer to give a precise classification of the postures. Each precise class is a node in the second layer. For training, unsupervised classification at the first layer can be obtained using expectation maximization (EM). For better training results, a feedback from each node in the second layer is introduced in the EM process. The experiments on a 100-sign vocabulary show a significant improvement from 57.0% to 63.5%, compared with the global MDA. If combined with a hidden Markov model (HMM) for movement modeling, about a 93.5% recognition rate is achieved for test data.

[1]  Tosiyasu L. Kunii,et al.  Model-based analysis of hand posture , 1995, IEEE Computer Graphics and Applications.

[2]  Eli Hagen,et al.  Towards an American Sign Language interface , 1994, Artificial Intelligence Review.

[3]  Hermann Hienz,et al.  Relevant features for video-based continuous sign language recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[4]  谷口 倫一郎,et al.  Appearance-based Recognition of Hand Shapes for Sign Language in Low Resolution Image , 2000 .

[5]  Robert Mills Wilson,et al.  Come Sign With Us: Sign Language Activities for Children , 1996 .

[6]  Dimitris N. Metaxas,et al.  ASL recognition based on a coupling between HMMs and 3D motion analysis , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[7]  Takeo Kanade,et al.  Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[8]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[9]  Ying Wu,et al.  Capturing articulated human hand motion: a divide-and-conquer approach , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Yuntao Cui,et al.  Appearance-Based Hand Sign Recognition from Intensity Image Sequences , 2000, Comput. Vis. Image Underst..

[12]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[13]  Ying Wu,et al.  View-independent recognition of hand postures , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[14]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Takeo Kanade,et al.  Visual Tracking of High DOF Articulated Structures: an Application to Human Hand Tracking , 1994, ECCV.

[16]  Rómer Rosales,et al.  3D Hand Pose Reconstruction Using Specialized Mappings , 2001, ICCV.

[17]  Stan Sclaroff,et al.  3D hand pose reconstruction using specialized mappings , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.