A Neural-Network-Based Hand Posture Recognition Method

In various pattern recognition applications, angle variation is always a main challenging factor for producing reliable recognition. To increase the endurance ability on angle variation, this paper adopts a Hierarchical Temporal Memory (HTM) algorithm which applies temporal information to organize time-sequence change of image features, and constructs invariant features so that the influence of angle variation can be effectively learnt and overcome. The proposed multi-angle HTM-based posture recognition method consists of two main modules of Hand Posture Image Pre-processing (HPIP) and Hand Posture Recognition (HPR). In HPIP, each input image is first processed individually by skin color detection, foreground segmentation and edge detection. Then, the three processed results are further combined linearly to locate a hand posture region. In HPR, the normalized image is forwarded to a HTM model for learning and recognizing of different kinds of hand postures. Experiment results show that when using the same continuous unconstrained hand posture database, the proposed method can achieve an 89.1 % high recognition rate for discriminating three kinds of hand postures, which are scissors, stone and paper, and outperforms both Adaboost (78.1 %) and SVM (79.9 %).

[1]  S. Murali,et al.  Segmentation of Motion Objects from Surveillance Video Sequences Using Temporal Differencing Combined with Multiple Correlation , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[2]  Lei Xie,et al.  Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling , 2007, IEEE Transactions on Multimedia.

[3]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[4]  Yongmin Kim,et al.  Motion estimation based on optical flow with adaptive gradients , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[5]  Gudrun Klinker,et al.  An LED-based multitouch sensor for LCD screens , 2010, TEI '10.

[6]  A. Bouzerdoum,et al.  A Bayesian approach to skin color classification in YCbCr color space , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[7]  William B. Thompson,et al.  Detecting moving objects , 1989, International Journal of Computer Vision.

[8]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[10]  Tarek M. Taha,et al.  A neocortex model implementation on reconfigurable logic with streaming memory , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[11]  Larry S. Davis,et al.  Real-time foreground-background segmentation using codebook model , 2005, Real Time Imaging.

[12]  J. Hawkins,et al.  On Intelligence , 2004 .