Dynamic hand gesture recognition using a CNN model with 3D receptive fields

In this paper, a pattern recognition model for dynamic hand gesture recognition is proposed. The proposed model combines a convolutional neural network (CNN) with a weighted fuzzy min-max (WFMM) neural network; each module performs feature extraction and feature analysis, respectively. The data representation proposed in this research is a spatiotemporal template which is based on the motion information of the target object. To process the data, we develop a modified CNN model by extending the receptive field to a three-dimensional structure. To increase the efficiency of the pattern classifier, we use a feature analysis technique utilizing the WFMM algorithm. The experimental results show that the proposed method can minimize the influence caused by the spatial and temporal variation of the feature points. The recognition performance using only the selected features for the classification process is evaluated.

[1]  P. Peer,et al.  Human skin color clustering for face detection , 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool..

[2]  Ah Chung Tsoi,et al.  Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[3]  Mubarak Shah,et al.  Actions sketch: a novel action representation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  P. K. Simpson Fuzzy Min-Max Neural Networks-Part 1 : Classification , 1992 .

[5]  Patrick K. Simpson,et al.  Fuzzy min-max neural networks. I. Classification , 1992, IEEE Trans. Neural Networks.

[6]  Hyun Seung Yang,et al.  A Weighted FMM Neural Network and Its Application to Face Detection , 2006, ICONIP.

[7]  Hui Gao,et al.  An expressive three-mode principal components model of human action style , 2003, Image Vis. Comput..

[8]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[9]  Christophe Garcia,et al.  Convolutional face finder: a neural architecture for fast and robust face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.