Modified Dynamic Time Warping Based on Direction Similarity for Fast Gesture Recognition

We propose a modified dynamic time warping (DTW) algorithm that compares gesture-position sequences based on the direction of the gestural movement. Standard DTW does not specifically consider the two-dimensional characteristic of the user’s movement. Therefore, in gesture recognition, the sequence comparison by standard DTW needs to be improved. The proposed gesture-recognition system compares the sequences of the input gesture’s position with gesture positions saved in the database and selects the most similar gesture by filtering out unrelated gestures. The suggested algorithm uses the cosine similarity of the movement direction at each moment to calculate the difference and reflects the characteristics of the gesture movement by using the ratio of the Euclidean distance and the proportional distance to the calculated difference. Selective spline interpolation assists in solving the issue of recognition-decline at instances of gestures. Through experiments with public databases (MSRC-12 and G3D), the suggested algorithm revealed an improved performance on both databases compared to other methods.

[1]  Yong Du,et al.  Hierarchical recurrent neural network for skeleton based action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Helena M. Mentis,et al.  Instructing people for training gestural interactive systems , 2012, CHI.

[3]  I. Elamvazuthi,et al.  Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques , 2010, ArXiv.

[4]  Kazuhiro Seki,et al.  A shape-based similarity measure for time series data with ensemble learning , 2011, Pattern Analysis and Applications.

[5]  Robert L. Grossman,et al.  Proceedings of the 2001 SIAM International Conference on Data Mining , 2001 .

[6]  Ho-Sub Yoon,et al.  Hand gesture recognition using combined features of location, angle and velocity , 2001, Pattern Recognit..

[7]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[8]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[9]  Marcos Faúndez-Zanuy,et al.  On-line signature recognition based on VQ-DTW , 2007, Pattern Recognit..

[10]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[11]  P. S. Sastry,et al.  A survey of temporal data mining , 2006 .

[12]  Jing Zhang,et al.  Action Recognition From Depth Maps Using Deep Convolutional Neural Networks , 2016, IEEE Transactions on Human-Machine Systems.

[13]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[14]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[15]  Stan Salvador,et al.  FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space , 2004 .

[16]  Jihad El-Sana,et al.  Word spotting for handwritten documents using Chamfer Distance and Dynamic Time Warping , 2011, Electronic Imaging.

[17]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[18]  Daniel P. Lopresti,et al.  Speech biometric mapping for key binding cryptosystem , 2011, Defense + Commercial Sensing.

[19]  Alex Pentland,et al.  Space-time gestures , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Pichao Wang,et al.  Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Dimitrios Makris,et al.  G3D: A gaming action dataset and real time action recognition evaluation framework , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[22]  Hai-Son Le,et al.  Kinect Gesture Recognition: SVM vs. RVM , 2015, 2015 Seventh International Conference on Knowledge and Systems Engineering (KSE).

[23]  Byung-Uk Choi,et al.  Interactive auto-stereoscopic display with efficient and flexible interleaving , 2012 .