Unsupervised adaptive sign language recognition based on hypothesis comparison guided cross validation and linguistic prior filtering

Abstract Signer adaptation is important for sign language recognition systems because a fixed system cannot perform well on all kinds of signers. In supervised signer adaptation, the labeled adaptation data must be collected explicitly. To skip the data collecting process in signer adaptation, we propose a novel unsupervised adaptation method, namely the hypothesis comparison guided cross validation method. The method not only addresses the problem of the overlap between the data set to be labeled and the data set for adaptation, but also employs an additional hypothesis comparison step to decrease the noise rate of the adaptation data set. We also utilize linguistic prior knowledge to down sample the adaptation data list to further decrease the noise rate. To evaluate the effectiveness of the proposed method, the CASIIE-SL-Database is formed, which is the first specialized data set for unsupervised signer adaptation to the best of our knowledge. Experimental results show that the proposed method can achieve relative word error rate reductions of 3.93% and 4.05% respectively compared with self-teaching method and cross validation method. Though the method is proposed for signer adaptation, it can also be applied to speaker adaptation and writer adaptation directly.

[1]  Lale Akarun,et al.  A multi-class classification strategy for Fisher scores: Application to signer independent sign language recognition , 2010, Pattern Recognit..

[2]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[3]  Y. V. Venkatesh,et al.  Understanding gestures with systematic variations in movement dynamics , 2006, Pattern Recognit..

[4]  Karl-Friedrich Kraiss,et al.  Recent developments in visual sign language recognition , 2008, Universal Access in the Information Society.

[5]  Wen Gao,et al.  Signer-independent sign language recognition based on SOFM/HMM , 2001, Proceedings IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems.

[6]  Douglas D. O'Shaughnessy,et al.  Invited paper: Automatic speech recognition: History, methods and challenges , 2008, Pattern Recognit..

[7]  Karl-Friedrich Kraiss,et al.  Rapid signer adaptation for continuous sign language recognition using a combined approach of eigenvoices, MLLR, and MAP , 2008, 2008 19th International Conference on Pattern Recognition.

[8]  D. Angluin,et al.  Learning From Noisy Examples , 1988, Machine Learning.

[9]  Sadaoki Furui,et al.  Unsupervised Acoustic Model Adaptation Based on Ensemble Methods , 2010, IEEE Journal of Selected Topics in Signal Processing.

[10]  Kai-Fu Lee,et al.  Automatic Speech Recognition , 1989 .

[11]  Karl-Friedrich Kraiss,et al.  Robust Person-Independent Visual Sign Language Recognition , 2005, IbPRIA.

[12]  Zicheng Liu,et al.  Cross-dataset action detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Francisco B. Rodríguez,et al.  Extending the bioinspired hierarchical temporal memory paradigm for sign language recognition , 2012, Neurocomputing.

[14]  Dimitris N. Metaxas,et al.  A Framework for Recognizing the Simultaneous Aspects of American Sign Language , 2001, Comput. Vis. Image Underst..

[15]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[16]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[17]  Long Xu,et al.  Hypothesis comparison guided cross validation for unsupervised signer adaptation , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[18]  Khaled Assaleh,et al.  Video-based signer-independent Arabic sign language recognition using hidden Markov models , 2009, Appl. Soft Comput..

[19]  T.V. Sreenivas,et al.  A comparative study of speaker adaptation methods , 2008, TENCON 2008 - 2008 IEEE Region 10 Conference.

[20]  C W OngSylvie,et al.  Automatic Sign Language Analysis , 2005 .

[21]  Ruiduo Yang,et al.  Handling Movement Epenthesis and Hand Segmentation Ambiguities in Continuous Sign Language Recognition Using Nested Dynamic Programming , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[23]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[24]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Daniel Schneider,et al.  Rapid Signer Adaptation for Isolated Sign Language Recognition , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[26]  Sadaoki Furui,et al.  Unsupervisec cross-validation adaptation algorithms for improved adaptation performance , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Daniel P. Lopresti,et al.  Handwriting recognition research: Twenty years of achievement... and beyond , 2009, Pattern Recognit..

[28]  Roland Kuhn,et al.  Rapid speaker adaptation in eigenvoice space , 2000, IEEE Trans. Speech Audio Process..

[29]  Kouichi Murakami,et al.  Gesture recognition using recurrent neural networks , 1991, CHI.

[30]  David G. Stork,et al.  Pattern Classification , 1973 .

[31]  Steve Young,et al.  The HTK book version 3.4 , 2006 .

[32]  Wen Gao,et al.  Generating Data for Signer Adaptation , 2007, Gesture Workshop.

[33]  Surendra Ranganath,et al.  Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Surendra Ranganath,et al.  Deciphering gestures with layered meanings and signer adaptation , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[35]  Marcel J. T. Reinders,et al.  Sign Language Recognition by Combining Statistical DTW and Independent Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Wen Gao,et al.  Signer-Independent Continuous Sign Language Recognition Based on SRN/HMM , 2001, Gesture Workshop.