A multi-class classification strategy for Fisher scores: Application to signer independent sign language recognition

Fisher kernels combine the powers of discriminative and generative classifiers by mapping the variable-length sequences to a new fixed length feature space, called the Fisher score space. The mapping is based on a single generative model and the classifier is intrinsically binary. We propose a multi-class classification strategy that applies a multi-class classification on each Fisher score space and combines the decisions of multi-class classifiers. We experimentally show that the Fisher scores of one class provide discriminative information for the other classes as well. We compare several multi-class classification strategies for Fisher scores generated from the hidden Markov models of sign sequences. The proposed multi-class classification strategy increases the classification accuracy in comparison with the state of the art strategies based on combining binary classifiers. To reduce the computational complexity of the Fisher score extraction and the training phases, we also propose a score space selection method and show that, similar or even higher accuracies can be obtained by using only a subset of the score spaces. Based on the proposed score space selection method, a signer adaptation technique is also presented that does not require any re-training.

[1]  Lale Akarun,et al.  A belief-based sequential fusion approach for fusing manual and non-manual signs , 2008 .

[2]  Hong Man,et al.  Face recognition based on multi-class mapping of Fisher scores , 2005, Pattern Recognit..

[3]  Lale Akarun,et al.  A belief-based sequential fusion approach for fusing manual signs and non-manual signals , 2009, Pattern Recognit..

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Wu Chou,et al.  Discriminative learning in sequential pattern recognition , 2008, IEEE Signal Processing Magazine.

[6]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.

[7]  David Haussler,et al.  Using the Fisher Kernel Method to Detect Remote Protein Homologies , 1999, ISMB.

[8]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[9]  Bernhard Schölkopf,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[10]  Agnès Just,et al.  Two-Handed Gesture Recognition , 2005 .

[11]  Surendra Ranganath,et al.  Deciphering gestures with layered meanings and signer adaptation , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[12]  H. Man,et al.  Hybrid IMM/SVM approach for wavelet-domain probabilistic model based texture classification , 2005 .

[14]  Pietro Perona,et al.  Combining generative models and Fisher kernels for object recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[15]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[16]  Pedro J. Moreno,et al.  Using the Fisher kernel method for Web audio classification , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[17]  Mark J. F. Gales,et al.  Speech Recognition using SVMs , 2001, NIPS.

[18]  Thomas Burger,et al.  Cued speech hand shape recognition - belief functions as a formalism to fuse svms and expert systems , 2007, VISAPP.

[19]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[20]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[21]  Oya Aran,et al.  VISION BASED SIGN LANGUAGE RECOGNITION: MODELING AND RECOGNIZING ISOLATED SIGNS WITH MANUAL AND NON-MANUAL COMPONENTS , 2008 .

[22]  L. Akarun,et al.  Facial feature tracking and expression recognition for sign language , 2008, 2008 23rd International Symposium on Computer and Information Sciences.

[23]  Nianjun Liu,et al.  Model structure selection & training algorithms for an HMM gesture recognition system , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[24]  Bülent Sankur,et al.  SignTutor: An Interactive System for Sign Language Tutoring , 2009, IEEE Multimedia.

[25]  Mark J. F. Gales,et al.  Using SVMs to classify variable length speech patterns , 2002 .

[26]  David G. Stork,et al.  Pattern Classification , 1973 .

[27]  Lale Akarun,et al.  A Database of Non-Manual Signs in Turkish Sign Language , 2007, 2007 IEEE 15th Signal Processing and Communications Applications.

[28]  Ying Wu,et al.  Hand modeling, analysis and recognition , 2001, IEEE Signal Process. Mag..

[29]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[30]  Surendra Ranganath,et al.  Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  T. V. Lakshman,et al.  Call admission control in wireless multimedia networks , 2004, IEEE Signal Processing Magazine.

[32]  Lale Akarun,et al.  Multi-class classification strategies for Fisher scores of gesture and sign sequences , 2008, 2008 19th International Conference on Pattern Recognition.

[33]  Y. V. Venkatesh,et al.  Understanding gestures with systematic variations in movement dynamics , 2006, Pattern Recognit..

[34]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[35]  Ismail Ari,et al.  Facial feature tracking and expression recognition for sign language , 2009, 2009 IEEE 17th Signal Processing and Communications Applications Conference.

[36]  Lale Akarun,et al.  Recognizing Two Handed Gestures with Generative, Discriminative and Ensemble Methods Via Fisher Kernels , 2006, MRCS.