Class-dependent two-dimensional linear discriminant analysis using two-pass recognition strategy

In this paper, we introduce a novel class-dependent extension of two-dimensional linear discriminant analysis (2DLDA) named CD-2DLDA, applied in automatic speech recognition using two-pass recognition strategy. In the first pass, the class labels of test sample are obtained using baseline recognition. The labels are then used in CD transformation of test features. In the second pass, recognition of previously transformed test samples is performed using CD-2DLDA acoustic model. The novelty of the paper lies in improvement of the present 2DLDA algorithm by its modification to more precise, class-dependent estimations repeated separately for each class. The proposed approach is evaluated in several scenarios using the TIMIT corpus in phoneme-based continuous speech recognition task. CD-2DLDA features are compared to state-of-the-art MFCCs, conventional LDA and 2DLDA features. The experimental results show that our method performs better than MFCCs and LDA. Furthermore, the results confirm that CD-2DLDA markedly outperforms the 2DLDA method.

[1]  Xiao-Bing Li,et al.  Clustering-based two-dimensional linear discriminant analysis for speech recognition , 2007, INTERSPEECH.

[2]  Jianning Liang Class-dependent LDA for feature extraction and recognition , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[3]  Yu Hu,et al.  Heteroscedastic discriminant analysis with two-dimensional constraints , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Jieping Ye,et al.  Two-Dimensional Linear Discriminant Analysis , 2004, NIPS.

[5]  Jun-Ying Gan,et al.  Face Recognition Based on Two-Dimensional Heteroscedastic Discriminant Analysis , 2009, 2009 First International Conference on Information Science and Engineering.

[6]  Jozef Juhar,et al.  Recent Progress in Development of Language Model for Slovak Large Vocabulary Continuous Speech Recognition , 2012 .

[7]  Heng Huang,et al.  Symmetric two dimensional linear discriminant analysis (2DLDA) , 2009, CVPR.

[8]  A. Akbari,et al.  Optimized linear discriminant analysis for extracting robust speech features , 2008, 2008 3rd International Symposium on Communications, Control and Signal Processing.

[9]  Ahmad Akbari,et al.  Genetic programming based optimization of class-dependent PCA for extracting robust MFCC , 2008, INTERSPEECH.

[10]  Jozef Juhar,et al.  Linear Feature Transformations in Slovak Phoneme-Based Continuous Speech Recognition , 2012 .