Clustering-based two-dimensional linear discriminant analysis for speech recognition

In this paper, a new, Clustering-based Two-Dimensional Linear Discriminant Analysis (Clustering-based 2DLDA) method is proposed for extracting discriminant features in Automatic Speech Recognition (ASR). Based on Two-Dimensional Linear Discriminant Analysis (2DLDA), which works with data represented in matrix space and is adopted to extract discriminant information in a joint spectral-temporal domain, Clustering-based 2DLDA integrates the cluster information in each class by redefining the between-class scatter matrix to tackle the fact that many clusters exist in each state in Hidden Markov Model (HMM)-based ASR. The method was evaluated in the TiDigits connected-digit string recognition and the TIMIT continuous phoneme recognition. Experimental results show that 2DLDA yields a slight improvement on the recognition performance over classical LDA, and our proposed Clustering-based 2DLDA outperforms 2DLDA.

[1]  Hakan Erdogan,et al.  Weighted pairwise scatter to improve linear discriminant analysis , 2000, INTERSPEECH.

[2]  Hynek Hermansky,et al.  A study of two dimensional linear discriminants for ASR , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Xiao-Bing Li,et al.  Modified Linear Discriminant Analysis for Speech Recognition , 2007, 2007 Canadian Conference on Electrical and Computer Engineering.

[4]  Hermann Ney,et al.  Feature combination using linear discriminant analysis and its pitfalls , 2006, INTERSPEECH.

[5]  Hsiao-Wuen Hon,et al.  Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Andreas G. Andreou,et al.  Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition , 1998, Speech Commun..

[7]  Heinrich Niemann,et al.  Optimal linear feature transformations for semi-continuous hidden Markov models , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Jieping Ye,et al.  Two-Dimensional Linear Discriminant Analysis , 2004, NIPS.

[9]  Fabio Valente,et al.  Discriminant linear processing of time-frequency plane , 2006, INTERSPEECH.

[10]  Andrej Ljolje,et al.  Optimization of class weights for LDA feature transformations , 2006, INTERSPEECH.

[11]  H. Ney,et al.  Linear discriminant analysis for improved large vocabulary continuous speech recognition , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Hynek Hermansky,et al.  Spectral basis functions from discriminant analysis , 1998, ICSLP.