Multiple linear transforms

Heteroscedastic discriminant analysis (HDA) has been proposed as a replacement for linear discriminant analysis (LDA) in speech recognition systems that use mixtures of diagonal covariance Gaussians to model the data. Typically HDA and LDA involve a dimension reduction of the feature space. A specific version HDA that involves no dimension reduction; and is popularly known as maximum likelihood linear transform (MLLT) is often used on the feature space to give significant improvements in performance. MLLT approximately diagonalizes the class covariances, and in effect, tries to approximate the performance of a full-covariance-system. However, the performance of a full-covariance system could in some cases be much better than using MLLT-based diagonal covariance system. We propose the method of multiple linear transforms, that bridges this gap in performance, while maintaining the speed efficiency of a diagonal covariance system. This technique improves the performance of a diagonal covariance system, over what could be obtained from HDA or MLLT.

[1]  N. Campbell CANONICAL VARIATE ANALYSIS—A GENERAL MODEL FORMULATION , 1984 .

[2]  David G. Stork,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[3]  G AndreouAndreas,et al.  Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition , 1998 .

[4]  Mark J. F. Gales,et al.  Factored Semi-Tied Covariance Matrices , 2000, NIPS.

[5]  Ramesh A. Gopinath,et al.  Maximum likelihood modeling with Gaussian distributions for classification , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Dirk Van Compernolle,et al.  Optimal feature sub-space selection based on discriminant analysis , 1999, EUROSPEECH.

[7]  George Saon,et al.  Maximum likelihood discriminant feature spaces , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[8]  Andreas G. Andreou,et al.  Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition , 1997 .

[9]  Andreas G. Andreou,et al.  Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition , 1998, Speech Commun..

[10]  Mark J. F. Gales,et al.  Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..

[11]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.