Acoustic model transformations based on random projections

This paper proposes a novel acoustic model transformation method for speech recognition based on random projections. Random projections have been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. Moreover, as we are able to produce various random matrices, it may be possible to find a transform matrix that is superior to conventional transformation matrices among random matrices. In our previous work, a random-projection-based feature combination technique has been proposed but had a high computational cost. In order to deal with this cost, in this paper, we introduce random projections on the acoustic model domain, where linear transformations are applied to an acoustic model using random matrices. Its effectiveness is confirmed by word recognition experiments on noisy speech.

[1]  Sudipto Guha,et al.  Dynamic multidimensional histograms , 2002, SIGMOD '02.

[2]  Sanjoy Dasgupta,et al.  Experiments with Random Projection , 2000, UAI.

[3]  Seiichi Nakagawa,et al.  Evaluation of segmental unit input HMM , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[4]  Satoshi Nakamura,et al.  CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments , 2006, IEICE Trans. Inf. Syst..

[5]  Jonathan G. Fiscus,et al.  REDUCED WORD ERROR RATES , 1997 .

[6]  George Bebis,et al.  Face recognition experiments with random projection , 2005, SPIE Defense + Commercial Sensing.

[7]  Santosh S. Vempala,et al.  An algorithmic theory of learning: Robust concepts and random projection , 1999, Machine Learning.

[8]  David Gelbart,et al.  Improving word accuracy with Gabor feature extraction , 2002, INTERSPEECH.

[9]  Jeff A. Bilmes,et al.  Evaluation of random-projection-based feature combination on speech recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Carla E. Brodley,et al.  Random Projection for High Dimensional Data Clustering: A Cluster Ensemble Approach , 2003, ICML.