Paper information - Cross-lingual acoustic modeling for Indian languages based on Subspace Gaussian Mixture Models

Cross-lingual acoustic modeling for Indian languages based on Subspace Gaussian Mixture Models

We investigate cross-lingual acoustic modeling with Subspace Gaussian Mixture Models (SGMMs) for low-resource languages of Indian origin. The focus is on building an acoustic model for a low-resource language with a limited vocabulary by leveraging resources from another language with comparatively larger resources. Experiments were conducted on the Bengali and Tamil corpora of the MANDI database, with Tamil having more resources than Bengali. We observed that the word accuracy of the cross-lingual acoustic model for Bengali was approximately 2.5% above that of its CDHMM model and equivalent to that of its monolingual SGMM model.

Neethu Mariam Joy | Srinivasan Umesh | Basil Abraham | K. Navneeth
