Transfer Learning with Bottleneck Feature Networks for Whispered Speech Recognition
暂无分享,去创建一个
[1] Kazuya Takeda,et al. Acoustic analysis and recognition of whispered speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[2] Dong Yu,et al. Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.
[3] Ian McGraw,et al. Personalized speech recognition on mobile devices , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Haizhou Li,et al. MASS: A Malay language LVCSR corpus resource , 2009, 2009 Oriental COCOSDA International Conference on Speech Database and Assessments.
[5] D. T. Grozdic,et al. Application of neural networks in whispered speech recognition , 2012, 2012 20th Telecommunications Forum (TELFOR).
[6] Boon Pang Lim,et al. Computational differences between whispered and non-whispered speech , 2011 .
[7] Florian Metze,et al. Extracting deep bottleneck features using stacked auto-encoders , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[9] Yifan Gong,et al. Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[10] Bin Ma,et al. A whispered Mandarin corpus for speech technology applications , 2014, INTERSPEECH.
[11] John H. L. Hansen,et al. Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] John H. L. Hansen,et al. UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Jan Cernocký,et al. Probabilistic and Bottle-Neck Features for LVCSR of Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[14] S. Jovicic,et al. Acoustic analysis of consonants in whispered speech. , 2008, Journal of voice : official journal of the Voice Foundation.
[15] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .
[16] Hynek Hermansky,et al. Temporal patterns (TRAPs) in ASR of noisy speech , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[17] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[18] David Talkin,et al. A Robust Algorithm for Pitch Tracking ( RAPT ) , 2005 .
[19] Kazuya Takeda,et al. Analysis and recognition of whispered speech , 2005, Speech Commun..
[20] John H. L. Hansen,et al. Model and feature based compensation for whispered speech recognition , 2014, INTERSPEECH.
[21] Yu Zhang,et al. Extracting deep neural network bottleneck features using low-rank matrix factorization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Mark A. Clements,et al. Enhancement and recognition of whispered speech , 2003 .
[23] Yu Zhang,et al. Language ID-based training of multilingual stacked bottleneck features , 2014, INTERSPEECH.
[24] Martin Karafiát,et al. Adaptation of multilingual stacked bottle-neck neural network structure for new language , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).