Multi-Task Learning and Weighted Cross-Entropy for DNN-Based Keyword Spotting
暂无分享,去创建一个
Arindam Mandal | Spyridon Matsoukas | Björn Hoffmeister | Sankaran Panchapagesan | Ming Sun | Shiv Vitaladevuni | Aparna Khare | S. Vitaladevuni | Arindam Mandal | Ming Sun | Björn Hoffmeister | S. Matsoukas | S. Panchapagesan | Aparna Khare
[1] L. G. Miller,et al. Improvements and applications for key word recognition using hidden Markov modeling techniques , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[2] Jinyu Li,et al. Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks. , 2013, ICLR 2013.
[3] M. Picheny,et al. Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences , 2017 .
[4] Jun Du,et al. Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Vincent Vanhoucke,et al. Improving the speed of neural networks on CPUs , 2011 .
[6] Geoffrey E. Hinton,et al. Distilling the Knowledge in a Neural Network , 2015, ArXiv.
[7] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[8] Peter Bell,et al. Cross-lingual adaptation with multi-task adaptive networks , 2014, INTERSPEECH.
[9] Rich Caruana,et al. Multitask Learning , 1997, Machine-mediated learning.
[10] Richard M. Schwartz,et al. The 2013 BBN Vietnamese telephone speech keyword spotting system , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Richard Rose,et al. A hidden Markov model based keyword recognition system , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[12] Jürgen Schmidhuber,et al. An Application of Recurrent Neural Networks to Discriminative Keyword Spotting , 2007, ICANN.
[13] Georg Heigold,et al. Multilingual acoustic models using distributed deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[14] Siddika Parlak,et al. Spoken term detection for Turkish Broadcast News , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[15] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[16] Nikko Strom,et al. Scalable distributed DNN training using commodity GPU cloud computing , 2015, INTERSPEECH.
[17] Sankaran Panchapagesan,et al. Model Compression Applied to Small-Footprint Keyword Spotting , 2016, INTERSPEECH.
[18] Chin-Hui Lee,et al. Automatic recognition of keywords in unconstrained speech using hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..
[19] Jasha Droppo,et al. Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[20] Paris Smaragdis,et al. Deep learning for monaural speech separation , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Georg Heigold,et al. Small-footprint keyword spotting using deep neural networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Jasha Droppo,et al. Multi-task learning in deep neural networks for improved phoneme recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[23] Herbert Gish,et al. Rapid and accurate spoken term detection , 2007, INTERSPEECH.
[24] Peter Bell,et al. Regularization of context-dependent deep neural networks with context-independent multi-task training , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[25] Björn Hoffmeister,et al. Model Shrinking for Embedded Keyword Spotting , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).
[26] Mitch Weintraub. Improved Keyword-Spotting Using Sri's Decipher(Tm) Large-Vocabuarly Speech-Recognition System , 1993, HLT.