Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition
暂无分享,去创建一个
Najim Dehak | Jesús Villalba | Phani Sankar Nidadavolu | Vicente A. Iglesias | J. Villalba | P. S. Nidadavolu | N. Dehak | Vicente A. Iglesias
[1] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.
[2] Niko Brümmer,et al. The speaker partitioning problem , 2010, Odyssey.
[3] Chin-Hui Lee,et al. A deep neural network approach to speech bandwidth expansion , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Chin-Hui Lee,et al. DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech , 2015, INTERSPEECH.
[5] Enhong Chen,et al. An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).
[6] Joon Son Chung,et al. VoxCeleb: A Large-Scale Speaker Identification Dataset , 2017, INTERSPEECH.
[7] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[8] Yifan Gong,et al. Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[9] Najim Dehak,et al. Investigation on Bandwidth Extension for Speaker Recognition , 2018, INTERSPEECH.
[10] Niko Brümmer. AGNITIO ’ s Speaker Recognition System for EVALITA 2009 , 2009 .
[11] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[12] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).