Feature Enhancement with Deep Feature Losses for Speaker Verification
暂无分享,去创建一个
Nanxin Chen | Jes'us Villalba | Najim Dehak | Phani Sankar Nidadavolu | Saurabh Kataria | Paola Garc'ia | Najim Dehak | J. Villalba | Saurabh Kataria | Nanxin Chen | P. S. Nidadavolu | Paola Garc'ia | N. Dehak | Paola García
[1] Alejandrina Cristia,et al. HomeBank: An Online Repository of Daylong Child-Centered Audio Recordings , 2016, Seminars in Speech and Language.
[2] Xin Wang,et al. Speaker detection in the wild: Lessons learned from JSALT 2019 , 2019, Odyssey.
[3] Najim Dehak,et al. Cycle-GANs for Domain Adaptation of Acoustic Features for Speaker Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Alan McCree,et al. The JHU-MIT System Description for NIST SRE18 , 2019 .
[5] Colleen Richey,et al. The VOiCES from a Distance Challenge 2019 Evaluation Plan , 2019, ArXiv.
[6] Hao Tang,et al. VoiceID Loss: Speech Enhancement for Speaker Verification , 2019, INTERSPEECH.
[7] Joon Son Chung,et al. VoxCeleb2: Deep Speaker Recognition , 2018, INTERSPEECH.
[8] Vladlen Koltun,et al. Speech Denoising with Deep Feature Losses , 2018, INTERSPEECH.
[9] Sanjeev Khudanpur,et al. Multi-PLDA Diarization on Children's Speech , 2019, INTERSPEECH.
[10] Zheng-Hua Tan,et al. Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification , 2017, INTERSPEECH.
[11] Sanjeev Khudanpur,et al. The JHU Speaker Recognition System for the VOiCES 2019 Challenge , 2019, INTERSPEECH.
[12] Antonio Bonafonte,et al. SEGAN: Speech Enhancement Generative Adversarial Network , 2017, INTERSPEECH.
[13] Quoc V. Le,et al. Swish: a Self-Gated Activation Function , 2017, 1710.05941.
[14] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.
[15] Daniel Povey,et al. MUSAN: A Music, Speech, and Noise Corpus , 2015, ArXiv.
[16] Alan McCree,et al. State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations , 2020, Comput. Speech Lang..
[17] L. Paola García-Perera,et al. Unsupervised Feature Enhancement for Speaker Verification , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Jes'us Villalba,et al. Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-Gans , 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[19] Aaron Lawson,et al. The Speakers in the Wild (SITW) Speaker Recognition Database , 2016, INTERSPEECH.
[20] Nobutaka Ito,et al. The Diverse Environments Multi-channel Acoustic Noise Database (DEMAND): A database of multichannel environmental noise recordings , 2013 .
[21] Richard M. Stern,et al. Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis , 2008, INTERSPEECH.
[22] Antoine Deleforge,et al. Hearing in a shoe-box: Binaural source position and wall absorption estimation using virtually supervised learning , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).