IITG- Indigo Submissions for NIST 2018 Speaker Recognition Evaluation and Post-Challenge Improvements
暂无分享,去创建一个
[1] Daniel Garcia-Romero,et al. Speaker diarization with plda i-vector scoring and unsupervised calibration , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[2] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] Harsha Vardhan,et al. The Leap Speaker Recognition System for NIST SRE 2018 Challenge , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] S. R. Mahadeva Prasanna,et al. IITG-Indigo System for NIST 2016 SRE Challenge , 2017, INTERSPEECH.
[5] Niko Brümmer,et al. The BOSARIS Toolkit: Theory, Algorithms and Code for Surviving the New DCF , 2013, ArXiv.
[6] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[7] Pietro Laface,et al. Large-Scale Training of Pairwise Support Vector Machines for Speaker Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[8] Shrikanth S. Narayanan,et al. Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification , 2014, Comput. Speech Lang..
[9] Andreas Fischer,et al. Pairwise support vector machines and their application to large scale problems , 2012, J. Mach. Learn. Res..
[10] Aaron Lawson,et al. The Speakers in the Wild (SITW) Speaker Recognition Database , 2016, INTERSPEECH.
[11] Tomi Kinnunen,et al. A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[12] Steve Young,et al. The HTK hidden Markov model toolkit: design and philosophy , 1993 .
[13] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[14] Daniel Povey,et al. MUSAN: A Music, Speech, and Noise Corpus , 2015, ArXiv.
[15] Daniel Garcia-Romero,et al. Analysis of i-vector Length Normalization in Speaker Recognition Systems , 2011, INTERSPEECH.
[16] Sanjeev Khudanpur,et al. Deep neural network-based speaker embeddings for end-to-end speaker verification , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[17] Bryan Pardo,et al. REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[18] Douglas A. Reynolds,et al. The 2018 NIST Speaker Recognition Evaluation , 2019, INTERSPEECH.