Speaker Recognition for Multi-speaker Conversations Using X-vectors
暂无分享,去创建一个
Sanjeev Khudanpur | Alan McCree | Daniel Garcia-Romero | Daniel Povey | Gregory Sell | David Snyder | S. Khudanpur | Daniel Povey | D. Garcia-Romero | Gregory Sell | A. McCree | David Snyder
[1] Sanjeev Khudanpur,et al. Deep neural network-based speaker embeddings for end-to-end speaker verification , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[2] Alvin F. Martin,et al. Speaker recognition in a multi-speaker environment , 2001, INTERSPEECH.
[3] Daniel Garcia-Romero,et al. Speaker diarization with plda i-vector scoring and unsupervised calibration , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[4] Niko Brümmer,et al. The speaker partitioning problem , 2010, Odyssey.
[5] James R. Glass,et al. Exploiting Intra-Conversation Variability for Speaker Diarization , 2011, INTERSPEECH.
[6] Quan Wang,et al. Speaker Diarization with LSTM , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Alan McCree,et al. Speaker diarization using deep neural network embeddings , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Sanjeev Khudanpur,et al. Deep Neural Network Embeddings for Text-Independent Speaker Verification , 2017, INTERSPEECH.
[9] Lukás Burget,et al. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors , 2018, INTERSPEECH.
[10] Yi Liu,et al. Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge , 2016, INTERSPEECH.
[11] Sergey Ioffe,et al. Probabilistic Linear Discriminant Analysis , 2006, ECCV.
[12] Douglas A. Reynolds,et al. Diarization of Telephone Conversations Using Factor Analysis , 2010, IEEE Journal of Selected Topics in Signal Processing.
[13] Joon Son Chung,et al. VoxCeleb: A Large-Scale Speaker Identification Dataset , 2017, INTERSPEECH.
[14] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Lukás Burget,et al. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge , 2016, INTERSPEECH.
[16] Themos Stafylakis,et al. A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[17] Aaron Lawson,et al. The 2016 Speakers in the Wild Speaker Recognition Evaluation , 2016, INTERSPEECH.
[18] Larry P. Heck,et al. Robustness to telephone handset distortion in speaker recognition by discriminative feature design , 2000, Speech Commun..
[19] Ahmad Salman,et al. Learning Speaker-Specific Characteristics With a Deep Neural Architecture , 2011, IEEE Transactions on Neural Networks.
[20] Joon Son Chung,et al. VoxCeleb2: Deep Speaker Recognition , 2018, INTERSPEECH.
[21] Georg Heigold,et al. End-to-end text-dependent speaker verification , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Patrick Kenny,et al. Bayesian Speaker Verification with Heavy-Tailed Priors , 2010, Odyssey.
[23] Mireia Díez,et al. Speaker Diarization based on Bayesian HMM with Eigenvoice Priors , 2018, Odyssey.
[24] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[25] Mitchell McLaren,et al. How to train your speaker embeddings extractor , 2018, Odyssey.
[26] Florin Curelaru,et al. Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).
[27] James R. Glass,et al. On the Use of Spectral and Iterative Methods for Speaker Diarization , 2012, INTERSPEECH.
[28] Shinji Watanabe,et al. Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge , 2018, INTERSPEECH.