Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model
暂无分享,去创建一个
Lukás Burget | Themos Stafylakis | Niko Brümmer | Anna Silnova | L. Burget | Niko Brümmer | Themos Stafylakis | Anna Silnova | N. Brümmer
[1] The NIST Year 2010 Speaker Recognition Evaluation Plan 1 I NTRODUCTION , 2022 .
[2] L. Burget,et al. Promoting robustness for speaker modeling in the community: the PRISM evaluation set , 2011 .
[3] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Lukás Burget,et al. Language Recognition in iVectors Space , 2011, INTERSPEECH.
[5] Joon Son Chung,et al. VoxCeleb: A Large-Scale Speaker Identification Dataset , 2017, INTERSPEECH.
[6] Themos Stafylakis,et al. Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition , 2016, Odyssey.
[7] Alan McCree,et al. Subspace-constrained supervector PLDA for speaker verification , 2013, INTERSPEECH.
[8] Pietro Laface,et al. On the use of i–vector posterior distributions in Probabilistic Linear Discriminant Analysis , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[9] Sanjeev Khudanpur,et al. Spoken Language Recognition using X-vectors , 2018, Odyssey.
[10] Michel Loève,et al. Probability Theory I , 1977 .
[11] Lukás Burget,et al. Discriminatively trained Probabilistic Linear Discriminant Analysis for speaker verification , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Mitchell McLaren,et al. How to train your speaker embeddings extractor , 2018, Odyssey.
[13] Niko Brümmer,et al. A comparison of linear and non-linear calibrations for speaker recognition , 2014, Odyssey.
[14] R. Baierlein. Probability Theory: The Logic of Science , 2004 .
[15] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[16] Sanjeev Khudanpur,et al. Deep neural network-based speaker embeddings for end-to-end speaker verification , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[17] David A. van Leeuwen,et al. Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006 , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[18] Hervé Bredin,et al. TristouNet: Triplet loss for speaker turn embedding , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Pietro Laface,et al. I-vector transformation and scaling for PLDA based speaker recognition , 2016, Odyssey.
[20] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Sanjeev Khudanpur,et al. Deep Neural Network Embeddings for Text-Independent Speaker Verification , 2017, INTERSPEECH.
[22] Themos Stafylakis,et al. Text-dependent speaker recognition using PLDA with uncertainty propagation , 2013, INTERSPEECH.
[23] Themos Stafylakis,et al. PLDA for speaker verification with utterances of arbitrary duration , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[24] Patrick Kenny,et al. Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification , 2009, INTERSPEECH.
[25] Georg Heigold,et al. End-to-end text-dependent speaker verification , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Patrick Kenny,et al. Bayesian Speaker Verification with Heavy-Tailed Priors , 2010, Odyssey.
[27] Niko Brümmer,et al. Tied Variational Autoencoder Backends for i-Vector Speaker Recognition , 2017, INTERSPEECH.
[28] Florin Curelaru,et al. Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).
[29] Jean-Luc Gauvain,et al. Spoken Language Identification Using LSTM-Based Angular Proximity , 2017, INTERSPEECH.
[30] Pietro Laface,et al. Pairwise Discriminative Speaker Verification in the ${\rm I}$-Vector Space , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[31] Xiao Liu,et al. Deep Speaker: an End-to-End Neural Speaker Embedding System , 2017, ArXiv.
[32] Niko Brümmer,et al. The speaker partitioning problem , 2010, Odyssey.
[33] Chunlei Zhang,et al. End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances , 2017, INTERSPEECH.
[34] Daniel Garcia-Romero,et al. Analysis of i-vector Length Normalization in Speaker Recognition Systems , 2011, INTERSPEECH.
[35] Patrick Kenny,et al. Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms , 2006 .
[36] H. Teicher,et al. Probability theory: Independence, interchangeability, martingales , 1978 .
[37] Niko Brümmer,et al. End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions , 2018, Odyssey.