Low-latency speaker spotting with online diarization and detection
暂无分享,去创建一个
Nicholas W. D. Evans | Sébastien Marcel | Guillaume Wisniewski | Jose Patino | Héctor Delgado | Claude Barras | Hervé Bredin | Ruiqing Yin | Alain Komaty
[1] Georg Heigold,et al. End-to-end text-dependent speaker verification , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Alvin F. Martin,et al. Report on Performance Results in the NIST 2010 Speaker Recognition Evaluation , 2011, INTERSPEECH.
[3] Nicholas W. D. Evans,et al. Semi-supervised On-line Speaker Diarization for Meeting Data with Incremental Maximum A-posteriori Adaptation , 2016, Odyssey.
[4] John H. L. Hansen,et al. Speaker Recognition by Machines and Humans: A tutorial review , 2015, IEEE Signal Processing Magazine.
[5] Shoei Sato,et al. Low-latency speaker diarization based on Bayesian information criterion with multiple phoneme classes , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Daniel Garcia-Romero,et al. Speaker diarization with plda i-vector scoring and unsupervised calibration , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[8] Hervé Bredin,et al. TristouNet: Triplet loss for speaker turn embedding , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Douglas E. Sturim,et al. Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.
[10] Bin Ma,et al. Text-dependent speaker verification: Classifiers, databases and RSR2015 , 2014, Speech Commun..
[11] Haizhou Li,et al. An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..
[12] Petr Fousek,et al. Developing On-Line Speaker Diarization System , 2017, INTERSPEECH.
[13] Erik McDermott,et al. Deep neural networks for small footprint text-dependent speaker verification , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] Jean Carletta,et al. Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus , 2007, Lang. Resour. Evaluation.
[15] Guillaume Wisniewski,et al. Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization , 2017, INTERSPEECH.
[16] Koichi Shinoda,et al. Online speaker clustering using incremental learning of an ergodic hidden Markov model , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[17] Luis Javier Rodríguez-Fuentes,et al. Low-latency online speaker tracking on the AMI Corpus of meeting conversations , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[18] Mohammad Hossein Moattar,et al. Variational conditional random fields for online speaker detection and tracking , 2012, Speech Commun..
[19] Daben Liu,et al. Online speaker clustering , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[20] Hervé Bredin,et al. pyannote.metrics: A Toolkit for Reproducible Evaluation, Diagnostic, and Error Analysis of Speaker Diarization Systems , 2017, INTERSPEECH.
[21] Patrick Kenny,et al. Joint Factor Analysis Versus Eigenchannels in Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Vincent M. Stanford,et al. The 2021 NIST Speaker Recognition Evaluation , 2022, Odyssey.
[23] Bin Ma,et al. Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home , 2011, INTERSPEECH.
[24] Lie Lu,et al. Unsupervised speaker segmentation and tracking in real-time audio content analysis , 2005, Multimedia Systems.
[25] Jean-Luc Gauvain,et al. Spoken Language Identification Using LSTM-Based Angular Proximity , 2017, INTERSPEECH.
[26] Jason W. Pelecanos,et al. Online speaker diarization using adapted i-vector transforms , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Driss Matrouf,et al. Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification , 2012, INTERSPEECH.
[28] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[29] Nicholas W. D. Evans,et al. Speaker Diarization: A Review of Recent Research , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Claude Barras,et al. Speaker Change Detection in Broadcast TV Using Bidirectional Long Short-Term Memory Networks , 2017, INTERSPEECH.
[31] Aaron Lawson,et al. The 2016 Speakers in the Wild Speaker Recognition Evaluation , 2016, INTERSPEECH.
[32] Alvin F. Martin,et al. NIST Speaker Recognition Evaluation Chronicles - Part 2 , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.