Context and Uncertainty Modeling for Online Speaker Change Detection
暂无分享,去创建一个
[1] Hervé Bourlard,et al. Robust speaker change detection , 2004, IEEE Signal Processing Letters.
[2] Hagai Aronowitz. Trainable speaker diarization , 2007, INTERSPEECH.
[3] Jodi Kearns,et al. LibriVox: Free Public Domain Audiobooks , 2014 .
[4] Marek Hrúz,et al. Convolutional Neural Network for speaker change detection in telephone speaker diarization system , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] James H. Elder,et al. Probabilistic Linear Discriminant Analysis for Inferences About Identity , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[6] Sanjeev Khudanpur,et al. X-Vectors: Robust DNN Embeddings for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] David E. Reynolds,et al. Automatic segmentation , 1986 .
[8] Quan Wang,et al. Speaker Diarization with LSTM , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Alan McCree,et al. Speaker diarization using deep neural network embeddings , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Sanjeev Khudanpur,et al. Deep Neural Network Embeddings for Text-Independent Speaker Verification , 2017, INTERSPEECH.
[11] Andreas Stolcke,et al. Artificial neural network features for speaker diarization , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[12] Jason W. Pelecanos,et al. Online speaker diarization using adapted i-vector transforms , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] M. A. Siegler,et al. Automatic Segmentation, Classification and Clustering of Broadcast News Audio , 1997 .
[14] Claude Barras,et al. Speaker Change Detection in Broadcast TV Using Bidirectional Long Short-Term Memory Networks , 2017, INTERSPEECH.
[15] Michael Picheny,et al. Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Petr Fousek,et al. Developing On-Line Speaker Diarization System , 2017, INTERSPEECH.
[17] Hervé Bredin,et al. TristouNet: Triplet loss for speaker turn embedding , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Sree Hari Krishnan Parthasarathi,et al. Speaker change detection with privacy-preserving audio cues , 2009, ICMI-MLMI '09.
[19] Hagai Aronowitz,et al. Online two speaker diarization , 2012, Odyssey.
[20] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .
[21] Joon Son Chung,et al. VoxCeleb: A Large-Scale Speaker Identification Dataset , 2017, INTERSPEECH.
[22] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Andreas Stolcke,et al. Within-class covariance normalization for SVM-based speaker recognition , 2006, INTERSPEECH.
[24] S. Chen,et al. Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion , 1998 .
[25] Themos Stafylakis,et al. PLDA for speaker verification with utterances of arbitrary duration , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[26] Delphine Charlet,et al. Speaker Tracking by Anchor Models Using Speaker Segment Cluster Information , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[27] Shinji Watanabe,et al. Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge , 2018, INTERSPEECH.
[28] Jean-Pierre Martens,et al. Factor analysis for speaker segmentation and improved speaker diarization , 2015, INTERSPEECH.