论文信息 - Integrating online i-vector extractor with information bottleneck based speaker diarization system

Integrating online i-vector extractor with information bottleneck based speaker diarization system

Reference EPFL-CONF-209082 Related documents: http://publications.idiap.ch/index.php/publications/showcite/Madikeri_Idiap-RR-20-2015 Record created on 2015-06-19, modified on 2017-05-10

Petr Motlícek | Ivan Himawan | Srikanth R. Madikeri | Marc Ferras

[1] Petr Motlícek,et al. Combining SGMM speaker vectors and KL-HMM approach for speaker diarization , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2] Hervé Bourlard,et al. Filterbank slope based features for speaker diarization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3] Marijn Huijbregts,et al. The ICSI RT07s Speaker Diarization System , 2007, CLEAR.

[4] Jitendra Ajmera,et al. A robust speaker clustering algorithm , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[5] Deepu Vijayasenan,et al. An Information Theoretic Approach to Speaker Diarization of Meeting Recordings , 2010 .

[6] Naftali Tishby,et al. The Power of Word Clusters for Text Classification , 2006 .

[7] Fabio Valente,et al. An Information Theoretic Approach to Speaker Diarization of Meeting Data , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[8] James R. Glass,et al. Exploiting Intra-Conversation Variability for Speaker Diarization , 2011, INTERSPEECH.

[9] Themos Stafylakis,et al. I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10] Nicholas W. D. Evans,et al. Speaker Diarization: A Review of Recent Research , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[11] George Saon,et al. Speaker adaptation of neural network acoustic models using i-vectors , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[12] X. Anguera,et al. Speaker diarization for multi-party meetings using acoustic fusion , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[13] Florian Metze,et al. Towards speaker adaptive training of deep neural network acoustic models , 2014, INTERSPEECH.

[14] Fabio Valente,et al. DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings , 2012, INTERSPEECH.

[15] James R. Glass,et al. On the Use of Spectral and Iterative Methods for Speaker Diarization , 2012, INTERSPEECH.

[16] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[17] Lukás Burget,et al. Transcribing Meetings With the AMIDA Systems , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[18] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .

[19] James H. Elder,et al. Probabilistic Linear Discriminant Analysis for Inferences About Identity , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20] Fabio Valente,et al. Agglomerative information bottleneck for speaker diarization of meetings data , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[21] Xavier Anguera Miró,et al. Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences , 2006, INTERSPEECH.