Integer linear programming for speaker diarization and cross-modal identification in TV broadcast
[1] Douglas A. Reynolds, et al. A study of new approaches to speaker diarization, 2009, INTERSPEECH.
[2] M. E. J. Newman, et al. Modularity and community structure in networks, 2006, Proceedings of the National Academy of Sciences of the United States of America.
[3] Anil K. Jain, et al. Data clustering: a review, 1999, CSUR.
[4] Nicholas W. D. Evans, et al. Speaker Diarization: A Review of Recent Research, 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[5] Sylvain Meignier, et al. Automatic named identification of speakers using diarization and ASR systems, 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[6] Jordi Luque, et al. On the use of agglomerative and spectral clustering in speaker diarization of meetings, 2012, Odyssey.
[7] Douglas A. Reynolds, et al. An overview of automatic speaker diarization systems, 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[8] Gwenn Englebienne, et al. Multimodal Speaker Diarization, 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[9] H. Hermansky, et al. Perceptual linear predictive (PLP) analysis of speech, 1990, The Journal of the Acoustical Society of America.
[10] A. Land, et al. An Automatic Method for Solving Discrete Programming Problems, 1960, 50 Years of Integer Programming.
[11] Jean-Luc Gauvain, et al. Multistage speaker diarization of broadcast news, 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[12] Olivier Galibert, et al. The REPERE Corpus: a multimodal corpus for person recognition, 2012, LREC.
[13] Christopher D. Manning, et al. Enforcing Transitivity in Coreference Resolution, 2008, ACL.
[14] Michael I. Jordan, et al. On Spectral Clustering: Analysis and an algorithm, 2001, NIPS.
[15] Rainer Stiefelhagen, et al. “Knock! Knock! Who is it?” probabilistic person identification in TV-series, 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[16] Francisco Javier, et al. On the use of agglomerative and spectral clustering in speaker diarization of meetings, 2012.
[17] Georges Quénot, et al. From Text Detection in Videos to Person Identification, 2012, 2012 IEEE International Conference on Multimedia and Expo.
[18] Mickael Rouvier, et al. I-vectors and ILP clustering adapted to cross-show speaker diarization, 2012, INTERSPEECH.
[19] Tomi Kinnunen, et al. INTERSPEECH 2013 14th Annual Conference of the International Speech Communication Association, 2013, Interspeech 2015.
[20] James R. Glass, et al. On the Use of Spectral and Iterative Methods for Speaker Diarization, 2012, INTERSPEECH.
[21] Patrick Kenny, et al. Front-End Factor Analysis for Speaker Verification, 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Dong Wang, et al. A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization, 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[23] Georges Quénot, et al. Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast, 2012, INTERSPEECH.
[24] Thomas S. Huang, et al. A spectral clustering approach to speaker diarization, 2006, INTERSPEECH.
[25] R. Smith, et al. An Overview of the Tesseract OCR Engine, 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).
[26] S. Chen, et al. Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion, 1998.