Partially Supervised Speaker Clustering
暂无分享,去创建一个
Thomas S. Huang | Mark Hasegawa-Johnson | Hao Tang | Stephen M. Chu | M. Hasegawa-Johnson | Hao Tang | T. Huang
[1] Gunnar Fant,et al. Acoustic Theory Of Speech Production , 1960 .
[2] Xiaofei He,et al. Locality Preserving Projections , 2003, NIPS.
[3] David G. Stork,et al. Pattern Classification , 1973 .
[4] Douglas A. Reynolds,et al. A study of new approaches to speaker diarization , 2009, INTERSPEECH.
[5] B. Scholkopf,et al. Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).
[6] Jean-Luc Gauvain,et al. Multistage speaker diarization of broadcast news , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Thomas S. Huang,et al. Fishervoice and semi-supervised speaker clustering , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] P. Somervuo,et al. Bayesian Analysis of Speaker Diarization with Eigenvoice Priors , 2008 .
[9] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .
[10] Jingqi Yuan,et al. Statistical monitoring of fed-batch process using dynamic multiway neighborhood preserving embedding , 2008 .
[11] José Manuel Pardo,et al. Robust Speaker Diarization for meetings , 2006 .
[12] Douglas A. Reynolds,et al. Blind clustering of speech utterances based on speaker and language characteristics , 1998, ICSLP.
[13] Douglas A. Reynolds,et al. Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..
[14] G. Schwarz. Estimating the Dimension of a Model , 1978 .
[15] Inderjit S. Dhillon,et al. Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.
[16] F. Kubala,et al. Automatic Speaker Clustering , 1997 .
[17] Hsin-Min Wang,et al. Speaker clustering of speech utterances using a voice characteristic reference space , 2004, INTERSPEECH.
[18] Shihong Lao,et al. Discriminant analysis in correlation similarity measure space , 2007, ICML '07.
[19] Shuicheng Yan,et al. Correlation Metric for Generalized Feature Extraction , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[20] Roland Kuhn,et al. Eigenvoices for speaker adaptation , 1998, ICSLP.
[21] Stephen Lin,et al. Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[22] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[23] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[24] Patrick Kenny,et al. Eigenvoice modeling with sparse training data , 2005, IEEE Transactions on Speech and Audio Processing.
[25] Herbert Gish,et al. Clustering speakers by their voices , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[26] Anil K. Jain,et al. Algorithms for Clustering Data , 1988 .
[27] Robert H. Gross,et al. Web Page Categorization and Feature Selection Using Association Rule and Principal Component Cluster , 1997 .
[28] Marijn Huijbregts,et al. The ICSI RT07s Speaker Diarization System , 2007, CLEAR.
[29] Shigeo Abe DrEng. Pattern Classification , 2001, Springer London.
[30] G. Ruske,et al. Robust speaker clustering in eigenspace , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..
[31] Douglas E. Sturim,et al. Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.
[32] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[33] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..
[34] Wei Xu,et al. Machine Learning for Multimedia Content Analysis , 2007 .
[35] Ponani S. Gopalakrishnan,et al. Clustering via the Bayesian information criterion with applications in speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[36] Douglas A. Reynolds,et al. An overview of automatic speaker diarization systems , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[37] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[38] Shuicheng Yan,et al. Neighborhood preserving embedding , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.
[39] Nicu Sebe,et al. Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.
[40] Paul A. Viola,et al. Alignment by Maximization of Mutual Information , 1997, International Journal of Computer Vision.
[41] Anil K. Jain. Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..
[42] Roland Kuhn,et al. Rapid speaker adaptation in eigenvoice space , 2000, IEEE Trans. Speech Audio Process..
[43] Frédéric Bimbot,et al. Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs , 2004, INTERSPEECH.
[44] David G. Stork,et al. Pattern Classification (2nd ed.) , 1999 .
[45] P. Mahalanobis. On the generalized distance in statistics , 1936 .
[46] Yi Liu,et al. Recent advances in the IBM GALE Mandarin transcription system , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.