Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes
暂无分享,去创建一个
Thomas Fang Zheng | Dong Wang | Lantian Li | Chenhao Zhang | Dong Wang | T. Zheng | Lantian Li | Chenhao Zhang
[1] Haizhou Li,et al. An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..
[2] Wang Dong. Multi-Layer Channel Normalization for Frequency-Dynamic Feature Extraction , 2003 .
[3] S. J. Young,et al. Tree-based state tying for high accuracy acoustic modelling , 1994 .
[4] Tanja Schultz,et al. Language-independent and language-adaptive acoustic modeling for speech recognition , 2001, Speech Commun..
[5] Sridha Sridharan,et al. Making Confident Speaker Verification Decisions With Minimal Speech , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[6] Yun Lei,et al. A novel scheme for speaker recognition using a phonetically-aware deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Jeff A. Bilmes,et al. A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .
[8] Patrick Kenny,et al. Joint Factor Analysis Versus Eigenchannels in Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[9] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[10] Sridha Sridharan,et al. i-vector Based Speaker Recognition on Short Utterances , 2011, INTERSPEECH.
[11] A. Colomé,et al. Lexical Activation in Bilinguals' Speech Production: Language-Specific or Language-Independent? , 2001 .
[12] Thomas Fang Zheng,et al. A K-phoneme-class based multi-model method for short utterance speaker recognition , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.
[13] Bin Ma,et al. The RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases , 2012, Interspeech 2012.
[14] Douglas A. Reynolds,et al. Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..
[15] Patrick Kenny,et al. An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech , 2010, Odyssey.
[16] Toby Berger,et al. Efficient text-independent speaker verification with structural Gaussian mixture models and neural network , 2003, IEEE Trans. Speech Audio Process..
[17] Florin Curelaru,et al. Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).
[18] Douglas A. Reynolds,et al. A Tutorial on Text-Independent Speaker Verification , 2004, EURASIP J. Adv. Signal Process..
[19] XIONG Zhenyu,et al. An Automatic Prompting Texts Selecting Algorithm for di-IFs Balanced Speech Corpus , 2003 .
[20] S. Furui,et al. Cepstral analysis technique for automatic speaker verification , 1981 .
[21] Xiaojun Wu,et al. A Universal Phoneme-Set Based Language Independent Short Utterance Speaker Recognition , 2011 .
[22] Mark J. F. Gales,et al. Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..
[23] Man-Wai Mak,et al. A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[24] Thomas Fang Zheng,et al. Improved context-dependent acoustic modeling for continuous Chinese speech recognition , 2001, INTERSPEECH.
[25] Thomas Fang Zheng,et al. Deep Speaker Vectors for Semi Text-independent Speaker Verification , 2015, ArXiv.
[26] A. Hall. Methods for demonstrating Resemblance in Taxonomy and Ecology , 1967, Nature.
[27] Biing-Hwang Juang,et al. The use of cohort normalized scores for speaker verification , 1992, ICSLP.
[28] Erik McDermott,et al. Deep neural networks for small footprint text-dependent speaker verification , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] E. Vajda. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .
[30] Sridha Sridharan,et al. Factor analysis modelling for speaker verification with short utterances , 2008, Odyssey.
[31] Alvin F. Martin,et al. The DET curve in assessment of detection task performance , 1997, EUROSPEECH.
[32] Thomas Fang Zheng,et al. A Multi-Model Method for Short-Utterance Speaker Recognition , 2011 .
[33] Figen Ertaş,et al. FUNDAMENTALS OF SPEAKER RECOGNITION , 2011 .
[34] Paul Taylor,et al. Automatically clustering similar units for unit selection in speech synthesis , 1997, EUROSPEECH.
[35] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[36] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..
[37] Eliathamby Ambikairajah,et al. A segment selection technique for speaker verification , 2010, Speech Commun..
[38] Andreas Stolcke,et al. Within-class covariance normalization for SVM-based speaker recognition , 2006, INTERSPEECH.
[39] William M. Campbell,et al. Support vector machines for speaker and language recognition , 2006, Comput. Speech Lang..
[40] James H. Elder,et al. Probabilistic Linear Discriminant Analysis for Inferences About Identity , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[41] Themos Stafylakis,et al. Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition , 2014, Odyssey.
[42] Andreas Stolcke,et al. Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[43] Vincent M. Stanford,et al. The 2021 NIST Speaker Recognition Evaluation , 2022, Odyssey.
[44] Mohamed Kamal Omar,et al. Training Universal Background Models for Speaker Recognition , 2010, Odyssey.
[45] Simon Dobrisek,et al. Acoustical modelling of phone transitions: biphones and diphones - what are the differences? , 1999, EUROSPEECH.
[46] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.
[47] Dong Yu,et al. Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..
[48] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..
[49] Charles Elkan,et al. Expectation Maximization Algorithm , 2010, Encyclopedia of Machine Learning.