Dissimilarity measures for hidden Markov models and their application in multilingual speech recognition
暂无分享,去创建一个
[1] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .
[2] Dirk Van Compernolle. Recognizing speech of goats, wolves, sheep and ... non-natives , 2001, Speech Commun..
[3] Mark J. F. Gales,et al. Generalised linear Gaussian models , 2001 .
[4] Steve Young,et al. Token passing: a simple conceptual model for connected speech recognition systems , 1989 .
[5] E. L. Harder,et al. The Institute of Electrical and Electronics Engineers, Inc. , 2019, 2019 IEEE International Conference on Software Architecture Companion (ICSA-C).
[6] Tanja Schultz,et al. Experiments on cross-language acoustic modeling , 2001, INTERSPEECH.
[7] R. Näätänen,et al. Psychobiology: Speech sounds learned by sleeping newborns , 2002, Nature.
[8] William P. Ziemer,et al. Modern Real Analysis , 1994 .
[9] William J. Byrne,et al. Towards language independent acoustic modeling , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[10] Imre Kiss,et al. Speaker- and language-independent speech recognition in mobile communication systems , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[11] Joachim Köhler,et al. Erstellung einer statistisch modellierten multilingualen Lautbibliothek für die Spracherkennung , 2000 .
[12] A Lewis,et al. THE SCIENCE OF SOUND , 1997 .
[13] Mark J. F. Gales,et al. The generation and use of regression class trees for MLLR adaptation , 1996 .
[14] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[15] Andrej Zgank,et al. Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering , 2001, INTERSPEECH.
[16] Esko Turunen. Survey of Theory and Applications of Łukasiewicz-Pavelka Fuzzy Logic , 2001 .
[17] Olli Viikki,et al. Cepstral domain segmental feature vector normalization for noise robust speech recognition , 1998, Speech Commun..
[18] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[19] Jean-Claude Junqua. Robust Speech Recognition in Embedded Systems and PC Applications , 2000 .
[20] David A. van Leeuwen,et al. Multilingual large vocabulary speech recognition: the European SQALE project , 1997, Comput. Speech Lang..
[21] V. Rich. Personal communication , 1989, Nature.
[22] Steve J. Young,et al. The HTK tied-state continuous speech recogniser , 1993, EUROSPEECH.
[23] Martine Adda-Decker. Towards multilingual interoperability in automatic speech recognition , 2001, Speech Commun..
[24] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.
[25] Matti Vihola,et al. Two dissimilarity measures for HMMS and their application in phoneme model clustering , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[26] Bojan Imperl,et al. The clustering algorithm for the definition of multilingual set of context dependent speech models , 1999, EUROSPEECH.
[27] Markus Falkhausen,et al. Calculation of distance measures between hidden Markov models , 1995, EUROSPEECH.
[28] Daniel P. W. Ellis,et al. Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .
[29] K. Laurila. Robust speech recognition methods for voice dialing , 2000 .
[30] Frederick Jelinek,et al. Statistical methods for speech recognition , 1997 .
[31] Pascale Fung,et al. MAP-based cross-language adaptation augmented by linguistic knowledge: from English to Chinese , 1999, EUROSPEECH.
[32] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..
[33] Lin-Shan Lee,et al. Voice dictation of Mandarin Chinese , 1997, IEEE Signal Process. Mag..
[34] Solomon Kullback,et al. Information Theory and Statistics , 1970, The Mathematical Gazette.
[35] Tanja Schultz,et al. Polyphone decision tree specialization for language adaptation , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[36] Philip C. Woodland,et al. Speaker adaptation of HMMs using linear regression , 1994 .
[37] D. Horga. HANDBOOK OF THE INTERNATIONAL PHONETIC ASSOCIATION. A GUIDE TO THE USE OF THE INTERNATIONAL PHONETIC ALPHABET Cambridge: Cambridge University Press (1999), (204 stranice) , 1999 .
[38] Reinhold Häb-Umbach. Automatic generation of phonetic regression class trees for MLLR adaptation , 2001, IEEE Trans. Speech Audio Process..
[39] Chao Huang,et al. Large vocabulary Mandarin speech recognition with different approaches in modeling tones , 2000, INTERSPEECH.
[40] Paul Dalsgaard,et al. On the use of data-driven clustering technique for identification of poly- and mono-phonemes for four European languages , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[41] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[42] B. Hofmann-Wellenhof,et al. Introduction to spectral analysis , 1986 .
[43] A. Waibel,et al. Multilinguality in speech and spoken language systems , 2000, Proceedings of the IEEE.
[44] Mark J. F. Gales,et al. Tail distribution modelling using the richter and power exponential distributions , 1999, EUROSPEECH.
[45] Joachim Köhler. Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks , 1998, ICASSP.
[46] P. Woodland,et al. Flexible speaker adaptation using maximum likelihood linear regression , 1995 .
[47] Mark J. F. Gales,et al. Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..
[48] Sadaoki Furui,et al. Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..
[49] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[50] L. R. Rabiner,et al. A probabilistic distance measure for hidden Markov models , 1985, AT&T Technical Journal.
[51] Ronald A. Cole,et al. The OGI multi-language telephone speech corpus , 1992, ICSLP.
[52] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..
[53] John H. L. Hansen,et al. Discrete-Time Processing of Speech Signals , 1993 .
[54] Ulla Uebler,et al. Multilingual speech recognition in seven languages , 2001, Speech Commun..
[55] Sheldon M. Ross,et al. Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.
[56] E. Vajda. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .
[57] Imre Kiss. On speech recognition in mobile communications , 2001 .
[58] Y.K. Muthusamy,et al. Reviewing automatic language identification , 1994, IEEE Signal Processing Magazine.
[59] Alex Waibel,et al. The GlobalPhone Project: Multilingual LVCSR with JANUS-3 , 1997 .
[60] Sadaoki Furui. Flexible speech recognition , 1995, EUROSPEECH.
[61] Joachim Köhler. Multilingual phone models for vocabulary-independent speech recognition tasks , 2001, Speech Commun..
[62] B. Ripley,et al. Pattern Recognition , 1968, Nature.
[63] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .