Dissimilarity measures for hidden Markov models and their application in multilingual speech recognition

Matti Vihola

[1] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences , 1980 .

[2] Dirk Van Compernolle. Recognizing speech of goats, wolves, sheep and ... non-natives , 2001, Speech Commun..

[3] Mark J. F. Gales,et al. Generalised linear Gaussian models , 2001 .

[4] Steve Young,et al. Token passing: a simple conceptual model for connected speech recognition systems , 1989 .

[5] E. L. Harder,et al. The Institute of Electrical and Electronics Engineers, Inc. , 2019, 2019 IEEE International Conference on Software Architecture Companion (ICSA-C).

[6] Tanja Schultz,et al. Experiments on cross-language acoustic modeling , 2001, INTERSPEECH.

[7] R. Näätänen,et al. Psychobiology: Speech sounds learned by sleeping newborns , 2002, Nature.

[8] William P. Ziemer,et al. Modern Real Analysis , 1994 .

[9] William J. Byrne,et al. Towards language independent acoustic modeling , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[10] Imre Kiss,et al. Speaker- and language-independent speech recognition in mobile communication systems , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[11] Joachim Köhler,et al. Erstellung einer statistisch modellierten multilingualen Lautbibliothek für die Spracherkennung , 2000 .

[12] A Lewis,et al. THE SCIENCE OF SOUND , 1997 .

[13] Mark J. F. Gales,et al. The generation and use of regression class trees for MLLR adaptation , 1996 .

[14] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[15] Andrej Zgank,et al. Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering , 2001, INTERSPEECH.

[16] Esko Turunen. Survey of Theory and Applications of Łukasiewicz-Pavelka Fuzzy Logic , 2001 .

[17] Olli Viikki,et al. Cepstral domain segmental feature vector normalization for noise robust speech recognition , 1998, Speech Commun..

[18] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[19] Jean-Claude Junqua. Robust Speech Recognition in Embedded Systems and PC Applications , 2000 .

[20] David A. van Leeuwen,et al. Multilingual large vocabulary speech recognition: the European SQALE project , 1997, Comput. Speech Lang..

[21] V. Rich. Personal communication , 1989, Nature.

[22] Steve J. Young,et al. The HTK tied-state continuous speech recogniser , 1993, EUROSPEECH.

[23] Martine Adda-Decker. Towards multilingual interoperability in automatic speech recognition , 2001, Speech Commun..

[24] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[25] Matti Vihola,et al. Two dissimilarity measures for HMMs and their application in phoneme model clustering , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[26] Bojan Imperl,et al. The clustering algorithm for the definition of multilingual set of context dependent speech models , 1999, EUROSPEECH.

[27] Markus Falkhausen,et al. Calculation of distance measures between hidden Markov models , 1995, EUROSPEECH.

[28] Daniel P. W. Ellis,et al. Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[29] K. Laurila. Robust speech recognition methods for voice dialing , 2000 .

[30] Frederick Jelinek,et al. Statistical methods for speech recognition , 1997 .

[31] Pascale Fung,et al. MAP-based cross-language adaptation augmented by linguistic knowledge: from English to Chinese , 1999, EUROSPEECH.

[32] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[33] Lin-Shan Lee,et al. Voice dictation of Mandarin Chinese , 1997, IEEE Signal Process. Mag..

[34] Solomon Kullback,et al. Information Theory and Statistics , 1970, The Mathematical Gazette.

[35] Tanja Schultz,et al. Polyphone decision tree specialization for language adaptation , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[36] Philip C. Woodland,et al. Speaker adaptation of HMMs using linear regression , 1994 .

[37] D. Horga. HANDBOOK OF THE INTERNATIONAL PHONETIC ASSOCIATION. A GUIDE TO THE USE OF THE INTERNATIONAL PHONETIC ALPHABET Cambridge: Cambridge University Press (1999), (204 stranice) , 1999 .

[38] Reinhold Häb-Umbach. Automatic generation of phonetic regression class trees for MLLR adaptation , 2001, IEEE Trans. Speech Audio Process..

[39] Chao Huang,et al. Large vocabulary Mandarin speech recognition with different approaches in modeling tones , 2000, INTERSPEECH.

[40] Paul Dalsgaard,et al. On the use of data-driven clustering technique for identification of poly- and mono-phonemes for four European languages , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[41] D. Rubin,et al. Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper , 1977 .

[42] B. Hofmann-Wellenhof,et al. Introduction to spectral analysis , 1986 .

[43] A. Waibel,et al. Multilinguality in speech and spoken language systems , 2000, Proceedings of the IEEE.

[44] Mark J. F. Gales,et al. Tail distribution modelling using the Richter and power exponential distributions , 1999, EUROSPEECH.

[45] Joachim Köhler. Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks , 1998, ICASSP.

[46] P. Woodland,et al. Flexible speaker adaptation using maximum likelihood linear regression , 1995 .

[47] Mark J. F. Gales,et al. Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..

[48] Sadaoki Furui,et al. Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[49] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[50] L. R. Rabiner,et al. A probabilistic distance measure for hidden Markov models , 1985, AT&T Technical Journal.

[51] Ronald A. Cole,et al. The OGI multi-language telephone speech corpus , 1992, ICSLP.

[52] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[53] John H. L. Hansen,et al. Discrete-Time Processing of Speech Signals , 1993 .

[54] Ulla Uebler,et al. Multilingual speech recognition in seven languages , 2001, Speech Commun..

[55] Sheldon M. Ross,et al. Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.

[56] E. Vajda. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .

[57] Imre Kiss. On speech recognition in mobile communications , 2001 .

[58] Y.K. Muthusamy,et al. Reviewing automatic language identification , 1994, IEEE Signal Processing Magazine.

[59] Alex Waibel,et al. The GlobalPhone Project: Multilingual LVCSR with JANUS-3 , 1997 .

[60] Sadaoki Furui. Flexible speech recognition , 1995, EUROSPEECH.

[61] Joachim Köhler. Multilingual phone models for vocabulary-independent speech recognition tasks , 2001, Speech Commun..

[62] B. Ripley,et al. Pattern Recognition , 1968, Nature.

[63] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .