A statistical model for robust integration of narrowband cues in speech
暂无分享,去创建一个
[1] Kenneth Steiglitz,et al. Neural networks for voiced/unvoiced speech classification , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[2] G. A. Miller,et al. An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .
[3] Harvey b. Fletcher,et al. Speech and hearing in communication , 1953 .
[4] M. D. Wang,et al. Consonant confusions in noise: a study of perceptual features. , 1973, The Journal of the Acoustical Society of America.
[5] Misha Pavel,et al. Towards ASR on partially corrupted speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[6] Hervé Bourlard,et al. A mew ASR approach based on independent processing and recombination of partial frequency bands , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[7] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[8] Richard Lippmann,et al. Speech recognition by machines and humans , 1997, Speech Commun..
[9] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[10] Hynek Hermansky,et al. Sub-band based recognition of noisy speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[11] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..
[12] Partha Niyogi,et al. Incorporating voice onset time to improve letter recognition accuracies , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[13] Sara H. Basson,et al. NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[14] Joseph W. Hall,et al. Detection in noise by spectro-temporal pattern analysis. , 1984, The Journal of the Acoustical Society of America.
[15] Simon King,et al. Detection of phonological features in continuous speech using neural networks , 2000, Comput. Speech Lang..
[16] Carol Y. Espy-Wilson,et al. A feature‐based semivowel recognition system , 1994 .
[17] Leslie S. SmithCCCN. A Neurally Motivated Technique for Voicing Detection andF 0 Estimation for Speech , 1996 .
[18] Katrin Kirchhoff,et al. Robust speech recognition using articulatory information , 1998 .
[19] Leslie S. Smith. A Noise-Robust Auditory Modelin Front End for Voiced Speech , 1997, ICANN.
[20] Hervé Bourlard,et al. Subband-based speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[21] Richard F. Lyon,et al. A perceptual pitch detector , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[22] G. Kramer. Auditory Scene Analysis: The Perceptual Organization of Sound by Albert Bregman (review) , 2016 .
[23] John N. Holmes. Robust measurement of fundamental frequency and degree of voicing , 1998, ICSLP.
[24] Judea Pearl,et al. Probabilistic reasoning in intelligent systems , 1988 .
[25] Li Deng,et al. Speech recognition using the atomic speech units constructed from overlapping articulatory features , 1994, EUROSPEECH.
[26] Partha Niyogi,et al. Distinctive feature detection using support vector machines , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[27] Tomás Lozano-Pérez,et al. A Framework for Multiple-Instance Learning , 1997, NIPS.
[28] B. Moore,et al. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.
[29] Nikki Mirghafori,et al. Sooner or later: exploring asynchrony in multi-band speech recognition , 1999, EUROSPEECH.
[30] Stephen T. Neely,et al. Signals, Sound, and Sensation , 1997 .
[31] Jan Van der Spiegel,et al. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).
[32] Malcolm Slaney,et al. An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank , 1997 .
[33] Yoshitaka Nakajima,et al. Auditory Scene Analysis: The Perceptual Organization of Sound Albert S. Bregman , 1992 .
[34] Carlos D. Brody,et al. Computing with Action Potentials , 1997, NIPS.
[35] Sharlene A. Liu,et al. Landmark detection for distinctive feature-based speech recognition , 1996 .
[36] Wolfgang Hess,et al. Pitch Determination of Speech Signals: Algorithms and Devices , 1983 .
[37] Noam Chomsky,et al. The Sound Pattern of English , 1968 .
[38] Francis Jack Smith,et al. Union: A new approach for combining sub-band observations for noisy speech recognition , 2001, Speech Commun..
[39] E. Owens,et al. An Introduction to the Psychology of Hearing , 1997 .
[40] George H. Freeman,et al. An HMM‐based speech recognizer using overlapping articulatory features , 1996 .
[41] Phil D. Green,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001, Speech Commun..
[42] Jont B. Allen,et al. How do humans process and recognize speech? , 1993, IEEE Trans. Speech Audio Process..