An enquiring system of unknown words in TV news by spontaneous repetition (application of speaker normalization by speaker subspace projection)
We have constructed a system that lets a user enquire about unknown words heard in TV news speech by spontaneously repeating them. For example, on hearing "Japan would join the PKO" in the TV news, a user who does not know the word "PKO" can ask "What is the PKO?" The system recognizes the word "PKO" and explains its meaning. To do this, the system estimates the section common to the announcer's speech and the user's speech, and recognizes the word corresponding to that common section. We address the problem of speaker differences in extracting the common section by speaker subspace projection.
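The abstract does not give the details of the projection or of the matching step, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes feature frames are rows of a NumPy array, that the subspace is taken from the top singular directions of the uncentered frames, and that the common section is found by a plain sliding-window distance. The function names `subspace_basis`, `project`, and `find_common_section`, and the toy data, are all hypothetical.

```python
import numpy as np

def subspace_basis(frames, k):
    """Orthonormal basis (dim x k) for the top-k directions of the frames.

    Assumption: the subspace is estimated from the uncentered data matrix.
    """
    # Rows of vt are right singular vectors, ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(frames, full_matrices=False)
    return vt[:k].T

def project(frames, basis):
    """Orthogonal projection of each frame (row) onto span(basis)."""
    return frames @ basis @ basis.T

def find_common_section(long_frames, short_frames):
    """Start index of the window of long_frames closest to short_frames."""
    m = len(short_frames)
    costs = [np.linalg.norm(long_frames[s:s + m] - short_frames)
             for s in range(len(long_frames) - m + 1)]
    return int(np.argmin(costs))

# Toy data (hypothetical): announcer frames lie in a 3-dimensional subspace
# of a 6-dimensional feature space; the user repeats frames 12..19 with an
# added component in a direction orthogonal to that subspace.
rng = np.random.default_rng(0)
q, _ = np.linalg.qr(rng.normal(size=(6, 6)))   # columns form an orthonormal set
shared, offset_dir = q[:, :3], q[:, 3]
news = rng.normal(size=(40, 3)) @ shared.T     # announcer frames, shape (40, 6)
user = news[12:20] + 2.0 * offset_dir          # user frames, shape (8, 6)

basis = subspace_basis(news, k=3)
start = find_common_section(project(news, basis), project(user, basis))
print(start)
```

In this toy setting the projection removes the added component exactly, because it lies outside the subspace by construction. Real speaker differences are not this clean, and a practical system would likely need time alignment (for example, dynamic time warping) rather than a fixed-length window.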