Paper Information - Corpus Based Evaluation of Entropy Rate Speech Segmentation

Corpus Based Evaluation of Entropy Rate Speech Segmentation

The sequence of entropy rate estimates of the speech signal is investigated as a potential basis for speech segmentation. The rising and falling edges of the resulting entropy rate curve, together with its maxima and minima, are considered as candidates for segment boundaries. These prominent points are compared with the phonetic segment boundaries and with acoustic landmarks. The comparison uses the American TIMIT database and the German ‘Kiel Corpus of Read Speech’, both of which are manually phonetically labelled speech corpora.
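As a rough illustration of the candidate points named in the abstract, the sketch below picks out maxima, minima, and rising and falling edges from a curve that has already been computed. It is not the paper's method: the abstract does not specify the entropy rate estimator or how edges are located, so the input `curve` is simply a list of per-frame values, and the function names, the `min_slope` parameter, and the choice to place an edge at a local extremum of the frame-to-frame slope are all assumptions made for this example.

```python
def local_extrema(values):
    """Return (maxima, minima): index lists of strict local extrema."""
    maxima, minima = [], []
    for i in range(1, len(values) - 1):
        if values[i - 1] < values[i] > values[i + 1]:
            maxima.append(i)
        elif values[i - 1] > values[i] < values[i + 1]:
            minima.append(i)
    return maxima, minima


def candidate_boundaries(curve, min_slope=0.0):
    """Collect candidate boundary points from a per-frame curve.

    Assumption: an "edge" is placed where the frame-to-frame slope is
    locally steepest. Slope index i refers to the step from frame i to
    frame i + 1. `min_slope` is a hypothetical threshold that discards
    shallow edges.
    """
    maxima, minima = local_extrema(curve)
    slopes = [b - a for a, b in zip(curve, curve[1:])]
    steepest_up, steepest_down = local_extrema(slopes)
    rising = [i for i in steepest_up if slopes[i] > min_slope]
    falling = [i for i in steepest_down if slopes[i] < -min_slope]
    return {
        "maxima": maxima,
        "minima": minima,
        "rising_edges": rising,
        "falling_edges": falling,
    }


if __name__ == "__main__":
    # Toy curve standing in for a sequence of entropy rate estimates.
    toy_curve = [0, 1, 3, 4, 3, 1, 0, 1, 3, 4]
    print(candidate_boundaries(toy_curve))
```

To compare such candidates with labelled segment boundaries, the frame indices would still have to be converted to time using the analysis hop size, which the abstract does not state.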

Wolfgang Wokurek
