论文信息 - A Bayesian Predictive Method for Automatic Speech Segmentation

A Bayesian Predictive Method for Automatic Speech Segmentation

Implicit speech segmentation is basically to find time instances when the spectral distortion is large. Spectral variation function is a widely used measure of spectral distortion. However, SVF is a data-dependent measure. In order to make the measurement data-independent, a likelihood ratio is constructed to measure the spectral distortion. This ratio can be computed efficiently with a Bayesian predictive model. The prior of the Bayesian predictive model is estimated from unlabeled data via an unsupervised machine learning technique - Gaussian mixture model (GMM). The experimental results show that effectiveness of this novel method. The performance on TIMIT corpus indicates the potential applications in speech recognition, synthesis and coding

Ming Liu | Thomas S. Huang | Ming Liu | T. Huang

[1] P. Jusczyk,et al. Infants′ Detection of the Sound Patterns of Words in Fluent Speech , 1995, Cognitive Psychology.

[2] Jerry D. Gibson,et al. Speech analysis and segmentation by parametric filtering , 1996, IEEE Trans. Speech Audio Process..

[3] Maurizio Omologo,et al. Automatic segmentation and labeling of speech based on Hidden Markov Models , 1993, Speech Commun..

[4] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[5] Jan P. van Hemert,et al. Automatic segmentation of speech , 1991, IEEE Trans. Signal Process..

[6] John H. L. Hansen,et al. Automatic segmentation of speech recorded in unknown noisy channel characteristics , 1998, Speech Communication.

[7] Régine André-Obrecht,et al. A new statistical approach for the automatic segmentation of continuous speech signals , 1988, IEEE Trans. Acoust. Speech Signal Process..

[8] Rajesh M. Hegde,et al. Segmentation of speech into syllable-like units , 2003, INTERSPEECH.