Output-based objective speech quality using vector quantization techniques

Output-based speech quality (OBQ) refers to an objective speech quality measure that uses only received speech without access to the input speech record. This paper proposes two new OBQ measures and evaluates their performance. Perceptual linear prediction (PLP) coefficients are used to provide speaker independence required by the objective measure. Two distortion measures are introduced for predicting speech quality based on vector quantization of the output speech record. These are the transition probability distance and the median minimum distance measure. The OBQ parameters are tested on four different speech datasets. The correlation is computed between subjective scores and the objective quality measures under a variety conditions, and the results indicate that the proposed algorithms are quite robust against speaker, text and distortion variation.

[1]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[2]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[3]  Robert F. Kubichek,et al.  Standards and technology issues in objective voice quality assessment , 1991, Digit. Signal Process..

[4]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[5]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .