Speech Quality Evaluation: A New Application of Digital Watermarking

Speech quality evaluation is an important research topic. The traditional way for speech quality evaluation is using subjective tests. They are reliable, but very expensive, time consuming, and cannot be used in certain applications such as online monitoring. Objective models, based on human perception, were developed to predict the results of subjective tests. The existing objective methods require either the original speech or complicated computation model, which makes some applications of quality evaluation impossible. In this paper, we propose a novel speech quality evaluation method using digital watermarking. Our algorithm evaluates the speech quality without the need of reference speech or any computational model. The watermark is embedded in the discrete wavelet domain or temporal domain of a speech signal by using quantization technique. This algorithm can evaluate perceptual quality of speech that is distorted by Gaussian noise, MP3 compression, low-pass filtering, and packet loss. The experimental results show that the method yields accurate quality scores which are very close to the results of the perceptual evaluation of speech quality

[1]  David L. Neuhoff,et al.  Quantization , 2022, IEEE Trans. Inf. Theory.

[2]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Doh-Suk Kim,et al.  Perceptual model for non-intrusive speech quality assessment , 2004, ICASSP.

[4]  Ronghui Tu Semi-fragile digital audio watermarking , 2003 .

[5]  Jiying Zhao,et al.  Speech Quality Evaluation: A New Application of Digital Watermarking , 2005, 2005 IEEE Instrumentationand Measurement Technology Conference Proceedings.

[6]  Adam Wolisz,et al.  A Perceptual Quality Model for Adaptive VoIP Applications , 2004 .

[7]  Tiago H. Falk,et al.  Objective Speech Quality Assessment Using Gaussian Mixture Models , 2004 .

[8]  Rafik A. Goubran,et al.  Speech quality prediction in VoIP using the extended E-model , 2003, GLOBECOM '03. IEEE Global Telecommunications Conference (IEEE Cat. No.03CH37489).

[9]  Libin Cai,et al.  Audio quality measurement by using digital watermarking , 2004, Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513).

[10]  Antony W. Rix,et al.  Perceptual speech quality assessment - a review , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Gregory W. Wornell,et al.  Dither modulation: a new approach to digital watermarking and information embedding , 1999, Electronic Imaging.

[12]  Tiago H. Falk,et al.  Non-intrusive GMM-based speech quality measurement , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[13]  Hossam Afifi,et al.  Audio quality assessment in packet networks: an "inter-subjective" neural network model , 2001, Proceedings 15th International Conference on Information Networking.

[14]  W. J. Tam,et al.  Image quality measurement by using digital watermarking , 2003, The 2nd IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, 2003. HAVE 2003. Proceedings..

[15]  Rafik Goubran,et al.  Assessment of effects of packet loss on speech quality in VoIP , 2003, The 2nd IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, 2003. HAVE 2003. Proceedings..

[16]  Jiying Zhao,et al.  A novel semi-fragile audio watermarking scheme , 2003, The 2nd IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, 2003. HAVE 2003. Proceedings..