Speech quality evaluation of artificial bandwidth extension: comparing subjective judgments and instrumental predictions

Artificial bandwidth extension (ABE) methods have been developed to enhance the quality and intelligibility of bandlimited speech transmitted over a telephone connection. Subjective listening tests are the most reliable way of evaluating the quality of ABE, but listening tests are time-consuming and expensive to arrange. Instrumental measures have also been used to estimate the subjective quality of ABE. This study extends the results of an earlier subjective evaluation of ABE methods by instrumental quality predictions computed with WB-PESQ (ITU-T Recommendation P.862.2) and POLQA (ITU-T Recommendation P.863). The instrumental quality predictions are compared with the subjective quality scores. The results indicate that POLQA correlates better with the subjective quality than WB-PESQ. Neither WB-PESQ nor POLQA can predict the rank order of the evaluated ABE methods in all conditions.

[1]  Martin Etnestad Johansen,et al.  Bandwidth Extension of Telephony Speech , 2009 .

[2]  Timothy B. Terriberry,et al.  Definition of the Opus Audio Codec , 2012, RFC.

[3]  Peter Jax,et al.  On artificial bandwidth extension of telephone speech , 2003, Signal Process..

[4]  Geun-Bae Song,et al.  A study of HMM-based bandwidth extension of speech signals , 2009, Signal Process..

[5]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[6]  Hong-Goo Kang,et al.  Speech Bandwidth Extension Using Temporal Envelope Modeling , 2008, IEEE Signal Processing Letters.

[7]  Anssi Rämö,et al.  Voice quality evaluation of various codecs , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Paavo Alku,et al.  Speech quality prediction for artificial bandwidth extension algorithms , 2013, INTERSPEECH.

[9]  Peter Vary,et al.  Artificial Bandwidth Extension of Wideband Speech by Pitch-Scaling of Higher Frequencies , 2013, GI-Jahrestagung.

[10]  Patrick Bauer,et al.  An HMM-based artificial bandwidth extension evaluated by cross-language training and test , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Engin Erzin,et al.  Artificial bandwidth extension of spectral envelope along a Viterbi path , 2013, Speech Commun..

[12]  Patrick Bauer,et al.  On speech quality assessment of artificial bandwidth extension , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Anssi Rämö,et al.  Voice Quality Characterization of IETF Opus Codec , 2011, INTERSPEECH.

[14]  Paavo Alku,et al.  Bandwidth Extension of Telephone Speech Using a Neural Network and a Filter Bank Implementation for Highband Mel Spectrum , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Paavo Alku,et al.  Subjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs , 2014, INTERSPEECH.

[16]  Peter Kabal,et al.  Memory-Based Approximation of the Gaussian Mixture Model Framework for Bandwidth Extension of Narrowband Speech , 2011, INTERSPEECH.

[17]  Bernd Geiser Beyond Wideband Telephony — Bandwidth Extension for Super-Wideband Speech , 2008 .

[18]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[19]  Paavo Alku,et al.  Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages , 2008, IEEE Transactions on Audio, Speech, and Language Processing.