A Basic Experiment on Time-Varying Speech Quality

We present a general formulation of a basic open question regarding the perception of time-varying speech quality. We then describe the design, implementation, conduct, and analysis of a practical experiment that addresses a small but fundamental part of that open question. In this experiment, listeners rate the overall speech quality of single-sentence stimuli that contain two different levels of nominal speech quality and two transitions between these levels. We present several results, including those related to human integration of speech quality and the recency effect. Finally, we discuss these results and suggest potential additional work that might build upon them.
