Measuring and Predicting the Perceived Quality of Music and Speech Subjected to Combined Linear and Nonlinear Distortion

The results of experiments in which subjects rated the perceived quality of speech and music that had been subjected to various forms of both linear and nonlinear distortion are reported. Experiment 1 made use of artificial distortions (such as ripples in frequency response combined with peak clipping). Experiment 2 included both artificial distortions and real distortions introduced by transducers. The results were compared with the predictions of a new model based on a weighted sum of predictions for linear distortion alone and for nonlinear distortion alone. There was a very good correspondence between the obtained and predicted ratings. Correlations were greater than 0.85 for speech stimuli and 0.90 for music stimuli. It is concluded that the new model can predict accurately the perceived quality of speech and music subjected to combined linear and nonlinear distortion.

[1]  E. Poulton Models for biases in judging sensory magnitude. , 1979, Psychological bulletin.

[2]  Roy D. Patterson,et al.  Auditory preprocessing and recognition of speech , 1989 .

[3]  Brian C J Moore,et al.  Interference effects and phase sensitivity in hearing , 2002, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[4]  R. Plomp,et al.  Effect of phase on the timbre of complex tones. , 1969, The Journal of the Acoustical Society of America.

[5]  Jont B. Allen,et al.  Short term spectral analysis, synthesis, and modification by discrete Fourier transform , 1977 .

[6]  B. Moore,et al.  Perceived naturalness of spectrally distorted speech and music. , 2003, The Journal of the Acoustical Society of America.

[7]  Brian C. J. Moore,et al.  The Effect of Nonlinear Distortion on the Perceived Quality of Music and Speech Signals , 2003 .

[8]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[9]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[10]  R. Patterson,et al.  The deterioration of hearing with age: frequency selectivity, the critical ratio, the audiogram, and speech threshold. , 1982, The Journal of the Acoustical Society of America.

[11]  Alexander Voishvillo,et al.  Multitone testing of sound system components: Some results and conclusions, Part 1: History and theory , 2001 .

[12]  Brian C. J. Moore,et al.  Development and Validation of a Method for Predicting the Perceived Naturalness of Sounds Subjected to Spectral Distortion , 2004 .

[13]  Brian C. J. Moore,et al.  Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns , 1987, Hearing Research.

[14]  B. Moore,et al.  A Model of Loudness Applicable to Time-Varying Sounds , 2002 .

[15]  R. Patterson,et al.  Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. , 1995, The Journal of the Acoustical Society of America.

[16]  B. Moore An introduction to the psychology of hearing, 3rd ed. , 1989 .

[17]  Brian C. J. Moore,et al.  Predicting the Perceived Quality of Nonlinearly Distorted Music and Speech Signals , 2004 .

[18]  Alexander Terekhov,et al.  Multitone Testing of Sound System Components'Some Results and Conclusions, Part 2: Modeling and Application , 2001 .

[19]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[20]  R. M. Sachs,et al.  Anthropometric manikin for acoustic research. , 1975, The Journal of the Acoustical Society of America.