Computational models for predicting sound quality

The quality of an audio device, such as a microphone, amplifier, or headphone, depends on how accurately the device transmits the properties of the sound source to the ear(s) of the listener. Two types of "distortion" can occur in this transmission: (1) "linear" distortion, which may be described as a deviation of the frequency response from the "target" response; (2) nonlinear distortion, which is characterised by frequency components in the output of the device that were not present in the input. These two forms of distortion have different perceptual effects. Their effects on sound quality can be predicted using a model of auditory processing with the following stages: (1) a filter to take into account the transmission of sound from the device to the ear of the listener; (2) a filter to simulate the effects of transmission through the middle ear; (3) an array of bandpass filters to simulate the auditory filters that exist in the cochlea of the inner ear. For predicting the perceptual effects of linear distortion, a model operating in the frequency domain can be used. For predicting the perceptual effects of nonlinear distortion, a model operating in the time domain is required, since the detailed waveforms at the outputs of the auditory filters need to be considered. The models described have been shown to give accurate predictions for a wide range of "artificial" and "real" linear and nonlinear distortions.

Brian Moore is Emeritus Professor of Auditory Perception in the University of Cambridge. His research interests are: the perception of sound in normal and impaired hearing; design of signal processing hearing aids for sensorineural hearing loss; methods for fitting hearing aids to the individual; perception of music and of musical instruments.
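
As a rough illustration of stage (3) of the model outlined in the abstract, the following Python sketch builds an array of auditory filters and uses it in the frequency domain to compare an undistorted spectrum with a linearly distorted one. It is a minimal sketch under stated assumptions, not the published model: it assumes the widely used Glasberg and Moore (1990) formula for the equivalent rectangular bandwidth (ERB) and a symmetric, level-independent rounded-exponential (roex) filter shape, and it omits the device-to-ear and middle-ear filters of stages (1) and (2). All function names are hypothetical and chosen only for this example.

```python
import math


def erb_hz(fc_hz):
    """ERB (Hz) of the auditory filter centred at fc_hz.

    Assumes the Glasberg and Moore (1990) formula, with frequency in kHz.
    """
    return 24.7 * (4.37 * fc_hz / 1000.0 + 1.0)


def erb_number(f_hz):
    """Map frequency in Hz to the ERB-number scale."""
    return 21.4 * math.log10(4.37 * f_hz / 1000.0 + 1.0)


def erb_number_to_hz(erb_n):
    """Inverse of erb_number: ERB-number back to frequency in Hz."""
    return (10.0 ** (erb_n / 21.4) - 1.0) / 4.37 * 1000.0


def centre_frequencies(f_lo_hz, f_hi_hz, step=1.0):
    """Centre frequencies spaced uniformly on the ERB-number scale."""
    lo, hi = erb_number(f_lo_hz), erb_number(f_hi_hz)
    count = int(math.floor((hi - lo) / step)) + 1
    return [erb_number_to_hz(lo + i * step) for i in range(count)]


def roex_weight(f_hz, fc_hz):
    """Power weighting of a symmetric roex(p) filter (an assumed shape).

    The slope parameter p is set so that the filter's ERB equals erb_hz(fc_hz).
    """
    p = 4.0 * fc_hz / erb_hz(fc_hz)
    g = abs(f_hz - fc_hz) / fc_hz  # normalised deviation from the centre
    return (1.0 + p * g) * math.exp(-p * g)


def excitation_pattern(spectrum, centres):
    """Power at the output of each auditory filter.

    spectrum is a list of (frequency_hz, power) pairs in linear power units.
    """
    return [
        sum(power * roex_weight(f, fc) for f, power in spectrum)
        for fc in centres
    ]


def excitation_difference_db(reference, distorted, centres):
    """Per-filter level difference (dB) between two spectra."""
    ref = excitation_pattern(reference, centres)
    dis = excitation_pattern(distorted, centres)
    return [10.0 * math.log10(d / r) for d, r in zip(dis, ref)]
```

A hypothetical use would be to pass a flat reference spectrum and a copy with, say, extra gain above 2 kHz to `excitation_difference_db`, then summarise how far the per-filter differences depart from zero. How such differences map onto perceived quality is not specified here. For nonlinear distortion the abstract states that a time-domain model is required, which this frequency-domain sketch does not attempt.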
