Parametric Coding of Stereo Audio

Parametric-stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus a small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psychoacoustical principles. The monaural signal can be encoded using any (conventional) audio coder. Experiments show that the parameterized description of spatial properties enables a highly efficient, high-quality stereo audio representation.

[1]  A. Zeiberg,et al.  Lateralization of complex binaural stimuli: a weighted-image model. , 1988, The Journal of the Acoustical Society of America.

[2]  A. Mills Lateralization of High‐Frequency Tones , 1960 .

[3]  H S Colburn,et al.  Interaural correlation sensitivity. , 1998, The Journal of the Acoustical Society of America.

[4]  S van de Par,et al.  The contribution of static and dynamically varying ITDs and IIDs to binaural detection. , 1999, The Journal of the Acoustical Society of America.

[5]  M A Fernandes,et al.  The role of monaural frequency selectivity in binaural analysis. , 1984, The Journal of the Acoustical Society of America.

[6]  Jeroen Breebaart,et al.  Low Complexity Parametric Stereo Coding , 2004 .

[7]  W. Gaik,et al.  Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[8]  Jeroen Breebaart,et al.  High-quality Parametric Spatial Audio Coding at Low Bitrates , 2004 .

[9]  R H Wilson,et al.  Effects of signal duration on the 500-Hz masking-level difference. , 1986, Scandinavian audiology.

[10]  Torsten Marquardt,et al.  Detection of static and dynamic changes in interaural correlation. , 2002, The Journal of the Acoustical Society of America.

[11]  Manfred R. Schroeder,et al.  Synthesis of low-peak-factor signals and binary sequences with low autocorrelation (Corresp.) , 1970, IEEE Trans. Inf. Theory.

[12]  N. Durlach,et al.  Interaural time and amplitude jnds for a 500-Hz tone. , 1969, The Journal of the Acoustical Society of America.

[13]  AG Armin Kohlrausch,et al.  Discrimination of different temporal envelope structures of diotic and dichotic target signals within diotic wide-band noise , 2005 .

[14]  B Kollmeier,et al.  Auditory filter bandwidths in binaural and monaural listening conditions. , 1992, The Journal of the Acoustical Society of America.

[15]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise. , 1977, The Journal of the Acoustical Society of America.

[16]  E R Hafter,et al.  Masking-level differences obtained with a pulsed tonal masker. , 1970, The Journal of the Acoustical Society of America.

[17]  J. D. Johnston,et al.  Sum-difference stereo transform coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  B Kollmeier,et al.  Binaural forward and backward masking: evidence for sluggishness in binaural detection. , 1990, The Journal of the Acoustical Society of America.

[19]  N. Durlach Equalization and Cancellation Theory of Binaural Masking‐Level Differences , 1963 .

[20]  Matti Karjalainen,et al.  Analyzing Virtual Sound Source Attributes Using a Binaural Auditory Model , 1999 .

[21]  Stanley P. Lipshitz,et al.  Stereo Microphone Techniques: Are the Purists Wrong? , 1985 .

[22]  Christof Faller,et al.  Binaural cue coding: a novel and efficient representation of spatial audio , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[23]  Heiko Purnhagen,et al.  A Closer Look into MPEG-4 High Efficiency AAC , 2003 .

[24]  H. Gaskell The precedence effect , 1983, Hearing Research.

[25]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[26]  P M Zurek,et al.  The precedence effect and its possible role in the avoidance of interaural ambiguities. , 1980, The Journal of the Acoustical Society of America.

[27]  David M. Green,et al.  Signal‐Detection Analysis of Equalization and Cancellation Model , 1966 .

[28]  Jeroen Breebaart,et al.  ADVANCES IN PARAMETRIC CODING FOR HIGH-QUALITY AUDIO , 2003 .

[29]  A Kohlrausch Auditory filter shape derived from binaural masking experiments. , 1988, The Journal of the Acoustical Society of America.

[30]  C Trahiotis,et al.  The normalized correlation: accounting for binaural detection across center frequency. , 1996, The Journal of the Acoustical Society of America.

[31]  Christof Faller,et al.  Binaural cue coding-Part II: Schemes and applications , 2003, IEEE Trans. Speech Audio Process..

[32]  D W Grantham,et al.  Interaural intensity discrimination: insensitivity at 1000 Hz. , 1984, The Journal of the Acoustical Society of America.

[33]  J. Zwislocki,et al.  Just Noticeable Differences in Dichotic Phase , 1956 .

[34]  C Trahiotis,et al.  The effects of signal duration on NoSo and NoS pi thresholds at 500 Hz and 4 kHz. , 1999, The Journal of the Acoustical Society of America.

[35]  W A Yost,et al.  Discriminations of interaural phase differences. , 1974, The Journal of the Acoustical Society of America.

[36]  Christof Faller,et al.  Estimation of auditory spatial cues for Binaural Cue Coding , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[37]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[38]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[39]  L. A. Jeffress,et al.  Differences of Interaural Phase and Level in Detection and Lateralization: 250 Hz , 1971 .

[40]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[41]  A. Kohlrausch,et al.  Binaural processing model based on contralateral inhibition. I. Model structure. , 2001, The Journal of the Acoustical Society of America.

[42]  William A. Yost,et al.  Tone-on-tone binaural masking with an antiphasic masker , 1974 .

[43]  Ajm Adrian Houtsma,et al.  Bit rate reduction and binaural masking release in digital coding of stereo sound , 1996 .

[44]  Ray Meddis,et al.  Across frequency integration in a model of lateralization , 1992 .

[45]  Christof Faller,et al.  Binaural Cue Coding Applied to Stereo and Multi-Channel Audio Compression , 2002 .

[46]  R H Wilson,et al.  Influence of signal duration on the masking-level difference. , 1987, Journal of speech and hearing research.

[47]  Christof Faller,et al.  Design and Evaluation of Binaural Cue Coding Schemes , 2002 .

[48]  J.D. Johnston,et al.  A study of why cross channel prediction is not applicable to perceptual audio coding , 2001, IEEE Signal Processing Letters.

[49]  Christof Faller,et al.  Efficient representation of spatial audio using perceptual parametrization , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[50]  Eberhard Zwicker,et al.  Binaural masking-level difference as a function of masker and test-signal duration , 1984, Hearing Research.

[51]  H S Colburn,et al.  Interaural correlation discrimination: i. bandwidth and level dependence. , 1981, The Journal of the Acoustical Society of America.

[52]  Ajm Adrian Houtsma,et al.  Further bit rate reduction through binaural processing , 1996 .

[53]  B Kollmeier,et al.  Binaural and monaural auditory filter bandwidths and time constants in probe tone detection experiments. , 1998, The Journal of the Acoustical Society of America.

[54]  W. Yost Weber’s fraction for the intensity of pure tones presented binaurally , 1972 .

[55]  Ronaldus Maria Aarts,et al.  Two-to-Five Channel Sound Processing * , 2002 .

[56]  Kristofer Kjörling,et al.  Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[57]  L. A. Jeffress,et al.  Effect of Varying the Interaural Noise Correlation on the Detectability of Tonal Signals , 1963 .

[58]  Christof Faller,et al.  Binaural cue coding-Part I: psychoacoustic fundamentals and design principles , 2003, IEEE Trans. Speech Audio Process..

[59]  C Trahiotis,et al.  Binaural detection as a function of interaural correlation and bandwidth of masking noise: implications for estimates of spectral resolution. , 1998, The Journal of the Acoustical Society of America.

[60]  Gerhard Stoll,et al.  ISO-MPEG-1 Audio: A Generic Standard for Coding of High-: Quality Digital Audio , 1994 .

[61]  Joseph W. Hall,et al.  NoSo and NoS pi thresholds as a function of masker level for narrow-band and wideband masking noise. , 1984, The Journal of the Acoustical Society of America.

[62]  C Trahiotis,et al.  The effects of randomizing values of interaural disparities on binaural detection and on discrimination of interaural correlation. , 1997, The Journal of the Acoustical Society of America.

[63]  Werner Oomen,et al.  Parametric Coding for High-Quality Audio , 2002 .

[64]  L. A. Jeffress,et al.  Differences of interaural phase and level in detection and lateralization: 250 Hz. , 1971, The Journal of the Acoustical Society of America.

[65]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[66]  Raymond N. J. Veldhuis,et al.  Subband coding of stereophonic digital audio signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[67]  L. A. Jeffress,et al.  Effect of Noise Crosscorrelation on Binaural‐Signal Detection , 1962 .

[68]  Tilman Liebchen,et al.  Lossless Audio Coding Using Adaptive Multichannel Prediction , 2002 .

[69]  R. G. Klumpp,et al.  Some Measurements of Interaural Time Difference Thresholds , 1956 .

[70]  Jürgen Herre,et al.  Intensity Stereo Coding , 1994 .

[71]  P M Zurek,et al.  Adjustment and discrimination measurements of the precedence effect. , 1993, The Journal of the Acoustical Society of America.

[72]  William A. Yost Lateral position of sinusoids presented with interaural intensive and temporal differences , 1981 .

[73]  C Trahiotis,et al.  Discrimination of interaural envelope correlation and its relation to binaural unmasking at high frequencies. , 1992, The Journal of the Acoustical Society of America.

[74]  Karlheinz Brandenburg,et al.  MP3 and AAC Explained , 1999 .

[75]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[76]  Richard M. Stern,et al.  Lateralization and detection of low‐frequency binaural stimuli: Effects of distribution of internal delay , 1996 .

[77]  Heiko Purnhagen LOW COMPLEXITY PARAMETRIC STEREO CODING IN MPEG-4 , 2004 .

[78]  Christof Faller,et al.  Why Binaural Cue Coding is Better than Intensity Stereo Coding , 2002 .

[79]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[80]  J V Tobias,et al.  Interaural intensity difference limen. , 1965, Journal of speech and hearing research.

[81]  S van de Par,et al.  A new approach to comparing binaural masking level differences at low and high frequencies. , 1997, The Journal of the Acoustical Society of America.

[82]  Heiko Purnhagen,et al.  Synthetic Ambience in Parametric Stereo Coding , 2004 .