Combination of binaural and harmonic masking release effects in the detection of a single component in complex tones

ABSTRACT Both harmonic and binaural signal properties are relevant for auditory processing. To investigate how these cues combine in the auditory system, detection thresholds for an 800‐Hz tone masked by a diotic (i.e., identical between the ears) harmonic complex tone were measured in six normal‐hearing subjects. The target tone was presented either diotically or with an interaural phase difference (IPD) of 180° and in either harmonic or “mistuned” relationship to the diotic masker. Three different maskers were used, a resolved and an unresolved complex tone (fundamental frequency: 160 and 40 Hz) with four components below and above the target frequency and a broadband unresolved complex tone with 12 additional components. The target IPD provided release from masking in most masker conditions, whereas mistuning led to a significant release from masking only in the diotic conditions with the resolved and the narrowband unresolved maskers. A significant effect of mistuning was neither found in the diotic condition with the wideband unresolved masker nor in any of the dichotic conditions. An auditory model with a single analysis frequency band and different binaural processing schemes was employed to predict the data of the unresolved masker conditions. Sensitivity to modulation cues was achieved by including an auditory‐motivated modulation filter in the processing pathway. The predictions of the diotic data were in line with the experimental results and literature data in the narrowband condition, but not in the broadband condition, suggesting that across‐frequency processing is involved in processing modulation information. The experimental and model results in the dichotic conditions show that the binaural processor cannot exploit modulation information in binaurally unmasked conditions. HighlightsMasking release by mistuning and binaural disparity do not combine additively.Masking release by mistuning was not found in dichotic listening conditions.Modulation filtering explains masking release by mistuning in unresolved conditions.

[1]  John F Culling,et al.  Voice segregation by difference in fundamental frequency: evidence for harmonic cancellation. , 2011, The Journal of the Acoustical Society of America.

[2]  B. Moore,et al.  Thresholds for hearing mistuned partials as separate tones in harmonic complexes. , 1986, The Journal of the Acoustical Society of America.

[3]  J. L. Goldstein Auditory nonlinearity. , 1967, The Journal of the Acoustical Society of America.

[4]  A. Kohlrausch,et al.  Binaural processing model based on contralateral inhibition. I. Model structure. , 2001, The Journal of the Acoustical Society of America.

[5]  Roy D. Patterson,et al.  Distortion products and the perceived pitch of harmonic complex tones , 2001 .

[6]  Ning Ma,et al.  Exploiting correlogram structure for robust speech recognition with multiple speech sources , 2007, Speech Commun..

[7]  R. W. Hukin,et al.  Comparison of the effect of onset asynchrony on auditory grouping in pitch matching and vowel identification , 1995, Perception & psychophysics.

[8]  S McAdams,et al.  Hearing a mistuned harmonic in an otherwise periodic complex tone. , 1990, The Journal of the Acoustical Society of America.

[9]  J. C. R. Licklider,et al.  The Influence of Interaural Phase Relations upon the Masking of Speech by White Noise , 1948 .

[10]  Katrin Krumbholz,et al.  Binaural sluggishness precludes temporal pitch processing based on envelope cues in conditions of binaural unmasking. , 2009, The Journal of the Acoustical Society of America.

[11]  H. Duifhuis,et al.  Audibility of high harmonics in a periodic pulse. , 1970, The Journal of the Acoustical Society of America.

[12]  F L Wightman,et al.  Detectability of a pulsed tone in the presence of a masker with time-varying interaural correlation. , 1978, The Journal of the Acoustical Society of America.

[13]  Franz Pernkopf,et al.  Joint Position-Pitch Tracking for 2-Channel Audio , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[14]  Roy D Patterson,et al.  Duifhuis pitch: neuromagnetic representation and auditory modeling. , 2014, Journal of neurophysiology.

[15]  J. Hebrank,et al.  Spectral cues used in the localization of sound sources on the median plane. , 1974, The Journal of the Acoustical Society of America.

[16]  L. A. Jeffress,et al.  Masking of Tonal Signals , 1956 .

[17]  B. Epp,et al.  Combination of masking releases for different center frequencies and masker amplitude statistics. , 2009, The Journal of the Acoustical Society of America.

[18]  L. Carney,et al.  Power-Law Dynamics in an Auditory-Nerve Model Can Account for Neural Adaptation to Sound-Level Statistics , 2010, The Journal of Neuroscience.

[19]  R. W. Hukin,et al.  Auditory objects of attention: the role of interaural time differences. , 1999, Journal of experimental psychology. Human perception and performance.

[20]  S van de Par,et al.  Binaural processing model based on contralateral inhibition. II. Dependence on spectral parameters. , 2001, The Journal of the Acoustical Society of America.

[21]  D M Green,et al.  Detection of a mistuned component in a harmonic complex. , 1994, The Journal of the Acoustical Society of America.

[22]  Modulation cues influence binaural masking-level difference in masking-pattern experiments. , 2012, The Journal of the Acoustical Society of America.

[23]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[24]  Bastian Epp,et al.  Superposition of masking releases , 2008, Journal of Computational Neuroscience.

[25]  J H Grose,et al.  Effects of flanking band proximity, number, and modulation pattern on comodulation masking release. , 1990, The Journal of the Acoustical Society of America.

[26]  Alain de Cheveigné,et al.  Pitch shifts of mistuned partials: a time-domain model. , 1999 .

[27]  Effect of Mistuning on the Detection of a Tone Masked by a Harmonic Tone Complex , 2012, PloS one.

[28]  T. Dau,et al.  Characterizing frequency selectivity for envelope fluctuations. , 2000, The Journal of the Acoustical Society of America.

[29]  B. Moore,et al.  Thresholds for the detection of inharmonicity in complex tones. , 1985, The Journal of the Acoustical Society of America.

[30]  W. Hartmann,et al.  On the Duifhuis pitch effect. , 1997, The Journal of the Acoustical Society of America.

[31]  Claude Alain,et al.  Contribution of harmonicity and location to auditory object formation in free field: evidence from event-related brain potentials. , 2005, The Journal of the Acoustical Society of America.

[32]  Joseph W. Hall,et al.  Exploring the additivity of binaural and monaural masking release. , 2011, The Journal of the Acoustical Society of America.

[33]  G. Kramer Auditory Scene Analysis: The Perceptual Organization of Sound by Albert Bregman (review) , 2016 .

[34]  R. Meddis,et al.  A unitary model of pitch perception. , 1997, The Journal of the Acoustical Society of America.

[35]  T. Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. , 1996, The Journal of the Acoustical Society of America.

[36]  R. Beutelmann,et al.  Effect of Harmonicity on the Detection of a Signal in a Complex Masker and on Spatial Release from Masking , 2011, PloS one.

[37]  Yoshitaka Nakajima,et al.  Auditory Scene Analysis: The Perceptual Organization of Sound Albert S. Bregman , 1992 .

[38]  Torsten Dau,et al.  Binaural processing of modulated interaural level differences. , 2008, The Journal of the Acoustical Society of America.

[39]  Michael V Keebler,et al.  Can temporal fine structure represent the fundamental frequency of unresolved harmonics? , 2009, The Journal of the Acoustical Society of America.

[40]  L. Auger The Journal of the Acoustical Society of America , 1949 .

[41]  Volker Hohmann,et al.  Lateralization of stimuli with independent fine-structure and envelope-based temporal disparities. , 2008, The Journal of the Acoustical Society of America.

[42]  B. Roberts Spectral pattern, grouping, and the pitches of complex tones and their components , 2005 .

[43]  B. Moore Perceptual Consequences of Cochlear Damage , 1995 .

[44]  H. Levitt Transformed up-down methods in psychoacoustics. , 1971, The Journal of the Acoustical Society of America.

[45]  Joseph W. Hall,et al.  Detection in noise by spectro-temporal pattern analysis. , 1984, The Journal of the Acoustical Society of America.

[46]  B Kollmeier,et al.  Binaural forward and backward masking: evidence for sluggishness in binaural detection. , 1990, The Journal of the Acoustical Society of America.

[47]  N. Durlach Equalization and Cancellation Theory of Binaural Masking‐Level Differences , 1963 .

[48]  B. Kollmeier,et al.  Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. , 1997, The Journal of the Acoustical Society of America.