Time-frequency integration characteristics of hearing are optimized for perception of speech-like acoustic patterns.

Several psychoacoustic phenomena such as loudness perception, absolute thresholds of hearing, and perceptual grouping in time are affected by temporal integration of the signal in the auditory system. Similarly, the frequency resolution of the hearing system, often expressed in terms of critical bands, implies signal integration across neighboring frequencies. Although progress has been made in understanding the neurophysiological mechanisms behind these processes, the underlying reasons for the observed integration characteristics have remained poorly understood. The current work proposes that the temporal and spectral integration are a result of a system optimized for pattern detection from ecologically relevant acoustic inputs. This argument is supported by a simulation where the average time-frequency structure of speech that is derived from a large set of speech signals shows a good match to the time-frequency characteristics of the human auditory system. The results also suggest that the observed integration characteristics are learnable from acoustic inputs of the auditory environment using a Hebbian-like learning rule.

[1]  G. F. Cooper,et al.  Development of the Brain depends on the Visual Environment , 1970, Nature.

[2]  R. Patterson Auditory filter shapes derived with noise stimuli. , 1976, The Journal of the Acoustical Society of America.

[3]  A. Flock,et al.  Role of inner and outer hair cells in mechanical frequency selectivity of the cochlea , 1985, Hearing Research.

[4]  R. Parncutt Prenatal and infant conditioning, the mother schema, and the origins of music and religion , 2009 .

[5]  Unto K. Laine,et al.  A method for noise-robust context-aware pattern discovery and recognition from categorical sequences , 2012, Pattern Recognit..

[6]  Prenatal experience and neonatal responsiveness to vocal expressions of emotion. , 1999, Developmental psychobiology.

[7]  Prasanta Kumar Ghosh,et al.  Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures. , 2011, The Journal of the Acoustical Society of America.

[8]  C. Schreiner,et al.  Short-term adaptation of auditory receptive fields to dynamic stimuli. , 2004, Journal of neurophysiology.

[9]  B. Moore,et al.  Temporal window shape as a function of frequency and level. , 1989, The Journal of the Acoustical Society of America.

[10]  B C Moore,et al.  The shape of the ear's temporal window. , 1988, The Journal of the Acoustical Society of America.

[11]  N. Viemeister,et al.  Temporal integration and multiple looks. , 1991, The Journal of the Acoustical Society of America.

[12]  Ray Meddis,et al.  The psychophysics of absolute threshold and signal duration: a probabilistic approach. , 2011, The Journal of the Acoustical Society of America.

[13]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[14]  D M Green,et al.  The critical masking interval. , 1972, The Journal of the Acoustical Society of America.

[15]  E. Zwicker Dependence of post-masking on masker duration and its relation to temporal effects in loudness. , 1984, The Journal of the Acoustical Society of America.

[16]  Andrew J. Oxenham,et al.  Effects of masker frequency and duration in forward masking: further evidence for the influence of peripheral nonlinearity , 2000, Hearing Research.

[17]  P. Jusczyk From general to language-specific capacities: the WRAPSA Model of how speech perception develops , 1993 .

[18]  Lou Boves,et al.  Computational modelling of spoken-word recognition processes: Design choices and evaluation , 2010 .

[19]  R V Shannon,et al.  Forward masking in patients with cochlear implants. , 1990, The Journal of the Acoustical Society of America.

[20]  A. Oxenham,et al.  Basilar-membrane nonlinearity and the growth of forward masking. , 1996, The Journal of the Acoustical Society of America.

[21]  E M Relkin,et al.  A reexamination of forward masking in the auditory nerve. , 1988, The Journal of the Acoustical Society of America.

[22]  M. Jeannerod,et al.  Role of visual experience in the development of optokinetic response in kittens , 2004, Experimental Brain Research.

[23]  R. Smith Short-term adaptation in single auditory nerve fibers: some poststimulatory effects. , 1977 .

[24]  B. Moore,et al.  Modeling the additivity of nonsimultaneous masking , 1994, Hearing Research.

[25]  T. Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. , 1996, The Journal of the Acoustical Society of America.

[26]  G M Gerken,et al.  Auditory temporal integration and the power function model. , 1990, The Journal of the Acoustical Society of America.

[27]  M. J. Penner Persistence and integration: Two consequences of a sliding integrator , 1975 .

[28]  A. Bass,et al.  Seasonal Plasticity of Peripheral Auditory Frequency Sensitivity , 2003, The Journal of Neuroscience.

[29]  G. Klump,et al.  Temporal integration in the gerbil: The effects of age, hearing loss and temporally unmodulated and modulated speech-like masker noises , 2007, Hearing Research.

[30]  Peter Heil,et al.  A unifying basis of auditory thresholds based on temporal summation , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[31]  M. A. Bouman,et al.  Relation between Hearing Threshold and Duration for Tone Pulses , 1959 .

[32]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[33]  M. Merzenich,et al.  Plasticity in the frequency representation of primary auditory cortex following discrimination training in adult owl monkeys , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[34]  T. Dau,et al.  A computational model of human auditory signal processing and perception. , 2008, The Journal of the Acoustical Society of America.

[35]  R. Lasky,et al.  The Development of the Auditory System from Conception to Term , 2005 .

[36]  Gary J. Rose,et al.  Long-term temporal integration in the anuran auditory system , 1998, Nature Neuroscience.

[37]  D. W. Smith,et al.  Effects of outer hair cell loss on the frequency selectivity of the patas monkey auditory system , 1987, Hearing Research.

[38]  A. Oxenham,et al.  Masking by Inaudible Sounds and the Linearity of Temporal Summation , 2006, The Journal of Neuroscience.

[39]  E. de Boer,et al.  Auditory Time Constants: A Paradox? , 1985 .

[40]  A. Oxenham,et al.  A further test of the linearity of temporal summation in forward masking. , 2007, The Journal of the Acoustical Society of America.

[41]  N. Viemeister Temporal modulation transfer functions based upon modulation thresholds. , 1979, The Journal of the Acoustical Society of America.

[42]  Wentian Li Mutual information functions versus correlation functions , 1990 .

[43]  Terence W. Picton,et al.  Temporal integration in the human auditory cortex as represented by the development of the steady-state magnetic field , 2002, Hearing Research.

[44]  E. Zwicker,et al.  Subdivision of the audible frequency range into critical bands , 1961 .

[45]  P. Kuhl Early language acquisition: cracking the speech code , 2004, Nature Reviews Neuroscience.

[46]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[47]  Paul M. Hofman,et al.  Relearning sound localization with new ears , 1998, Nature Neuroscience.

[48]  A. Oxenham,et al.  Forward masking: adaptation or integration? , 2001, The Journal of the Acoustical Society of America.

[49]  M M Merzenich,et al.  Neural Mechanisms Underlying Temporal Integration, Segmentation, and Input Sequence Representation: Some Implications for the Origin of Learning Disabilities a , 1993, Annals of the New York Academy of Sciences.

[50]  A. Faulkner,et al.  Adaptation by normal listeners to upward spectral shifts of speech: implications for cochlear implants. , 1999, The Journal of the Acoustical Society of America.

[51]  W. R. Garner,et al.  The masked threshold of pure tones as a function of duration. , 1947, Journal of experimental psychology.

[52]  J. C. Stevens,et al.  Brightness and loudness as functions of stimulus duration , 1966 .

[53]  W. Fitch The Evolution of Language: A Comparative Review , 2005 .