Sound representation methods for spectro-temporal receptive field estimation

The spectro-temporal receptive field (STRF) of an auditory neuron describes the linear relationship between the sound stimulus in a time-frequency representation and the neural response. Time-frequency representations of a sound in turn require a nonlinear operation on the sound pressure waveform and many different forms for this non-linear transformation are possible. Here, we systematically investigated the effects of four factors in the non-linear step in the STRF model: the choice of logarithmic or linear filter frequency spacing, the time-frequency scale, stimulus amplitude compression and adaptive gain control. We quantified the goodness of fit of these different STRF models on data obtained from auditory neurons in the songbird midbrain and forebrain. We found that adaptive gain control and the correct stimulus amplitude compression scheme are paramount to correctly modelling neurons. The time-frequency scale and frequency spacing also affected the goodness of fit of the model but to a lesser extent and the optimal values were stimulus dependant.

[1]  Robert J. Dooling,et al.  4 – Auditory Perception in Birds , 1982 .

[2]  K. Sen,et al.  Spectral-temporal Receptive Fields of Nonlinear Auditory Neurons Obtained Using Natural Sounds , 2022 .

[3]  S A Shamma,et al.  Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex. , 2001, Journal of neurophysiology.

[4]  M. Sachs,et al.  Rate versus level functions for auditory-nerve fibers in cats: tone-burst stimuli. , 1974, The Journal of the Acoustical Society of America.

[5]  C. Schreiner,et al.  Nonlinear Spectrotemporal Sound Analysis by Neurons in the Auditory Midbrain , 2002, The Journal of Neuroscience.

[6]  Geoffrey A. Manley,et al.  The Hearing Organ of Birds and Crocodilia , 2000 .

[7]  Malcolm Slaney,et al.  Lyon's Cochlear Model , 1997 .

[8]  J. Fritz,et al.  Dynamics of Precise Spike Timing in Primary Auditory Cortex , 2004, The Journal of Neuroscience.

[9]  N. C. Singh,et al.  Modulation spectra of natural sounds and ethological theories of auditory processing. , 2003, The Journal of the Acoustical Society of America.

[10]  Sarah M N Woolley,et al.  Response properties of single neurons in the zebra finch auditory midbrain: response patterns, frequency coding, intensity coding, and spike latencies. , 2004, Journal of neurophysiology.

[11]  Ce Schreiner,et al.  Spectral envelope coding in cat primary auditory cortex: Properties of ripple transfer functions , 1994 .

[12]  A. Aertsen,et al.  The Spectro-Temporal Receptive Field , 1981, Biological Cybernetics.

[13]  Sarah M. N. Woolley,et al.  Modulation Power and Phase Spectrum of Natural Sounds Enhance Neural Encoding Performed by Single Auditory Neurons , 2004, The Journal of Neuroscience.

[14]  M. Merzenich,et al.  Optimizing sound features for cortical neurons. , 1998, Science.

[15]  A M Aertsen,et al.  Reverse-correlation methods in auditory research , 1983, Quarterly Reviews of Biophysics.

[16]  Jonathan Z. Simon,et al.  Robust Spectrotemporal Reverse Correlation for the Auditory System: Optimizing Stimulus Design , 2000, Journal of Computational Neuroscience.

[17]  Mario A. Ruggero,et al.  Physiology and Coding of Sound in the Auditory Nerve , 1992 .

[18]  D. Margoliash,et al.  Neuronal populations and single cells representing learned auditory objects , 2003, Nature.

[19]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[20]  Richard F. Lyon,et al.  A computational model of filtering, detection, and compression in the cochlea , 1982, ICASSP.

[21]  Mark S. Seidenberg,et al.  Limits on Reacquisition of Song in Adult Zebra Finches Exposed to White Noise , 2004, The Journal of Neuroscience.

[22]  W. T. Peake,et al.  Experiments in Hearing , 1963 .

[23]  Roman Borisyuk,et al.  Oscillatory model of novelty detection. , 2001 .

[24]  Khaled H. Hamed,et al.  Time-frequency analysis , 2003 .

[25]  D. P. Phillips Neural representation of sound amplitude in the auditory cortex: effects of noise masking , 1990, Behavioural Brain Research.

[26]  E. Evans,et al.  Intensity coding in the auditory periphery of the cat: Responses of cochlear nerve and cochlear nucleus neurons to signals in the presence of bandstop masking noise , 1982, Hearing Research.

[27]  Darragh Smyth,et al.  Methods for first-order kernel estimation: simple-cell receptive fields from responses to natural scenes , 2003, Network.

[28]  R. Schlauch,et al.  Basilar membrane nonlinearity and loudness. , 1998, The Journal of the Acoustical Society of America.

[29]  M. Nicolelis,et al.  Feature article: the structure and function of dynamic cortical and thalamic receptive fields. , 2001, Cerebral cortex.

[30]  Yuan-Ting Zhang,et al.  The application of bionic wavelet transform to speech signal processing in cochlear implants using neural network simulations , 2002, IEEE Trans. Biomed. Eng..

[31]  Vasilis Z. Marmarelis,et al.  Analysis of Physiological Systems , 1978, Computers in Biology and Medicine.

[32]  S. S. Stevens The direct estimation of sensory magnitudes-loudness. , 1956, The American journal of psychology.

[33]  Christian K. Machens,et al.  Linearity of Cortical Receptive Fields Measured with Natural Sounds , 2004, The Journal of Neuroscience.

[34]  N. C. Singh,et al.  Estimating spatio-temporal receptive fields of auditory and visual neurons from their responses to natural stimuli , 2001 .

[35]  R J Dooling,et al.  Hearing in passerine and psittacine birds: a comparative study of absolute and masked auditory thresholds. , 1987, Journal of comparative psychology.

[36]  C. Schreiner,et al.  Spectral envelope coding in cat primary auditory cortex: linear and non‐linear effects of stimulus characteristics , 1998, The European journal of neuroscience.

[37]  D. P. Phillips,et al.  Responses of single neurons in cat auditory cortex to time-varying stimuli: linear amplitude modulations , 2004, Experimental Brain Research.

[38]  A. Aertsen,et al.  Prediction of the responses of auditory neurons in the midbrain of the grass frog based on the spectro-temporal receptive field , 1983, Hearing Research.

[39]  Ad Aertsen,et al.  Spectro-temporal characterization of auditory neurons , 1981 .

[40]  Alexander Borst,et al.  Quantifying variability in neural responses and its application for the validation of model predictions , 2004, Network.

[41]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[43]  Lee M. Miller,et al.  Naturalistic Auditory Contrast Improves Spectrotemporal Coding in the Cat Inferior Colliculus , 2003, The Journal of Neuroscience.