Sparse codes of harmonic natural sounds and their modulatory interactions

Sparse coding and its related theories have been successful to explain various response properties of early stages of sensory information processing such as primary visual cortex and peripheral auditory system, which suggests that the emergence of such properties results from adaptation of the nerve system to natural stimuli. The present study continues this line of research in a higher stage of auditory processing, focusing on harmonic structures that are often found in behaviourally important natural sound like animal vocalization. It has been physiologically shown that monkey primary auditory cortices (A1) have neurons with response properties capturing such harmonic structures: their response and modulation peaks are often found at frequencies that are harmonically related to each other. We hypothesize that such relations emerge from sparse coding of harmonic natural sounds. Our simulation shows that similar harmonic relations emerge from frequency-domain sparse codes of harmonic sounds, namely, piano performance and human speech. Moreover, the modulatory behaviours can be explained by competitive interactions of model neurons that capture partially common harmonic structures.

[1]  The Journal of Comparative Neurology , 1899, The American Naturalist.

[2]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[3]  Gerhard Stoll,et al.  Scaling of pitch strength , 1979, Hearing Research.

[4]  R A Reale,et al.  Geometry and orientation of neuronal processes in cat primary auditory cortex (AI) related to characteristic-frequency maps. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D. P. Phillips,et al.  Intracortical connections and their physiological correlates in the primary auditory cortex (AI) of the cat , 1988, The Journal of comparative neurology.

[6]  J. Kaas,et al.  Connections of primary auditory cortex in the new world monkey, Saguinus , 1989, The Journal of comparative neurology.

[7]  C. Schreiner,et al.  Physiology and topography of neurons with multipeaked tuning curves in cat primary auditory cortex. , 1991, Journal of neurophysiology.

[8]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[9]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[10]  C. Schreiner,et al.  Modular organization of frequency integration in primary auditory cortex. , 2000, Annual review of neuroscience.

[11]  E. Vajda Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .

[12]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[13]  S. Shamma On the role of space and time in auditory processing , 2001, Trends in Cognitive Sciences.

[14]  Satoshi Nakamura,et al.  RWCP Sound Scene Database in Real Acoustic Environment , 2002 .

[15]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[16]  Konrad P. Körding,et al.  Sparse Spectrotemporal Coding of Sounds , 2003, EURASIP J. Adv. Signal Process..

[17]  Xiaoqin Wang,et al.  Spectral integration in A1 of awake primates: neurons with single- and multipeaked tuning characteristics. , 2003, Journal of neurophysiology.

[18]  Bruno A. Olshausen,et al.  Learning sparse, overcomplete representations of time-varying natural images , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[19]  Bruno A Olshausen,et al.  Sparse coding of sensory inputs , 2004, Current Opinion in Neurobiology.

[20]  Shin Ishii,et al.  Nonlinear and Noisy Extension of Independent Component Analysis: Theory and Its Application to a Pitch Sensation Model , 2005, Neural Computation.

[21]  Michael S. Lewicki,et al.  Efficient auditory coding , 2006, Nature.