Simulating the phonological auditory cortex: from vowel representation spaces to categories

Abstract: Vowels are important cues supporting speech perception. Nevertheless, in Computational Perception the definition of a vowel remains a complex and elusive issue. The purpose of the present paper is to propose a possible definition from the perceptual point of view. A vowel could be defined as the assignment of an acoustic–phonetic pattern to a specific categorical representation space. This assignment would be competitively instantiated in the cortical structures, depending on the specific phonological framework of the listener's language. An experimental framework is designed to test this definition on a Neuromorphic Speech Processing Architecture. Results from experiments testing reference patterns in Spanish are presented and discussed, together with a possible extension to other languages with a larger repertoire of categories.
