Voiced-speech representation by an analog silicon model of the auditory periphery

An analog CMOS integration of a model for the auditory periphery is presented. The model consists of middle ear, basilar membrane, and hair cell/synapse modules which are derived from neurophysiological studies. The circuit realization of each module is discussed, and experimental data of each module's response to sinusoidal excitation are given. The nonlinear speech processing capabilities of the system are demonstrated using the voiced syllable |ba|. The multichannel output of the silicon model corresponds to the time-varying instantaneous firing rates of auditory nerve fibers that have different characteristic frequencies. These outputs are similar to the physiologically obtained responses. The actual implementation uses subthreshold CMOS technology and analog continuous-time circuits, resulting in a real-time, micropower device with potential applications as a preprocessor of auditory stimuli.

[1]  Don H. Johnson,et al.  The response of single auditory-nerve fibers in the cat to single tones: synchrony and average discharge rate , 1974 .

[2]  C. D. Geisler,et al.  A composite auditory model for processing speech sounds. , 1987, The Journal of the Acoustical Society of America.

[3]  Carver Mead,et al.  Analog VLSI and neural systems , 1989 .

[4]  John Lazzaro,et al.  Circuit Models of Sensory Transduction in the Cochlea , 1989, Analog VLSI Implementation of Neural Systems.

[5]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[6]  Oded Ghitza,et al.  Auditory nerve representation as a front-end for speech recognition in a noisy environment , 1986 .

[7]  Richard F. Lyon,et al.  An analog electronic cochlea , 1988, IEEE Trans. Acoust. Speech Signal Process..

[8]  M. Sachs,et al.  Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. , 1979, The Journal of the Acoustical Society of America.

[9]  Stephanie Seneff,et al.  A computational model for the peripheral auditory system: Application of speech recognition research , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Andreas G. Andreou,et al.  A Contrast Sensitive Silicon Retina with Reciprocal Synapses , 1991, NIPS.

[11]  Andreas G. Andreou,et al.  Current-mode subthreshold MOS circuits for analog VLSI neural systems , 1991, IEEE Trans. Neural Networks.

[12]  Edgar Sanchez-Sinencio,et al.  Generation of continuous-time two integrator loop OTA filter structures , 1988 .

[13]  Andreas G. Andreou,et al.  An analog integrated speech front-end based on the auditory periphery , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[14]  Andreas G. Andreou,et al.  Speech preprocessing using analog VLSI , 1990 .

[15]  Andreas G. Andreou,et al.  Synthetic Neural Circuits Using Current-Domain Signal Representations , 1989, Neural Computation.

[16]  K. Payton Vowel processing by a model of the auditory periphery: A comparison to eighth‐nerve responses , 1988 .

[17]  W. T. Peake,et al.  Middle-ear characteristics of anesthetized cats. , 1967, The Journal of the Acoustical Society of America.

[18]  Campbell L. Searle,et al.  Time‐domain analysis of auditory‐nerve fiber firing rates , 1989 .

[19]  Victor Zue,et al.  A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons , 1990, ICSLP.

[20]  Phillip E Allen,et al.  CMOS Analog Circuit Design , 1987 .

[21]  E D Young,et al.  Auditory nerve representation of vowels in background noise. , 1983, Journal of neurophysiology.