An analog VLSI chip with asynchronous interface for auditory feature extraction

We present an analog VLSI chip intended to serve as a front end of a speech recognition system. The chip architecture is inspired by biological auditory models common to humans and primate vertebrates. We include experimental results on a 1.2-/spl mu/m CMOS custom analog VLSI implementation and speech recognition results obtained from software simulations of the hardware on the TI-DIGITS database.

[1]  Campbell L. Searle,et al.  Time‐domain analysis of auditory‐nerve fiber firing rates , 1989 .

[2]  John Wawrzynek,et al.  Systems technologies for silicon auditory models , 1994, IEEE Micro.

[3]  Kuansan Wang,et al.  Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[4]  Andreas G. Andreou,et al.  Linearised differential transconductors in subthreshold CMOS , 1995 .

[5]  Carver Mead,et al.  Analog VLSI and neural systems , 1989 .

[6]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[7]  Stéphane Mallat,et al.  Zero-crossings of a wavelet transform , 1991, IEEE Trans. Inf. Theory.

[8]  William J. Byrne,et al.  Noise robustness in the auditory representation of speech signals , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Charles Robert Jankowski,et al.  A comparison of auditory models for automatic speech recognition , 1992 .

[10]  Andreas G. Andreou,et al.  A design framework for low power analog filter banks , 1995 .

[11]  J. Allen,et al.  Cochlear micromechanics--a physical model of transduction. , 1980, The Journal of the Acoustical Society of America.

[12]  John Wawrzynek,et al.  Silicon Auditory Processors as Computer Peripherals , 1992, NIPS.

[13]  M. Sachs,et al.  Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. , 1979, The Journal of the Acoustical Society of America.

[14]  K. Payton Speech processing by a model of the auditory periphery , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  K. Payton Vowel processing by a model of the auditory periphery: A comparison to eighth‐nerve responses , 1988 .

[16]  C. Neti,et al.  Neuromorphic speech processing for noisy environments , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[17]  Dieter Geller,et al.  Improvements in connected digit recognition using linear discriminant analysis and mixture densities , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Richard F. Lyon,et al.  An analog electronic cochlea , 1988, IEEE Trans. Acoust. Speech Signal Process..

[19]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[20]  Weimin Liu,et al.  Voiced-speech representation by an analog silicon model of the auditory periphery , 1992, IEEE Trans. Neural Networks.

[21]  Andreas G. Andreou,et al.  Cochlear models implemented with linearized transconductors , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[22]  Soo-Young Lee,et al.  Feature extraction based on zero-crossings with peak amplitudes for robust speech recognition in noisy environments , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[23]  B. Kedem,et al.  Spectral analysis and discrimination by zero-crossings , 1986, Proceedings of the IEEE.

[24]  Andreas G. Andreou,et al.  Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition , 1997 .

[25]  Gert Cauwenberghs,et al.  Fault-tolerant dynamic multilevel storage in analog VLSI , 1994 .