An analog VLSI chip with asynchronous interface for auditory feature extraction

We describe the architecture and circuit implementation of an analog VLSI feature extraction chip that has an asynchronous digital interface and is designed to serve as an auditory based front-end for a digit recognition system. The single chip system encodes signal energies and zero crossing time intervals of frequency components in a cochlear filter bank. The chip has been fabricated in a 1.2 /spl mu/m n-well, double polysilicon, double metal CMOS process and it is fully functional. Power consumption when operated from a 5 V supply is only a few milliwatts.

[1]  Oded Ghitza,et al.  Auditory nerve representation as a front-end for speech recognition in a noisy environment , 1986 .

[2]  Kuansan Wang,et al.  Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[3]  C. L. Searle,et al.  Time-domain analysis of auditory-nerve-fiber firing rates. , 1990, The Journal of the Acoustical Society of America.

[4]  Carver Mead,et al.  Analog VLSI and neural systems , 1989 .

[5]  S. A. Shamma,et al.  Zero-Crossing and Noise Suppression in Auditory Wavelet Transformations , 1992 .

[6]  Andreas G. Andreou,et al.  Application of Discriminant Analysis to Speech Recognition with Auditory Features , 1995 .

[7]  C. Neti,et al.  Neuromorphic speech processing for noisy environments , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[8]  John Wawrzynek,et al.  Silicon Auditory Processors as Computer Peripherals , 1992, NIPS.

[9]  Andreas G. Andreou,et al.  Cochlear models implemented with linearized transconductors , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[10]  William J. Byrne,et al.  Noise robustness in the auditory representation of speech signals , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Soo-Young Lee,et al.  Feature extraction based on zero-crossings with peak amplitudes for robust speech recognition in noisy environments , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[12]  Dieter Geller,et al.  Improvements in connected digit recognition using linear discriminant analysis and mixture densities , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Andreas G. Andreou,et al.  A design framework for low power analog filter banks , 1995 .

[14]  Charles Robert Jankowski,et al.  A comparison of auditory models for automatic speech recognition , 1992 .

[15]  J. Brant Arseneau,et al.  VLSI and neural systems , 1990 .

[16]  Weimin Liu,et al.  Voiced-speech representation by an analog silicon model of the auditory periphery , 1992, IEEE Trans. Neural Networks.

[17]  Hussein Baher,et al.  Analog and Digital Signal Processing , 1990 .

[18]  B. Kedem,et al.  Spectral analysis and discrimination by zero-crossings , 1986, Proceedings of the IEEE.

[19]  K. Payton Vowel processing by a model of the auditory periphery: A comparison to eighth‐nerve responses , 1988 .

[20]  Gert Cauwenberghs,et al.  Fault-tolerant dynamic multilevel storage in analog VLSI , 1994 .

[21]  Stéphane Mallat,et al.  Zero-crossings of a wavelet transform , 1991, IEEE Trans. Inf. Theory.

[22]  John Wawrzynek,et al.  Systems technologies for silicon auditory models , 1994, IEEE Micro.

[23]  M. Sachs,et al.  Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. , 1979, The Journal of the Acoustical Society of America.

[24]  Gert Cauwenberghs,et al.  A circuit model of hair-cell transduction for temporal processing and auditory feature extraction , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[25]  Andreas G. Andreou,et al.  Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition , 1997 .

[26]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[27]  Andreas G. Andreou,et al.  Linearised differential transconductors in subthreshold CMOS , 1995 .

[28]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.