A computational model of filtering, detection, and compression in the cochlea

We claim that speech analysis algorithms should be based on computational models of human audition, starting at the ears. While much is known about how hearing works, little of this knowledge has been applied in the speech analysis field. We propose models of the inner ear, or cochlea, which are expressed as time- and place-domain signal processing operations; i.e. the models are computational expressions of the important functions of the cochlea. The main parts of the models concern mechanical filtering effects and the mapping of mechanical vibrations into neural representation. Our model cleanly separates these effects into time-invariant linear filtering based on a simple cascade/parallel filterbank network of second-order sections, plus transduction and compression based on half-wave rectification with a nonlinear coupled automatic gain control network. Compared to other speech analysis techniques, this model does a much better job of preserving important detail in both time and frequency, which is important for robust sound analysis. We discuss the ways in which this model differs from more detailed cochlear models.

[1]  D O Kim,et al.  Response patterns of single cochlear nerve fibers to click stimuli: descriptions for cat. , 1972, The Journal of the Acoustical Society of America.

[2]  J. Pierce,et al.  The cochlear compromise. , 1976, The Journal of the Acoustical Society of America.

[3]  D O Kim,et al.  Cochlear mechanics: nonlinear behavior in two-tone responses as reflected in cochlear-nerve-fiber responses and in ear-canal sound pressure. , 1980, The Journal of the Acoustical Society of America.

[4]  F S Werblin,et al.  The control of sensitivity in the retina. , 1973, Scientific American.

[5]  D O Kim,et al.  A population study of cochlear nerve fibers: comparison of spatial distributions of average-rate and phase-locking measures of responses to single tones. , 1979, Journal of neurophysiology.

[6]  M R Schroeder,et al.  An integrable model for the basilar membrane. , 1973, The Journal of the Acoustical Society of America.

[7]  Jont B. Allen Cochlear modeling - 1980 , 1981 .

[8]  J J Zwislocki Sound analysis in the ear: a history of discoveries. , 1981, American scientist.

[9]  J. L. Hall,et al.  Model for mechanical to neural transduction in the auditory receptor. , 1974, The Journal of the Acoustical Society of America.