A Single-Input Hearing Aid Based on the Auditory Perceptual Features to Improve Speech Intelligibility in Noise
暂无分享,去创建一个
One of the main problems of the sensorineural hearing impaired listeners is the partial or complete loss of frequency selectivity. It is now well established that most of the auditory perceptual features are very well represented in the ear by the spectrum of the incoming sound signal. Thus the loss of frequency selectivity means that it is difficult for the impaired listener to discriminate between two sounds or to understand speech in a noisy environment. This handicap is referred to as the cocktail party eflect. It is widely accepted, and proven by the experiments carried out on impaired listeners, that one of the main causes for this impairment is the broad and tilted auditory filter shapes in the damaged cochlea compared to an undamaged normal ear. As a result, in noisy surroundings these broad filters allow more noise than a normal ear making detection of signal in noise difficult. Therefore to improve intelligibility a hearing aid must, not only suppress the noise in speech but also alleviate the problems of reduced frequency selectivity. There are a few hearing aids proposed in the literature to enhance speech in noise. Most of them are based on Adaptive noise cancellation or Adaptive beamforming principles. They have proved to be very useful in situations where there are few noise sources or when there is a reference noise available. Very often the environment contains many uncorrelated noise sources effectively creating a diffusive noise source. Hence obtaining a reference noise signal that is correlated with the noise in the other inputs is impossible. In these situations the above methods produce very little speech enhancement. There are many conventional single-input systems to suppress noise but like the multi-microphone methods mentioned above, they have proved to be of very little use in increasing the intelligibility of the speech for the hearing impaired. In this paper we illustrate how a single-input system incorporating the auditory perceptual features could be employed to increase intelligibility in hearing aids. Spectral Subtraction (SS) is an efficient way of reducing noise in single-input systems. In this method an estimate of the magnitude spectrum of the noise, #(U), is obtained during nonspeech activity and is subtracted from the magnitude spectrum of the noisy speech, X(w), to obtain the enhanced speech, S(u). This performs satisfactorily when the noise source is stationary. The main drawback of this system is it does not consider the problems of the hearing impaired and as a result is of very little benefit to them. Furthermore it introduces a residual or mwacal nobe in the processed speech. It is shown in this paper that by incorporating the perceptual features like masking and excitation patterns the above problems can be eliminated. The technique proposed here, firstly transforms the power (not magnitude) spectrum of the noisy speech (X(w)) into auditory excitation patterns, E(w). The auditory system consists of a bank of constant bandwidth band-pass filters on a logarithmic scale (Bark Scale). Excitation patterns are obtained by convolving the signal spectrum with these filters. E(w) now represents the power spectrum of the signal as it would appear if it had been processed by a normal ear. These excitation patterns enable the hearing impaired to group frequency components quite successfully, thereby increasing their frequency selectivity in spite of the broad filters in their auditory system. This transformation also removes unwanted noise and speech that might have been processed by the impaired ear.