The Effects of Syllabic Compression and Frequency Shaping on Speech Intelligibility in Hearing

The effect of syllabic compression on speech intelligibility is rarely positive and in those cases that positive effects have been found, the same positive results could in general be obtained by frequency shaping of the frequency response curve. We programmed a syllabic compressor on a digital processor; the compressor differed from a conventional syllabic compressor by incorporating a delay in the signal path to suppress overshoots and thus minimize transient distortion. Furthermore, the time constants were short: attack time of 5 msec and release time of 15 msec. The compressor was only active in the high-frequency band. An essentially linear signal was added to deliver the low-frequency speech components. The processing resulted in a frequency response that mirrored the hearing loss near threshold and became much flatter for higher level input signals. Speech intelligibility scores for nonsense consonant-vowel-consonant words embedded in carrier phrases were determined for hearing-impaired persons with sloping audiograms and discrimination losses for speech. Results showed little additional effect of frequency shaping to the existing improved speech score for compressed speech. Optimum results were found for a compression ratio 2 with lower speech scores for linear amplification and for compression ratio 8. We next determined the effect of providing high-frequency emphasis to the speech signal and/or to the compression control signal to compensate for the upward spread of masking. The frequency response at the root-mean-square level was adjusted according to the half-gain rule. The positive effects of moderate compression could be found again; the high-frequency emphasis, however, was positive for the vowels but made consonant recognition poorer. We concluded that smoothing the speech intelligibility score improved for moderate compression with relative little effect of frequency shaping. Adding high-frequency emphasis to a half-gain rule response curve was not advantageous.

[1]  Eduard Stikvoort,et al.  Digital Dynamic Range Compressor for Audio , 1986 .

[2]  H Levitt,et al.  Evaluation of orthogonal polynomial compression. , 1991, The Journal of the Acoustical Society of America.

[3]  Herman J. M. Steeneken,et al.  Speech data-base for intelligibility and speech quality measurements , 1990 .

[4]  L D Braida,et al.  Principal-component amplitude compression for the hearing impaired. , 1987, The Journal of the Acoustical Society of America.

[5]  M P Haggard,et al.  Two-state compression of spectral tilt: individual differences and psychoacoustical limitations to the benefit from compression. , 1987, Journal of rehabilitation research and development.

[6]  E Villchur,et al.  Signal processing to improve speech intelligibility in perceptive deafness. , 1973, The Journal of the Acoustical Society of America.

[7]  L D Braida,et al.  Multiband compression limiting for hearing-impaired listeners. , 1987, Journal of rehabilitation research and development.

[8]  W A Dreschler,et al.  Syllabic compression and speech intelligibility in hearing impaired listeners. , 1993, Scandinavian audiology. Supplementum.