Analysis of acoustic features for speech intelligibility prediction models analysis of acoustic features for speech intelligibility prediction models

We have developed a new model to predict speech intelligibility of synthetic sounds processed by nonlinear speech enhancement algorithms. The model involves two recent auditory models: the dynamic compressive gammachirp (dcGC) auditory filterbank and the speech envelope power spectrum model (sEPSM). The dcGC-sEPSM was compared with commonly used prediction models based on perceptual intelligibility scores of speech sounds enhanced by classic spectral subtraction and state-of-the-art Wiener filtering. As a result, the dcGC-sEPSM predicted the scores better than the coherence SII (CSII), the short-time objective intelligibility (STOI), and the original sEPSM using the gammatone filterbank. There was, however, still inconsistency between the prediction and data. In this work, we show the analysis of acoustic features used in the prediction models. The CSII calculates the magnitude-squared coherence between clean and processed spectra to derive a signal-to-distortion ratio. The STOI calculates the correlation...