Comparison of STOI-type Intermediate Feature for Listening Difficulty Rating Prediction Model

Automatic quality evaluation of outdoor public-address system is required for a disaster situation. We had earlier developed a system to predict the listening difficulty rating (LDR) of a PA system. However, the prediction error was large, and the performance was not satisfactory. Here, we use STOI-type indices as intermediate features for improvement of the LDR prediction system. The results show that ESTOI and SIMI should be chosen optimally for each SNR condition.

[1]  Seung Ho Choi,et al.  A Deep Learning-Based Approach to Non-Intrusive Objective Speech Intelligibility Estimation , 2018, IEICE Trans. Inf. Syst..

[2]  Hiroshi Sato,et al.  Listening difficulty as a subjective measure for evaluation of speech transmission performance in public spaces. , 2004, The Journal of the Acoustical Society of America.

[3]  Jesper Jensen,et al.  Speech Intelligibility Prediction Based on Mutual Information , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[4]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  S. Rigatti Random Forest. , 2017, Journal of insurance medicine.

[6]  Yoshifumi Chisaki,et al.  Emission timing controller by single board computer for public address system , 2015, 2015 IEEE 4th Global Conference on Consumer Electronics (GCCE).

[7]  Jesper Jensen,et al.  An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Yosuke Kobayashi,et al.  Listening difficulty estimation model using short-time objective intelligibility measure for outdoor public address systems , 2020, Acoustical Science and Technology.