Non-Intrusive Speech Quality Prediction Using Modulation Energies and LSTM-Network
暂无分享,去创建一个
Stefan Goetze | Simon Doclo | Tiago H. Falk | Benjamin Cauchi | Kai Siedenburg | João F. Santos | S. Doclo | Stefan Goetze | T. Falk | J. F. Santos | Kai Siedenburg | Benjamin Cauchi
[1] Birger Kollmeier,et al. Predicting speech intelligibility with deep neural networks , 2018, Comput. Speech Lang..
[2] Richard C. Hendriks,et al. Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Dorothea Kolossa,et al. Blind Non-Intrusive Speech Intelligibility Prediction Using Twin-HMMs , 2016, INTERSPEECH.
[4] Tiago H. Falk,et al. Temporal Dynamics for Blind Measurement of Room Acoustical Parameters , 2010, IEEE Transactions on Instrumentation and Measurement.
[5] J. C. Steinberg,et al. Factors Governing the Intelligibility of Speech Sounds , 1945 .
[6] Yong Wang,et al. Using Model Trees for Classification , 1998, Machine Learning.
[7] Patrick A. Naylor,et al. Predicting the quality of processed speech by combining modulation-based features and model trees , 2016, ITG Symposium on Speech Communication.
[8] Alastair H. Moore,et al. The ACE challenge — Corpus description and performance evaluation , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
[9] Rainer Martin,et al. Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[10] J. Berger,et al. P.563—The ITU-T Standard for Single-Ended Speech Quality Assessment , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[11] Rainer Martin,et al. A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[12] Mike Brookes,et al. Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[13] Birger Kollmeier,et al. PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[14] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..
[15] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[16] R. Maas,et al. A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research , 2016, EURASIP Journal on Advances in Signal Processing.
[17] P. Flipsen,et al. Measuring the intelligibility of conversational speech in children , 2006, Clinical linguistics & phonetics.
[18] Kevin Barraclough,et al. I and i , 2001, BMJ : British Medical Journal.
[19] K. U. Simmer,et al. Multi-microphone noise reduction techniques as front-end devices for speech recognition , 2000, Speech Commun..
[20] Jesper Jensen,et al. A non-intrusive Short-Time Objective Intelligibility measure , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Jingdong Chen,et al. Microphone Array Signal Processing , 2008 .
[22] T Houtgast,et al. A physical method for measuring speech-transmission quality. , 1980, The Journal of the Acoustical Society of America.
[23] Tiago H. Falk,et al. An improved non-intrusive intelligibility metric for noisy and reverberant speech , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).
[24] Toon van Waterschoot,et al. Adaptive Speech Dereverberation Using Constrained Sparse Multichannel Linear Prediction , 2017, IEEE Signal Processing Letters.
[25] Steve Renals,et al. WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[26] R. C. Sprinthall. Basic Statistical Analysis , 1982 .
[27] Rainer Martin,et al. Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..
[28] Emanuel A. P. Habets,et al. A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).
[29] Stephan Gerlach,et al. Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech , 2015, EURASIP J. Adv. Signal Process..
[30] Sven Nordholm,et al. Multichannel Signal Enhancement Algorithms for Assisted Listening Devices: Exploiting spatial diversity using multiple microphones , 2015, IEEE Signal Processing Magazine.
[31] Henry Cox,et al. Robust adaptive beamforming , 2005, IEEE Trans. Acoust. Speech Signal Process..
[32] James M. Kates,et al. Objective Quality and Intelligibility Prediction for Users of Assistive Listening Devices: Advantages and limitations of existing tools , 2015, IEEE Signal Processing Magazine.
[33] Jesper Jensen,et al. An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[34] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[35] Rainer Martin,et al. Parameterized MMSE spectral magnitude estimation for the enhancement of noisy speech , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[36] Emmanuel Vincent,et al. Audio Source Separation and Speech Enhancement , 2018 .
[37] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[38] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[39] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[40] Mike Brookes,et al. A data-driven non-intrusive measure of speech quality and intelligibility , 2016, Speech Commun..
[41] Tiago H. Falk,et al. Blind Room Acoustics Characterization Using Recurrent Neural Networks and Modulation Spectrum Dynamics , 2016 .
[42] Doh-Suk Kim,et al. ANIQUE+: A new American national standard for non-intrusive estimation of narrowband speech quality , 2007, Bell Labs Technical Journal.
[43] Joseph Lipka,et al. A Table of Integrals , 2010 .