Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement
暂无分享,去创建一个
[1] Xiaohui Zhang,et al. Improving deep neural network acoustic models using generalized maxout networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[3] Philipos C. Loizou,et al. Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[4] Brian R Glasberg,et al. Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.
[5] Hadi Veisi,et al. Speech enhancement using hidden Markov models in Mel-frequency domain , 2013, Speech Commun..
[6] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.
[7] Ben P. Milner,et al. Using hidden Markov models for speech enhancement , 2014, INTERSPEECH.
[8] Heiga Zen,et al. Speech Synthesis Based on Hidden Markov Models , 2013, Proceedings of the IEEE.
[9] Danny Crookes,et al. A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[10] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .
[11] Rainer Martin,et al. Spectral Subtraction Based on Minimum Statistics , 2001 .
[12] Phil D. Green,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001, Speech Commun..
[13] Robert M. Nickel,et al. Speech Enhancement With Inventory Style Speech Resynthesis , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[14] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .
[15] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .
[16] Ning Ma,et al. Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis , 2013, IEEE Signal Processing Letters.
[17] Jon Barker,et al. The second ‘CHiME’ speech separation and recognition challenge: An overview of challenge systems and outcomes , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[18] Sanjeev Khudanpur,et al. Parallel training of DNNs with Natural Gradient and Parameter Averaging , 2014 .
[19] Alan W. Black,et al. Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[20] Sanjeev Khudanpur,et al. A time delay neural network architecture for efficient modeling of long temporal contexts , 2015, INTERSPEECH.
[21] Andrew W. Senior,et al. Improving DNN speaker independence with I-vector inputs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Jon Barker,et al. The second ‘chime’ speech separation and recognition challenge: Datasets, tasks and baselines , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[23] Heiga Zen,et al. Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[24] Guy J. Brown,et al. Computational auditory scene analysis , 1994, Comput. Speech Lang..
[25] Ning Ma,et al. Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Martin Cooke,et al. A glimpsing model of speech perception in noise. , 2006, The Journal of the Acoustical Society of America.