Audio-visual speech enhancement using deep neural networks
Yu Tsao | Hsin-Min Wang | Ying-Hui Lai | Jen-Chun Lin | Hsiu-Wen Chang | Syu-Siang Wang | Jen-Cheng Hou