A Recurrent Variational Autoencoder for Speech Enhancement
暂无分享,去创建一个
[1] Algorithms to measure audio programme loudness and true-peak audio level , 2011 .
[2] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[4] Antoine Liutkus,et al. Cauchy Multichannel Speech Enhancement with a Deep Speech Prior , 2019, 2019 27th European Signal Processing Conference (EUSIPCO).
[5] Geoffrey E. Hinton,et al. A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.
[6] Radu Horaud,et al. Audio-Noise Power Spectral Density Estimation Using Long Short-Term Memory , 2019, IEEE Signal Processing Letters.
[7] Tatsuya Kawahara,et al. Bayesian Multichannel Speech Enhancement with a Deep Speech Prior , 2018, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
[8] Rémi Gribonval,et al. From Blind to Guided Audio Source Separation: How models and side information can improve the separation of sound , 2014, IEEE Signal Processing Magazine.
[9] David M. Blei,et al. Variational Inference: A Review for Statisticians , 2016, ArXiv.
[10] Björn W. Schuller,et al. Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR , 2015, LVA/ICA.
[11] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[12] James L. Massey,et al. Proper complex random processes with applications to information theory , 1993, IEEE Trans. Inf. Theory.
[13] Paris Smaragdis,et al. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[14] Jean-Francois Cardoso,et al. THE THREE EASY ROUTES TO INDEPENDENT COMPONENT ANALYSIS; CONTRASTS AND GEOMETRY , 2001 .
[15] Jesper Jensen,et al. An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[16] Emmanuel Vincent,et al. A General Flexible Framework for the Handling of Prior Information in Audio Source Separation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[17] Hirokazu Kameoka,et al. Supervised Determined Source Separation with Multichannel Variational Autoencoder , 2019, Neural Computation.
[18] D. Hunter,et al. A Tutorial on MM Algorithms , 2004 .
[19] Sridha Sridharan,et al. The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition , 2015, INTERSPEECH.
[20] Nancy Bertin,et al. Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.
[21] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[22] DeLiang Wang,et al. Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[23] Bruno Torrésani,et al. Sparsity and persistence: mixed norms provide simple signal models with dependent coefficients , 2009, Signal Image Video Process..
[24] Tim Salimans,et al. Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression , 2012, ArXiv.
[25] Tatsuya Kawahara,et al. Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Rémi Gribonval,et al. Non negative sparse representation for Wiener based source separation with a single sensor , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[27] Jérôme Idier,et al. Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.
[28] Juha Karhunen,et al. Approximate Riemannian Conjugate Gradient Learning for Fixed-Form Variational Bayes , 2010, J. Mach. Learn. Res..
[29] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .
[30] Tatsuya Kawahara,et al. Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[31] Pierre-Alexandre Mattei,et al. Refit your Encoder when New Data Comes by , 2018 .
[32] Bhiksha Raj,et al. Supervised and Semi-supervised Separation of Sounds from Single-Channel Mixtures , 2007, ICA.
[33] Philippe Garat,et al. Blind separation of mixture of independent sources through a quasi-maximum likelihood approach , 1997, IEEE Trans. Signal Process..
[34] Chong Wang,et al. Stochastic variational inference , 2012, J. Mach. Learn. Res..
[35] Paris Smaragdis,et al. A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Bonnie Kirkpatrick,et al. Supplementary Document , 2011 .
[37] Andries P. Hekstra,et al. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[38] Simon J. Godsill,et al. Sparse Regression with Structured Priors: Application to Audio Denoising , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[39] Keith Allman,et al. About the Companion Website , 2015 .
[40] Radu Horaud,et al. Semi-supervised Multichannel Speech Enhancement with Variational Autoencoders and Non-negative Matrix Factorization , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[41] Mark D. Plumbley,et al. Probabilistic Modeling Paradigms for Audio Source Separation , 2010 .
[42] Radu Horaud,et al. Speech Enhancement with Variational Autoencoders and Alpha-stable Distributions , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[43] Radu Horaud,et al. A VARIANCE MODELING FRAMEWORK BASED ON VARIATIONAL AUTOENCODERS FOR SPEECH ENHANCEMENT , 2018, 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP).
[44] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.
[45] Yoshua Bengio,et al. A Recurrent Latent Variable Model for Sequential Data , 2015, NIPS.
[46] Emmanuel Vincent,et al. A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders , 2019, INTERSPEECH.
[47] Xiaofei Li,et al. Multichannel Speech Enhancement Based On Time-Frequency Masking Using Subband Long Short-Term Memory , 2019, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).