Paper Information - Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks

Single-channel signal separation and deconvolution aims to separate and deconvolve the individual sources in a single-channel mixture. The problem is challenging because no prior knowledge of the mixing filters is available, so both the individual sources and the mixing filters must be estimated. In addition, a mixture may contain non-stationary noise that is unseen in the training set. We propose a synthesizing-decomposition (S-D) approach to solve the single-channel separation and deconvolution problem. In synthesizing, a generative model for sources is built using a generative adversarial network (GAN). In decomposition, both mixing filters and sources are optimized to minimize the reconstruction error of the mixture. The proposed S-D approach achieves a peak signal-to-noise ratio (PSNR) of 18.9 dB and 15.4 dB in image inpainting and completion, outperforming a baseline convolutional neural network, which achieves 15.3 dB and 12.2 dB, respectively. It also achieves a PSNR of 13.2 dB in source separation together with deconvolution, outperforming a convolutive non-negative matrix factorization (NMF) baseline of 10.1 dB.
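The decomposition step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation, and several pieces are assumptions made only for illustration: a fixed linear map `W` stands in for the trained GAN generator, the signals are 1-D, and alternating least squares replaces whatever optimizer the paper uses. The names `conv_matrix`, `reconstruct`, `decompose`, and `psnr` are hypothetical. The sketch shows only the shape of the objective: find latent codes `z_k` and mixing filters `h_k` that minimize the reconstruction error of the mixture.

```python
import numpy as np

def conv_matrix(h, n):
    """Return C such that C @ s == np.convolve(h, s) for any length-n signal s."""
    m = len(h)
    C = np.zeros((n + m - 1, n))
    for i in range(n):
        C[i:i + m, i] = h
    return C

def reconstruct(W, z, h):
    """Mixture model: sum over sources k of h_k convolved with G(z_k) = W @ z_k."""
    return sum(np.convolve(h_k, W @ z_k) for z_k, h_k in zip(z, h))

def decompose(x, W, num_sources, filter_len, steps=20, seed=0):
    """Alternately refit latent codes and filters to reduce ||x - reconstruction||^2."""
    rng = np.random.default_rng(seed)
    n, d = W.shape
    z = rng.standard_normal((num_sources, d))
    h = rng.standard_normal((num_sources, filter_len))
    losses = []
    for _ in range(steps):
        # Filters fixed: the mixture is linear in the stacked latent codes.
        A = np.hstack([conv_matrix(h_k, n) @ W for h_k in h])
        z = np.linalg.lstsq(A, x, rcond=None)[0].reshape(num_sources, d)
        # Codes fixed: convolution commutes, so the mixture is linear in the filters.
        B = np.hstack([conv_matrix(W @ z_k, filter_len) for z_k in z])
        h = np.linalg.lstsq(B, x, rcond=None)[0].reshape(num_sources, filter_len)
        losses.append(float(np.sum((x - reconstruct(W, z, h)) ** 2)))
    return z, h, losses

def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB; `peak` is the assumed maximum signal value."""
    mse = np.mean((np.asarray(reference) - np.asarray(estimate)) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)

# Synthetic mixture of two sources drawn from the stand-in generator.
rng = np.random.default_rng(1)
W = rng.standard_normal((32, 4))        # hypothetical "generator" weights
true_z = rng.standard_normal((2, 4))    # latent codes of the two sources
true_h = rng.standard_normal((2, 3))    # unknown mixing filters
x = reconstruct(W, true_z, true_h)

z_hat, h_hat, losses = decompose(x, W, num_sources=2, filter_len=3)
print("first loss:", losses[0], "last loss:", losses[-1])
```

Each least-squares update minimizes the reconstruction error over one block of variables, so the error cannot increase from one step to the next. In this toy setup both sources share one linear generator, so the individual sources are not uniquely identifiable; the example demonstrates the objective, not separation quality.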
