Generative adversarial networks for non-negative matrix factorization in temporal psycho-visual modulation

Abstract The image factorization problem is the key challenge in Temporal Psycho-Visual Modulation (TPVM). In this paper, we present an end-to-end learned model for image-based non-negative matrix factorization. We decompose a set of images into a small number of image bases which can be used to reconstruct all the images by linearly combining the bases. During the process, the image bases, as well as their weights for the linear combination, are unknown. The method is based on conditional GAN and a variational sample disturber. Traditional NMF methods suffer from slow computational speed and poor generalization ability. A deep neural network shows potential in these two aspects. We conduct adversarial training in our model to generate better bases for image restoration. Our method outperforms other CNN based methods on several public datasets. Compared to traditional NMF algorithms, our model generates image bases that preserve details. Our model has advantages in speed as well as generalization ability.

[1]  Xiaokang Yang,et al.  Evaluation of Different Algorithms of Nonnegative Matrix Factorization in Temporal Psychovisual Modulation , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[3]  Xiangxiang Zhu,et al.  Improved self-paced learning framework for nonnegative matrix factorization , 2017, Pattern Recognit. Lett..

[4]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[6]  Yin Zhang,et al.  Accelerating the Lee-Seung Algorithm for Nonnegative Matrix Factorization , 2005 .

[7]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[8]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[9]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yann LeCun,et al.  Deep multi-scale video prediction beyond mean square error , 2015, ICLR.

[11]  Haesun Park,et al.  Fast Nonnegative Matrix Factorization: An Active-Set-Like Method and Comparisons , 2011, SIAM J. Sci. Comput..

[12]  Xiaofeng Tao,et al.  Transient attributes for high-level understanding and editing of outdoor scenes , 2014, ACM Trans. Graph..

[13]  Xin Yao,et al.  Evolutionary Generative Adversarial Networks , 2018, IEEE Transactions on Evolutionary Computation.

[14]  Jiri Matas,et al.  DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Tao Mei,et al.  DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Nicolas Gillis,et al.  Accelerated Multiplicative Updates and Hierarchical ALS Algorithms for Nonnegative Matrix Factorization , 2011, Neural Computation.

[17]  Léon Bottou,et al.  Wasserstein GAN , 2017, ArXiv.

[18]  Andrzej Cichocki,et al.  Fast Local Algorithms for Large Scale Nonnegative Matrix and Tensor Factorizations , 2009, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[19]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Xiaolin Wu,et al.  Temporal Psychovisual Modulation: A New Paradigm of Information Display [Exploratory DSP] , 2012, IEEE Signal Processing Magazine.

[21]  Dacheng Tao,et al.  Attention-GAN for Object Transfiguration in Wild Images , 2018, ECCV.

[22]  Guangtao Zhai,et al.  Factorization Algorithms for Temporal Psychovisual Modulation Display , 2016, IEEE Transactions on Multimedia.