论文信息 - DAWSON: A Domain Adaptive Few Shot Generation Framework - 字舞流文

DAWSON: A Domain Adaptive Few Shot Generation Framework

Training a Generative Adversarial Networks (GAN) for a new domain from scratch requires an enormous amount of training data and days of training time. To this end, we propose DAWSON, a Domain Adaptive FewShot Generation FrameworkFor GANs based on meta-learning. A major challenge of applying meta-learning GANs is to obtain gradients for the generator from evaluating it on development sets due to the likelihood-free nature of GANs. To address this challenge, we propose an alternative GAN training procedure that naturally combines the two-step training procedure of GANs and the two-step training procedure of meta-learning algorithms. DAWSON is a plug-and-play framework that supports a broad family of meta-learning algorithms and various GANs with architectural-variants. Based on DAWSON, We also propose MUSIC MATINEE, which is the first few-shot music generation model. Our experiments show that MUSIC MATINEE could quickly adapt to new domains with only tens of songs from the target domains. We also show that DAWSON can learn to generate new digits with only four samples in the MNIST dataset. We release source codes implementation of DAWSON in both PyTorch and Tensorflow, generated music samples on two genres and the lightning video.

Zixuan Liu | Weixin Liang | Can Liu | Weixin Liang | Zixuan Liu | Can Liu

[1] Yi-Hsuan Yang,et al. MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment , 2017, AAAI.

[2] Heiga Zen,et al. Sample Efficient Adaptive Text-to-Speech , 2018, ICLR.

[3] Louis Clouâtre,et al. FIGR: Few-shot Image Generation with Reptile , 2019, ArXiv.

[4] Jürgen Schmidhuber,et al. Finding temporal structure in music: blues improvisation with LSTM recurrent networks , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[5] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[6] Eleonora Iotti,et al. MetalGAN: Multi-Domain Label-Less Image Synthesis Using cGANs and Meta-Learning , 2020, Neural Networks.

[7] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[8] Thomas Paine,et al. Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions , 2017, ICLR.

[9] Andrew M. Dai,et al. Music Transformer: Generating Music with Long-Term Structure , 2018, ICLR.

[10] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.

[11] Xiaolin Hu,et al. A Hierarchical Recurrent Neural Network for Symbolic Melody Generation , 2017, IEEE Transactions on Cybernetics.

[12] Chris Donahue,et al. Synthesizing Audio with Generative Adversarial Networks , 2018, ArXiv.

[13] Andrew M. Dai,et al. MaskGAN: Better Text Generation via Filling in the ______ , 2018, ICLR.

[14] Frank Nielsen,et al. DeepBach: a Steerable Model for Bach Chorales Generation , 2016, ICML.

[15] Jörg Bornschein,et al. Variational Memory Addressing in Generative Models , 2017, NIPS.

[16] Douglas Eck,et al. Counterpoint by Convolution , 2019, ISMIR.

[17] Douglas Eck,et al. This time with feeling: learning expressive musical performance , 2018, Neural Computing and Applications.

[18] Zoubin Ghahramani,et al. Training generative neural networks via Maximum Mean Discrepancy optimization , 2015, UAI.

[19] Joshua B. Tenenbaum,et al. One shot learning of simple visual concepts , 2011, CogSci.

[20] Joshua Achiam,et al. On First-Order Meta-Learning Algorithms , 2018, ArXiv.

[21] Daan Wierstra,et al. One-Shot Generalization in Deep Generative Models , 2016, ICML.

[22] Colin Raffel,et al. A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music , 2018, ICML.

[23] Christopher K. I. Williams,et al. Harmonising Chorales by Probabilistic Inference , 2004, NIPS.

[24] Tor Lattimore,et al. Online Learning with Gated Linear Networks , 2017, ArXiv.

[25] Yi-Hsuan Yang,et al. Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation , 2018, ISMIR.

[26] Dmitry P. Vetrov,et al. Few-shot Generative Modelling with Generative Matching Networks , 2018, AISTATS.

[27] Xiaohui Xie,et al. Adversarial Deep Structural Networks for Mammographic Mass Segmentation , 2016, bioRxiv.