MidiMe: Personalizing a MusicVAE model with user data

Training a custom deep neural network model like Music Transformer [3], MusicVAE [4] or SketchRNN [2] from scratch requires significant amounts of data (millions of examples), substantial compute resources (specialized hardware like GPUs/TPUs), and expertise in hyperparameter tuning. Without sufficient data, models either fail to produce realistic output (underfitting) or memorize the training examples and cannot generalize to produce varied outputs (overfitting); it would be like trying to learn all of music theory from a single song.
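To make the underfitting/overfitting tradeoff concrete, here is a small, self-contained sketch (plain NumPy, not Magenta code) that uses polynomial degree as a stand-in for model capacity. All names and data in it are illustrative: a too-simple model cannot fit even the training set, while a too-flexible model fits the few training points exactly and generalizes poorly.

```python
import numpy as np

# Toy data: a smooth "true" function (sin) observed with a little noise,
# but only 8 training examples -- far too few for a high-capacity model.
rng = np.random.default_rng(0)
x_train = np.linspace(0, 3, 8)
y_train = np.sin(x_train) + rng.normal(0, 0.05, size=x_train.shape)
x_test = np.linspace(0, 3, 100)
y_test = np.sin(x_test)

def fit_and_errors(degree):
    """Fit a polynomial of the given degree; return (train MSE, test MSE)."""
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    return train_err, test_err

underfit = fit_and_errors(0)  # too simple: poor fit even on training data
goodfit = fit_and_errors(3)   # reasonable capacity for this function
overfit = fit_and_errors(7)   # degree 7 through 8 points: exact memorization
```

The degree-7 fit drives training error to essentially zero by passing through every noisy point, which is memorization rather than learning; the constant (degree-0) fit has high error everywhere. The analogous failure modes appear when training a large music model on too little data.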