暂无分享,去创建一个
Ilya Sutskever | Alec Radford | Heewoo Jun | Prafulla Dhariwal | Christine Payne | Jong Wook Kim | Jong Wook Kim | Alec Radford | Prafulla Dhariwal | Ilya Sutskever | Heewoo Jun | Christine Payne | I. Sutskever
[1] Leonard Isaacson,et al. Musical composition with a high-speed digital computer , 1957 .
[2] James Anderson Moorer. Music and computer composition , 1972, CACM.
[3] Dennis H. Klatt,et al. Software for a cascade/parallel formant synthesizer , 1980 .
[4] Jeff Pressing,et al. Nonlinear Maps as Generators of Musical Design , 1988 .
[5] Peter Beyls,et al. The Musical Universe of Cellular Automata , 1989, ICMC.
[6] Alan W. Black,et al. Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[7] Heiga Zen,et al. An HMM-based singing voice synthesis system , 2006, INTERSPEECH.
[8] Heiga Zen,et al. Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[9] J. Bonada,et al. Synthesis of the Singing Voice by Performance Sampling and Spectral Models , 2007, IEEE Signal Processing Magazine.
[10] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.
[11] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[12] Nicolas Sturmel,et al. SIGNAL RECONSTRUCTION FROM STFT MAGNITUDE : A STATE OF THE ART , 2011 .
[13] Eduardo Reck Miranda,et al. Constraint programming systems for modeling music theories and composition , 2011, CSUR.
[14] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[15] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[16] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[17] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[18] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.
[19] Yoshua Bengio,et al. NICE: Non-linear Independent Components Estimation , 2014, ICLR.
[20] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[21] Hugo Larochelle,et al. Neural Autoregressive Distribution Estimation , 2016, J. Mach. Learn. Res..
[22] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[23] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[24] Samy Bengio,et al. Density estimation using Real NVP , 2016, ICLR.
[25] Yi-Hsuan Yang,et al. MidiNet: A Convolutional Generative Adversarial Network for Symbolic-Domain Music Generation , 2017, ISMIR.
[26] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[27] Ian J. Goodfellow,et al. NIPS 2016 Tutorial: Generative Adversarial Networks , 2016, ArXiv.
[28] Iain Murray,et al. Masked Autoregressive Flow for Density Estimation , 2017, NIPS.
[29] Sercan Ömer Arik,et al. Deep Voice 2: Multi-Speaker Neural Text-to-Speech , 2017, NIPS.
[30] Adam Coates,et al. Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.
[31] Max Welling,et al. Improved Variational Inference with Inverse Autoregressive Flow , 2016, NIPS 2016.
[32] Sercan Ömer Arik,et al. Deep Voice 3: 2000-Speaker Neural Text-to-Speech , 2017, ICLR 2018.
[33] Douglas Eck,et al. Counterpoint by Convolution , 2019, ISMIR.
[34] Karen Simonyan,et al. Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders , 2017, ICML.
[35] Frank Nielsen,et al. DeepBach: a Steerable Model for Bach Chorales Generation , 2016, ICML.
[36] Jordi Bonada,et al. A Neural Parametric Singing Synthesizer , 2017, INTERSPEECH.
[37] Yoshua Bengio,et al. SampleRNN: An Unconditional End-to-End Neural Audio Generation Model , 2016, ICLR.
[38] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.
[39] Zhuowen Tu,et al. Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Navdeep Jaitly,et al. Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[41] Yuxuan Wang,et al. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis , 2018, ICML.
[42] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.
[43] Colin Raffel,et al. A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music , 2018, ICML.
[44] Karen Simonyan,et al. The challenge of realistic music generation: modelling raw audio at scale , 2018, NeurIPS.
[45] Yutaka Matsuo,et al. Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder , 2018, INTERSPEECH.
[46] Lior Wolf,et al. VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop , 2017, ICLR.
[47] Yoshihiko Nankaku,et al. Recent Development of the DNN-based Singing Voice Synthesis System — Sinsy , 2018, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
[48] Patrick Nguyen,et al. Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis , 2018, NeurIPS.
[49] Sercan Ömer Arik,et al. Neural Voice Cloning with a Few Samples , 2018, NeurIPS.
[50] Roger Wattenhofer,et al. MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer , 2018, ISMIR.
[51] Erich Elsen,et al. Efficient Neural Audio Synthesis , 2018, ICML.
[52] Yi-Hsuan Yang,et al. MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment , 2017, AAAI.
[53] Mike Lewis,et al. MelNet: A Generative Model for Audio in the Frequency Domain , 2019, ArXiv.
[54] Gregory Diamos,et al. Fast Spectrogram Inversion Using Multi-Head Convolutional Neural Networks , 2018, IEEE Signal Processing Letters.
[55] Ilya Sutskever,et al. Generating Long Sequences with Sparse Transformers , 2019, ArXiv.
[56] Ryan Prenger,et al. Waveglow: A Flow-based Generative Network for Speech Synthesis , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[57] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.
[58] Garrison W. Cottrell,et al. LakhNES: Improving Multi-instrumental Music Generation with Cross-domain Pre-training , 2019, ISMIR.
[59] Tim Salimans,et al. Axial Attention in Multidimensional Transformers , 2019, ArXiv.
[60] Yoshua Bengio,et al. MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis , 2019, NeurIPS.
[61] Douglas Eck,et al. Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset , 2018, ICLR.
[62] Wei Ping,et al. ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech , 2018, ICLR.
[63] Ali Razavi,et al. Generating Diverse High-Fidelity Images with VQ-VAE-2 , 2019, NeurIPS.
[64] Kumar Krishna Agrawal,et al. GANSynth: Adversarial Neural Audio Synthesis , 2019, ICLR.
[65] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .
[66] Tengyu Ma,et al. Fixup Initialization: Residual Learning Without Normalization , 2019, ICLR.
[67] Haizhou Li,et al. Automatic Lyrics Transcription in Polyphonic Music: Does Background Music Help? , 2019, ArXiv.
[68] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.
[69] Jakub W. Pachocki,et al. Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.
[70] Quoc V. Le,et al. GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism , 2018, ArXiv.
[71] Romain Hennequin,et al. SPLEETER: A FAST AND STATE-OF-THE ART MUSIC SOURCE SEPARATION TOOL WITH PRE-TRAINED MODELS , 2019 .
[72] Jong Wook Kim,et al. Neural Music Synthesis for Flexible Timbre Control , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[73] Heiga Zen,et al. Hierarchical Generative Modeling for Controllable Speech Synthesis , 2018, ICLR.
[74] Andrew M. Dai,et al. Music Transformer: Generating Music with Long-Term Structure , 2018, ICLR.
[75] Karen Simonyan,et al. Hierarchical Autoregressive Image Models with Auxiliary Decoders , 2019, ArXiv.
[76] Xiaolin Hu,et al. A Hierarchical Recurrent Neural Network for Symbolic Melody Generation , 2017, IEEE Transactions on Cybernetics.
[77] Ryuichi Yamamoto,et al. Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).