论文信息 - Fast Generation for Convolutional Autoregressive Models

Fast Generation for Convolutional Autoregressive Models

Convolutional autoregressive models have recently demonstrated state-of-the-art performance on a number of generation tasks. While fast, parallel training methods have been crucial for their success, generation is typically implemented in a naive fashion where redundant computations are unnecessarily repeated. This results in slow generation, making such models infeasible for production environments. In this work, we describe a method to speed up generation in convolutional autoregressive models. The key idea is to cache hidden states to avoid redundant computation. We apply our fast generation method to the Wavenet and PixelCNN++ models and achieve up to $21\times$ and $183\times$ speedups respectively.

[1] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.

[2] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.

[3] Koray Kavukcuoglu,et al. Pixel Recurrent Neural Networks , 2016, ICML.

[4] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[5] Alex Graves,et al. Neural Machine Translation in Linear Time , 2016, ArXiv.

[6] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[7] Francesco Visin,et al. A guide to convolution arithmetic for deep learning , 2016, ArXiv.

[8] Xi Chen,et al. PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications , 2017, ICLR.

[9] Mohammad Norouzi,et al. Pixel Recursive Super Resolution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[10] Alex Graves,et al. Video Pixel Networks , 2016, ICML.