Multiplicative Interactions and Where to Find Them
暂无分享,去创建一个
Yee Whye Teh | Razvan Pascanu | Simon Osindero | Jonathan Schwarz | Wojciech Marian Czarnecki | Jacob Menick | Tim Harley | Siddhant M. Jayakumar | Jack W. Rae | Wojciech M. Czarnecki | Simon Osindero | Y. Teh | Razvan Pascanu | Tim Harley | Jacob Menick | Jonathan Schwarz
[1] Yee Whye Teh,et al. Attentive Neural Processes , 2019, ICLR.
[2] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[3] Kate Saenko,et al. Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering , 2015, ECCV.
[4] Wojciech Czarnecki,et al. Multi-task Deep Reinforcement Learning with PopArt , 2018, AAAI.
[5] Aaron C. Courville,et al. FiLM: Visual Reasoning with a General Conditioning Layer , 2017, AAAI.
[6] Razvan Pascanu,et al. Relational recurrent neural networks , 2018, NeurIPS.
[7] Yang Gao,et al. Compact Bilinear Pooling , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Jürgen Schmidhuber,et al. Recurrent Highway Networks , 2016, ICML.
[9] Yiming Yang,et al. Transformer-XL: Language Modeling with Longer-Term Dependency , 2018 .
[10] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[11] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[12] Geoffrey E. Hinton,et al. Generating Text with Recurrent Neural Networks , 2011, ICML.
[13] Ying Zhang,et al. On Multiplicative Integration with Recurrent Neural Networks , 2016, NIPS.
[14] Alexander M. Rush,et al. Avoiding Latent Variable Collapse With Generative Skip Models , 2018, AISTATS.
[15] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.
[16] Aaron C. Courville,et al. Learning Visual Reasoning Without Strong Priors , 2017, ICML 2017.
[17] Geoffrey E. Hinton,et al. Factored conditional restricted Boltzmann Machines for modeling motion style , 2009, ICML '09.
[18] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[19] Jiasen Lu,et al. Hierarchical Question-Image Co-Attention for Visual Question Answering , 2016, NIPS.
[20] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[22] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[23] Tamir Hazan,et al. High-Order Attention Models for Visual Question Answering , 2017, NIPS.
[24] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[25] T. Sejnowski. Higher‐order Boltzmann machines , 1987 .
[26] Yann Dauphin,et al. Pay Less Attention with Lightweight and Dynamic Convolutions , 2019, ICLR.
[27] Yann Dauphin,et al. Language Modeling with Gated Convolutional Networks , 2016, ICML.
[28] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[29] Jonathon Shlens,et al. A Learned Representation For Artistic Style , 2016, ICLR.
[30] Zhou Yu,et al. Multi-modal Factorized Bilinear Pooling with Co-attention Learning for Visual Question Answering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[31] Theodore Lim,et al. SMASH: One-Shot Model Architecture Search through HyperNetworks , 2017, ICLR.
[32] Vladlen Koltun,et al. Trellis Networks for Sequence Modeling , 2018, ICLR.
[33] Geoffrey E. Hinton,et al. Unsupervised Learning of Image Transformations , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[34] Peter Dayan,et al. Fast Parametric Learning with Activation Memorization , 2018, ICML.
[35] Gang Sun,et al. Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.