Learning Latent Multiscale Structure Using Recurrent Neural Networks

In this paper, we introduce a hierarchical recurrent neural network architecture that adaptively captures the underlying temporal dependencies in sequences at different timescales, without using explicit boundary information. In experiments on character-level language modelling, we demonstrate that the proposed model performs significantly better than previously proposed models, achieving state-of-the-art results.
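
To make the idea concrete, here is a minimal sketch (not the authors' exact equations) of a two-layer recurrent network in which a learned, binarized boundary gate decides when the slower upper layer updates, so the hierarchy of timescales is discovered from data rather than supplied as explicit segment boundaries. The class and variable names (TwoTimescaleRNN, boundary_gate) are illustrative assumptions, not taken from the paper.

```python
# Hedged sketch: a fast layer updates every step; a slow layer updates only when a
# learned binary boundary fires. The hard threshold uses a straight-through
# gradient so the boundary decision stays trainable.
import torch
import torch.nn as nn


class StraightThroughBinarize(torch.autograd.Function):
    """Hard-threshold in the forward pass, identity gradient in the backward pass."""

    @staticmethod
    def forward(ctx, x):
        return (x > 0.5).float()

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output


class TwoTimescaleRNN(nn.Module):
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.fast = nn.GRUCell(input_size, hidden_size)    # updates at every time step
        self.slow = nn.GRUCell(hidden_size, hidden_size)   # updates only at detected boundaries
        self.boundary_gate = nn.Linear(hidden_size, 1)     # predicts a boundary from the fast state

    def forward(self, x):  # x: (seq_len, batch, input_size)
        seq_len, batch, _ = x.shape
        h_fast = x.new_zeros(batch, self.fast.hidden_size)
        h_slow = x.new_zeros(batch, self.slow.hidden_size)
        outputs = []
        for t in range(seq_len):
            h_fast = self.fast(x[t], h_fast)
            # Binary decision: does this step end a latent segment?
            z = StraightThroughBinarize.apply(torch.sigmoid(self.boundary_gate(h_fast)))
            # The slow layer integrates the fast summary only when a boundary fires.
            h_slow = z * self.slow(h_fast, h_slow) + (1.0 - z) * h_slow
            outputs.append(h_fast + h_slow)  # simple mix of both timescales
        return torch.stack(outputs)  # (seq_len, batch, hidden_size)


if __name__ == "__main__":
    model = TwoTimescaleRNN(input_size=16, hidden_size=32)
    y = model(torch.randn(20, 4, 16))
    print(y.shape)  # torch.Size([20, 4, 32])
```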
