Tangent: Automatic differentiation using source-code transformation for dynamically typed array programming

The need to efficiently calculate first- and higher-order derivatives of increasingly complex models expressed in Python has stressed or exceeded the capabilities of available tools. In this work, we explore techniques from the field of automatic differentiation (AD) that can give researchers expressive power, performance, and strong usability. These include source-code transformation (SCT), flexible gradient surgery, efficient in-place array operations, and higher-order derivatives. We implement and demonstrate these ideas in the Tangent software library for Python, the first AD framework for a dynamic language that uses SCT.
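
As a concrete illustration of the SCT approach described above, here is a minimal sketch of reverse-mode AD by source-code transformation. The hand-written derivative function stands in for the Python source an SCT tool such as Tangent would generate (via its `tangent.grad` entry point); it is representative of the technique, not Tangent's exact output.

```python
def f(x):
    return x * x + 3.0 * x

# Source-code transformation emits a new Python function that computes the
# gradient ahead of time, instead of recording a tape or building a graph at
# runtime. An SCT tool would derive source equivalent to:
def df(x, dy=1.0):
    # d/dx (x * x) = 2 * x and d/dx (3 * x) = 3; dy seeds the reverse pass
    return (2.0 * x + 3.0) * dy

assert df(2.0) == 7.0  # f'(x) = 2x + 3, so f'(2) = 7
```

Because the derivative exists as ordinary Python source, it can be read and edited directly (enabling the gradient surgery mentioned above) and fed back through the transformation to obtain higher-order derivatives.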
