论文信息 - Time Window Width Influence on Dynamic BPTT(h) Learning Algorithm Performances: Experimental Study

Time Window Width Influence on Dynamic BPTT(h) Learning Algorithm Performances: Experimental Study

The purpose of the research addressed in this paper is to study the influence of the time window width in dynamic truncated BackPropagation Through Time BPTT(h) learning algorithms. Statistical experiments based on the identification of a real biped robot balancing mechanism are carried out to raise the link between the window width and the stability, the speed and the accuracy of the learning. The time window width choice is shown to be crucial for the convergence speed of the learning process and the generalization ability of the network. Although, a particular attention is brought to a divergence problem (gradient blow up) observed with the assumption where the net parameters are constant along the window. The limit of this assumption is demonstrated and parameters evolution storage, used as a solution for this problem, is detailed.

[1] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.

[2] Barak A. Pearlmutter. Gradient calculations for dynamic recurrent neural networks: a survey , 1995, IEEE Trans. Neural Networks.

[3] Miomir Vukobratovic,et al. Zero-Moment Point - Thirty Five Years of its Life , 2004, Int. J. Humanoid Robotics.

[4] Garrison W. Cottrell,et al. Tau Net A neural network for modeling temporal variability , 1997, Neurocomputing.

[5] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[6] B. Mohamed,et al. Emulation of the Dynamic Effects of Human Torso During a Walking Gait , 2004 .

[7] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.

[8] Bhaskar D. Rao,et al. On-line learning algorithms for locally recurrent neural networks , 1999, IEEE Trans. Neural Networks.

[9] Fu-Sheng Tsung,et al. Modeling dynamical systems with recurrent neural networks , 1994 .