Paper Information - The Efficient Learning of Multiple Task Sequences

The Efficient Learning of Multiple Task Sequences

I present a modular network architecture and a learning algorithm based on incremental dynamic programming that allows a single learning agent to learn to solve multiple Markovian decision tasks (MDTs) with significant transfer of learning across the tasks. I consider a class of MDTs, called composite tasks, formed by temporally concatenating a number of simpler, elemental MDTs. The architecture is trained on a set of composite and elemental MDTs. The temporal structure of a composite task is assumed to be unknown and the architecture learns to produce a temporal decomposition. It is shown that under certain conditions the solution of a composite MDT can be constructed by computationally inexpensive modifications of the solutions of its constituent elemental MDTs.

Satinder Singh
