Cost-Based Policy Mapping for Imitation
Imitation is a powerful approach to programming and autonomous learning in robot and computer systems. A central aspect of imitation is mapping observations to an executable control strategy, which matters most when the behavioral capabilities of the observed and imitating agents differ significantly. This paper presents an approach that addresses this problem by locally optimizing a cost function that combines the deviation from the observed state sequence with the cost of the actions required to perform the imitation. The result is a set of imitation strategies that the imitating agent can execute and that resemble the demonstrating agent's observed behavior as closely as possible. The performance of the approach is illustrated in a simulated multi-agent environment.
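To make the cost structure concrete, the following is a minimal sketch of the idea and not the paper's algorithm. The abstract does not specify the optimizer, so a greedy one-step search stands in for the local optimization. The grid world, the action set `ACTIONS`, the squared-distance deviation, the unit action cost, and the `weight` parameter are all assumptions introduced for illustration.

```python
from typing import List, Sequence, Tuple

State = Tuple[int, int]
Action = Tuple[int, int]

# Hypothetical imitator: it can stay put or move one cell along an axis,
# so it may be slower than the agent it observes.
ACTIONS: List[Action] = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]


def step(state: State, action: Action) -> State:
    """Apply an action to a grid state."""
    return (state[0] + action[0], state[1] + action[1])


def deviation(a: State, b: State) -> float:
    """Assumed deviation measure: squared Euclidean distance."""
    return float((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2)


def action_cost(action: Action) -> float:
    """Assumed action cost: staying is free, any move costs 1."""
    return 0.0 if action == (0, 0) else 1.0


def total_cost(start: State, actions: Sequence[Action],
               observed: Sequence[State], weight: float = 0.1) -> float:
    """Sum of state deviation plus weighted action cost along a plan."""
    state, cost = start, 0.0
    for action, target in zip(actions, observed):
        state = step(state, action)
        cost += deviation(state, target) + weight * action_cost(action)
    return cost


def greedy_imitation(start: State, observed: Sequence[State],
                     weight: float = 0.1) -> List[Action]:
    """Pick, at each time step, the action with the lowest one-step cost.

    This is a local (greedy) search, used here as a stand-in for the
    local optimization mentioned in the abstract.
    """
    state, plan = start, []
    for target in observed:
        best = min(
            ACTIONS,
            key=lambda a: deviation(step(state, a), target)
            + weight * action_cost(a),
        )
        plan.append(best)
        state = step(state, best)
    return plan


if __name__ == "__main__":
    # The demonstrator moves two cells per step; the imitator cannot.
    observed_states = [(2, 0), (4, 0), (4, 0)]
    plan = greedy_imitation((0, 0), observed_states)
    print(plan, total_cost((0, 0), plan, observed_states))
```

In this sketch the imitator cannot reproduce the demonstrator's two-cell moves, so it settles for the executable plan that trails the observed trajectory as closely as its own action set allows. Raising `weight` makes the imitator more reluctant to move when the gain in closeness is small.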