This paper focuses on learning in autonomous agents under dynamic environments. Autonomous agent control has been dominated by two major Artificial Intelligence (AI) approaches, Planning-based and Behavior-based. We concentrate on Behavior-based control. Multiple behavior modules from which a Behavior-based agent is constituted may have conflicting outputs (actions) for various input situations. It is becoming increasingly difficult to 'hard-wire' or 'pre-fix' the actions. "Learning" to select an appropriate action emerges as a strong alternative to hard-wired schemes. A Behavior-based agent is constructed for the research study. The application task chosen for the agent is to learn to navigate in a real indoor environment avoiding static and moving obstacles. The strategy is as follows: learning to avoid obstacles is to be achieved by learning to select an appropriate action in any input situation. The learning algorithm, based on reinforcement learning principles, chooses with high probability, an appropriate action based on the performance statistics (activations and reinforcements) of the conflicting behaviors. Learning experiments conducted to observe the performance of the agent were encouraging.
[1]
Sebastian Thrun,et al.
Explanation-Based Neural Network Learning for Robot Control
,
1992,
NIPS.
[2]
Sridhar Mahadevan,et al.
Automatic Programming of Behavior-Based Robots Using Reinforcement Learning
,
1991,
Artif. Intell..
[3]
R. Larsen,et al.
An introduction to mathematical statistics and its applications (2nd edition) , by R. J. Larsen and M. L. Marx. Pp 630. £17·95. 1987. ISBN 13-487166-9 (Prentice-Hall)
,
1987,
The Mathematical Gazette.
[4]
Rodney A. Brooks,et al.
Learning to Coordinate Behaviors
,
1990,
AAAI.
[5]
Maja J. Mataric,et al.
Designing and Understanding Adaptive Group Behavior
,
1995,
Adapt. Behav..
[6]
P. Maes.
Modeling adaptive autonomous agents
,
1993
.
[7]
Rodney A. Brooks,et al.
A Robust Layered Control Syste For A Mobile Robot
,
2022
.
[8]
Ulrich Nehmzow,et al.
Increasing behavioral repertoire in a mobile robot
,
1993
.