Multiple fuzzy state-value functions for human evaluation through interactive trajectory planning of a partner robot

The purpose of this study is to develop partner robots that can obtain and accumulate human-friendly behaviors. To achieve this purpose, the entire architecture of the robot is designed, based on a concept of structured learning which emphasizes the importance of interactive learning of several modules through interaction with its environment. This paper deals with a trajectory planning method for generating hand-to-hand behaviors of a partner robot by using multiple fuzzy state-value functions, a self-organizing map, and an interactive genetic algorithm. A trajectory for the behavior is generated by an interactive genetic algorithm using human evaluation. In order to reduce human load, human evaluation is estimated by using the fuzzy state-value function. Furthermore, to cope with various situations, a self-organizing map is used for clustering a given task dependent on a human hand position. And then, a fuzzy state-value function is assigned to each output unit of the self-organizing map. The robot can easily obtain and accumulate human-friendly trajectories using a fuzzy state-value function and a knowledge database corresponding to the unit selected in the self-organizing map. Finally, multiple fuzzy state-value functions can estimate a human evaluation model for the hand-to-hand behaviors. Several experimental results show the effectiveness of the proposed method.

[1]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[2]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[3]  Rolf Pfeifer,et al.  Understanding intelligence , 2020, Inequality by Design.

[4]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[5]  Gilbert Syswerda,et al.  A Study of Reproduction in Generational and Steady State Genetic Algorithms , 1990, FOGA.

[6]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[7]  Jun Tani,et al.  Model-based learning for mobile robot navigation from the dynamical systems perspective , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[8]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[9]  Osamu Katai,et al.  Acquisition of a specialty in multi-agent learning: approach from learning classifier system , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[10]  Ronald C. Arkin,et al.  An Behavior-based Robotics , 1998 .

[11]  Zbigniew Michalewicz,et al.  Adaptive evolutionary planner/navigator for mobile robots , 1997, IEEE Trans. Evol. Comput..

[12]  Fumio Kojima,et al.  Fuzzy and Neural Computing for Communication of a Partner Robot , 2003, J. Multiple Valued Log. Soft Comput..

[13]  D M Wolpert,et al.  Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[14]  Seiji Yamada,et al.  Teacher's load and timing of teaching based on interactive evolutionary robotics , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[15]  R. Paul Robot manipulators : mathematics, programming, and control : the computer control of robot manipulators , 1981 .

[16]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[17]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[18]  Hideyuki Takagi,et al.  Interactive evolutionary computation: fusion of the capabilities of EC optimization and human evaluation , 2001, Proc. IEEE.

[19]  Shinichi Nakasuka,et al.  Robustness in organizational-learning oriented classifier system , 2002, Soft Comput..

[20]  Jean-Claude Latombe,et al.  Robot motion planning , 1970, The Kluwer international series in engineering and computer science.

[21]  Toshio Fukuda,et al.  An intelligent robotic system based on a fuzzy approach , 1999, Proc. IEEE.

[22]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[23]  John Canny,et al.  The complexity of robot motion planning , 1988 .

[24]  Toshio Fukuda,et al.  Trajectory Planning and Learning of A Redundant Manipulator with Structured Intelligence , 1998, J. Braz. Comput. Soc..

[25]  Inman Harvey,et al.  Explorations in Evolutionary Robotics , 1993, Adapt. Behav..

[26]  Stefano Nolfi,et al.  Evolutionary Robotics: The Biology, Intelligence, and Technology of Self-Organizing Machines , 2000 .

[27]  Fumio Kojima,et al.  Trajectory generation for human-friendly behavior of partner robot using fuzzy evaluating interactive genetic algorithm , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).