Paper information - Multiple fuzzy state-value functions for human evaluation through interactive trajectory planning of a partner robot

The purpose of this study is to develop partner robots that can acquire and accumulate human-friendly behaviors. To this end, the entire architecture of the robot is designed around the concept of structured learning, which emphasizes the interactive learning of several modules through interaction with the environment. This paper presents a trajectory planning method for generating hand-to-hand behaviors of a partner robot using multiple fuzzy state-value functions, a self-organizing map, and an interactive genetic algorithm. A trajectory for the behavior is generated by the interactive genetic algorithm using human evaluation. To reduce the human load, the human evaluation is estimated by a fuzzy state-value function. Furthermore, to cope with various situations, a self-organizing map clusters a given task according to the human hand position, and a fuzzy state-value function is assigned to each output unit of the map. The robot can then easily acquire and accumulate human-friendly trajectories using the fuzzy state-value function and the knowledge database corresponding to the unit selected in the self-organizing map. Finally, the multiple fuzzy state-value functions together estimate a human evaluation model for the hand-to-hand behaviors. Several experimental results show the effectiveness of the proposed method.
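The unit-selection and evaluation-estimation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the state representation, membership functions, or update rule, so this sketch assumes a pre-trained map with fixed reference vectors, a single scalar trajectory feature in [0, 1], triangular membership functions, and a simple error-driven weight update. The names `select_unit`, `FuzzyValue`, and `MultiFuzzyValue` are hypothetical.

```python
import math


def select_unit(units, hand_pos):
    """Return the index of the map unit whose reference vector is nearest to hand_pos."""
    return min(range(len(units)), key=lambda i: math.dist(units[i], hand_pos))


class FuzzyValue:
    """Toy fuzzy estimator of a human score over one feature in [0, 1] (assumed form)."""

    def __init__(self, n_rules=5):
        # Evenly spaced triangular membership functions covering [0, 1].
        self.centers = [i / (n_rules - 1) for i in range(n_rules)]
        self.width = 1.0 / (n_rules - 1)
        self.weights = [0.0] * n_rules

    def memberships(self, x):
        return [max(0.0, 1.0 - abs(x - c) / self.width) for c in self.centers]

    def estimate(self, x):
        # Weighted average of rule outputs by membership degree.
        mu = self.memberships(x)
        total = sum(mu)
        if total == 0.0:
            return 0.0
        return sum(m * w for m, w in zip(mu, self.weights)) / total

    def update(self, x, human_score, lr=0.5):
        # Move each active rule's weight toward the observed human score.
        mu = self.memberships(x)
        err = human_score - self.estimate(x)
        for i, m in enumerate(mu):
            self.weights[i] += lr * m * err


class MultiFuzzyValue:
    """One FuzzyValue per map unit, selected by the human hand position."""

    def __init__(self, units):
        self.units = units
        self.values = [FuzzyValue() for _ in units]

    def estimate(self, hand_pos, feature):
        return self.values[select_unit(self.units, hand_pos)].estimate(feature)

    def update(self, hand_pos, feature, human_score, lr=0.5):
        self.values[select_unit(self.units, hand_pos)].update(feature, human_score, lr)


if __name__ == "__main__":
    model = MultiFuzzyValue(units=[(0.0, 0.0), (1.0, 1.0)])
    # A human scores a trajectory (feature 0.5) while the hand is near unit 1.
    model.update(hand_pos=(0.9, 0.8), feature=0.5, human_score=1.0, lr=1.0)
    print(model.estimate((0.9, 0.8), 0.5))  # estimate for the same situation
    print(model.estimate((0.1, 0.0), 0.5))  # a different unit is still untrained
```

In an interactive genetic algorithm, an estimator like this could score most candidate trajectories so that the human only needs to evaluate a few of them directly; how the paper combines the two is not detailed in the abstract.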

[1] Teuvo Kohonen, et al. Self-Organization and Associative Memory, 1988.

[2] Richard S. Sutton, et al. Reinforcement Learning, 1992, Handbook of Machine Learning.

[3] Rolf Pfeifer, et al. Understanding intelligence, 2020, Inequality by Design.

[4] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.

[5] Gilbert Syswerda, et al. A Study of Reproduction in Generational and Steady State Genetic Algorithms, 1990, FOGA.

[6] Teuvo Kohonen, et al. Self-organized formation of topologically correct feature maps, 2004, Biological Cybernetics.

[7] Jun Tani, et al. Model-based learning for mobile robot navigation from the dynamical systems perspective, 1996, IEEE Trans. Syst. Man Cybern. Part B.

[8] Andrew G. Barto, et al. Reinforcement learning, 1998.

[9] Osamu Katai, et al. Acquisition of a specialty in multi-agent learning: approach from learning classifier system, 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[10] Ronald C. Arkin, et al. Behavior-Based Robotics, 1998.

[11] Zbigniew Michalewicz, et al. Adaptive evolutionary planner/navigator for mobile robots, 1997, IEEE Trans. Evol. Comput.

[12] Fumio Kojima, et al. Fuzzy and Neural Computing for Communication of a Partner Robot, 2003, J. Multiple Valued Log. Soft Comput.

[13] D. M. Wolpert, et al. Multiple paired forward and inverse models for motor control, 1998, Neural Networks.

[14] Seiji Yamada, et al. Teacher's load and timing of teaching based on interactive evolutionary robotics, 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[15] R. Paul. Robot manipulators: mathematics, programming, and control: the computer control of robot manipulators, 1981.

[16] T. Kohonen. Self-organized formation of topographically correct feature maps, 1982.

[17] Lotfi A. Zadeh, et al. Fuzzy Sets, 1996, Inf. Control.

[18] Hideyuki Takagi, et al. Interactive evolutionary computation: fusion of the capabilities of EC optimization and human evaluation, 2001, Proc. IEEE.

[19] Shinichi Nakasuka, et al. Robustness in organizational-learning oriented classifier system, 2002, Soft Comput.

[20] Jean-Claude Latombe, et al. Robot motion planning, 1970, The Kluwer international series in engineering and computer science.

[21] Toshio Fukuda, et al. An intelligent robotic system based on a fuzzy approach, 1999, Proc. IEEE.

[22] Trevor Hastie, et al. The Elements of Statistical Learning, 2001.

[23] John Canny, et al. The complexity of robot motion planning, 1988.

[24] Toshio Fukuda, et al. Trajectory Planning and Learning of A Redundant Manipulator with Structured Intelligence, 1998, J. Braz. Comput. Soc.

[25] Inman Harvey, et al. Explorations in Evolutionary Robotics, 1993, Adapt. Behav.

[26] Stefano Nolfi, et al. Evolutionary Robotics: The Biology, Intelligence, and Technology of Self-Organizing Machines, 2000.

[27] Fumio Kojima, et al. Trajectory generation for human-friendly behavior of partner robot using fuzzy evaluating interactive genetic algorithm, 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).