Adaptive neurofuzzy control of a robotic gripper with on-line machine learning

Abstract Pre-programming complex robotic systems to operate in unstructured environments is extremely difficult because of the programmer’s inability to predict future operating conditions in the face of unforeseen environmental conditions, mechanical wear of parts, etc. The solution to this problem is for the robot controller to learn on-line about its own capabilities and limitations when interacting with its environment. At the present state of technology, this poses a challenge to existing machine learning methods. We study this problem using a simple two-fingered gripper which learns to grasp an object with appropriate force, without slip while minimising chances of damage to the object. Three machine learning methods are used to produce a neurofuzzy controller for the gripper. These are off-line supervised neurofuzzy learning and two on-line methods, namely unsupervised reinforcement learning and an unsupervised/supervised hybrid. With the two on-line methods, we demonstrate that the controller can learn through interaction with its environment to overcome simulated failure of its sensors. Further, the hybrid is shown to out perform reinforcement learning alone in terms of faster adaptation to the changing circumstances of sensor failure. The hybrid learning scheme allows us to make best use of such pre-labeled datasets as might exist and to remember effectively good control actions discovered by reinforcement learning.

[1]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[2]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[3]  James M. Tien,et al.  A knowledge-base generating hierarchical fuzzy-neural controller , 1997, IEEE Trans. Neural Networks.

[4]  Ella Bingham Reinforcement learning in neurofuzzy traffic signal control , 2001, Eur. J. Oper. Res..

[5]  Xia Hong,et al.  Adaptive Modelling, Estimation and Fusion from Data: A Neurofuzzy Approach , 2002, Advanced information processing.

[6]  Marcelo Simoes Introduction to Fuzzy Control , 2003 .

[7]  Antonio Bicchi,et al.  Hands for dexterous manipulation and robust grasping: a difficult road toward simplicity , 2000, IEEE Trans. Robotics Autom..

[8]  Clarence W. de Silva,et al.  Applications of fuzzy logic in the control of robotic manipulators , 1995 .

[9]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[10]  Martin Brown,et al.  Neurofuzzy adaptive modelling and control , 1994 .

[11]  Hamid R. Berenji,et al.  Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.

[12]  Chris J. Harris,et al.  Optimal object grasping using fuzzy logic , 2003 .

[13]  Richard S. Sutton,et al.  Reinforcement learning architectures for animats , 1991 .

[14]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[15]  Richard M. Crowder Sensors: Touch, Force, and Torque , 2000 .

[16]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[17]  Richard M. Crowder,et al.  Optimal object grasp using tactile sensors and fuzzy logic , 1999, Robotica.

[18]  John Fulcher,et al.  Computational Intelligence: An Introduction , 2008, Computational Intelligence: A Compendium.

[19]  Leslie Pack Kaelbling,et al.  Foundations of learning in autonomous agents , 1991, Robotics Auton. Syst..

[20]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[21]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[22]  Günhan Dündar,et al.  Hierarchical neuro-fuzzy call admission controller for ATM networks , 2001, Comput. Commun..

[23]  J. Nazuno Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999 , 2000 .

[24]  Ernest L. Hall,et al.  Handbook of Industrial Automation , 2000 .

[25]  Jean-Arcady Meyer,et al.  From Animals to Animats: Proceedings of The First International Conference on Simulation of Adaptive Behavior (Complex Adaptive Systems) , 1990 .

[26]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[27]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[28]  Spyros G. Tzafestas,et al.  Computational Intelligence Techniques for Short-Term Electric Load Forecasting , 2001, J. Intell. Robotic Syst..