Neurocontroller alternatives for "fuzzy" ball-and-beam systems with nonuniform nonlinear friction

The ball-and-beam problem is a benchmark for testing control algorithms. In the World Congress on Neural Networks, 1994, Prof. L. Zadeh proposed a twist to the problem, which, he suggested, would require a fuzzy logic controller. This experiment uses a beam, partially covered with a sticky substance, increasing the difficulty of predicting the ball's motion. We complicated this problem even more by not using any information concerning the ball's velocity. Although it is common to use the first differences of the ball's consecutive positions as a measure of velocity and explicit input to the controller, we preferred to exploit recurrent neural networks, inputting only consecutive positions instead. We have used truncated backpropagation through time with the node-decoupled extended Kalman filter (NDEKF) algorithm to update the weights in the networks. Our best neurocontroller uses a form of approximate dynamic programming called an adaptive critic design. A hierarchy of such designs exists. Our system uses dual heuristic programming (DHP), an upper-level design. To our best knowledge, our results are the first use of DHP to control a physical system. It is also the first system we know of to respond to Zadeh's challenge. We do not claim this neural network control algorithm is the best approach to this problem, nor do we claim it is better than a fuzzy controller. It is instead a contribution to the scientific dialogue about the boundary between the two overlapping disciplines.

[1]  Michael I. Jordan Supervised learning and systems with excess degrees of freedom , 1988 .

[2]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[3]  Richard S. Sutton,et al.  Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[4]  B. Womack,et al.  Adaptive Control Using Neural Networks , 1991, 1991 American Control Conference.

[5]  D.R. Hush,et al.  Progress in supervised neural networks , 1993, IEEE Signal Processing Magazine.

[6]  Lee A. Feldkamp,et al.  Neurocontrol of nonlinear dynamical systems with Kalman filter trained recurrent networks , 1994, IEEE Trans. Neural Networks.

[7]  G. V. Puskorius,et al.  Truncated backpropagation through time and Kalman filter training for neurocontrol , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[8]  Ronald J. Williams,et al.  Gradient-based learning algorithms for recurrent networks and their computational complexity , 1995 .

[9]  Richard S. Sutton,et al.  A Menu of Designs for Reinforcement Learning Over Time , 1995 .

[10]  Hua Li,et al.  Real Time Fuzzy Logic Controller for Balancing a Beam-and-Ball System , 1995 .

[11]  Yuhong Jiang,et al.  Application of neural networks for real time control of a ball-beam system , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[12]  Roberto A. Santiago,et al.  Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.

[13]  Lotfi A. Zadeh,et al.  Fuzzy Control: Issues, Contentions and Perspectives , 1996 .

[14]  Donald C. Wunsch,et al.  Neurocontrollers for Ball-and-Beam Systems , 1996 .

[15]  Mohan M. Trivedi,et al.  Neural integrated fuzzy controller (NiF-T) and real-time implementation of a ball balancing beam (BBB) , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[16]  Richard D. Braatz,et al.  On the "Identification and control of dynamical systems using neural networks" , 1997, IEEE Trans. Neural Networks.

[17]  Donald C. Wunsch,et al.  Adaptive critic designs and their applications , 1997 .

[18]  G. V. Puskorius,et al.  A signal processing framework based on dynamic neural networks with application to problems in adaptation, filtering, and classification , 1998, Proc. IEEE.