CMAC Structure Optimization with Q-learning Approach and its Application

Comparing with other neural networks based models, CMAC is successfully applied on many nonlinear control systems because of its computational speed and learning ability. However, for high-dimensional input cases in real application, we often have to make our choice between learning accuracy and memory size. This paper discusses how both the number of layer and step quantization influence the approximation quality of CMAC. By experimental enquiry, it is shown that it is possible to decrease the memory size without losing the approximation quality by selecting the adaptive structural parameters. Based on Qlearning approach, the CMAC structural parameters can be optimized automatically without increasing the complexity of its structure. The choice of this optimized CMAC structure can achieve a tradeoff between the learning accuracy and finite memory size. At last, the application of this Q-learning based CMAC structure optimization approach on the joint angle tracking problem for biped robot is presented.

[1]  Ming-Feng Yeh,et al.  CMAC Study with Adaptive Quantization , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[2]  Daming Shi,et al.  Self-Organizing Gaussian Fuzzy CMAC with Truth Value Restriction , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[3]  Chih-Min Lin,et al.  Self-Organizing CMAC Control for a Class of MIMO Uncertain Nonlinear Systems , 2009, IEEE Transactions on Neural Networks.

[4]  C. Sabourin,et al.  OBSTACLE AVOIDANCE STRATEGY FOR BIPED ROBOT BASED ON FUZZY Q-LEARNING , 2008 .

[5]  Hiok Chai Quek,et al.  Hierarchically Clustered Adaptive Quantization CMAC and Its Learning Convergence , 2007, IEEE Transactions on Neural Networks.

[6]  Mo-Yuen Chow,et al.  On the training of a multi-resolution CMAC neural network , 1997, ISIE '97 Proceeding of the IEEE International Symposium on Industrial Electronics.

[7]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[8]  Daming Shi,et al.  Fuzzy CMAC With Incremental Bayesian Ying–Yang Learning and Dynamic Rule Construction , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  James S. Albus,et al.  New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)1 , 1975 .