A learning approach to enhancing machine reconfiguration decision-making games in a heterarchical manufacturing environment

Business process management (BPM) emerges as a promising guiding principle and technology for integrating existing assets and future deployments. BPM provides mechanisms to transform the behaviours of disparate and heterogeneous systems into standard and interoperable business processes, aimed at effectively facilitating the conduct of factory system integration. On the shop floor, BPM requires machine-level controllers to be architected in a heterarchical fashion. Past work by the authors introduced a heuristic non-cooperative game theoretic planning technique for an autonomous machine controller to frame a decision about an impending reconfiguration (i.e. setup change) in a heterarchical manufacturing environment. The work described in this paper extends the authors’ previous work by employing a reinforcement learning approach for specifying the payoffs in reconfiguration games through capturing the effects of a sequence of reconfiguration decisions. Consequently, the controller can autonomously learn the long-term implications of decisions, and ultimately improve its decision-making process in manufacturing.

[1]  John N. Tsitsiklis,et al.  Asynchronous Stochastic Approximation and Q-Learning , 1994, Machine Learning.

[2]  Robin Qiu Manufacturing grid: a next generation manufacturing model , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[3]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[4]  J. A. Harding *,et al.  An enterprise modeling and integration framework based on knowledge discovery and data mining , 2005 .

[5]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[6]  Y. Tang *,et al.  Integrated design approach for virtual production line-based reconfigurable manufacturing systems , 2004 .

[7]  Jukka Ranta,et al.  Economics and success factors of flexible manufacturing systems: The conventional explanation revisited , 1990 .

[8]  S. Joshi,et al.  Set-up games: a heuristic game-theoretic approach to set-up decisions for heterarchical manufacturing systems , 2004 .

[9]  David Naso,et al.  A genetic approach for adaptive multiagent control in heterarchical manufacturing systems , 2003, IEEE Trans. Syst. Man Cybern. Part A.

[10]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[11]  Richard S. Sutton,et al.  Planning by Incremental Dynamic Programming , 1991, ML.

[12]  Tuomas Sandholm,et al.  On Multiagent Q-Learning in a Semi-Competitive Domain , 1995, Adaption and Learning in Multi-Agent Systems.

[13]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..