A Machine Learning Approach to Policy Optimization in System Dynamics Models

The paper proposes a policy design method for system dynamics models based on recurrent neural networks. A policy maker first directly creates an arbitrary desired reference mode and run the algorithm to search for the most appropriate model(s) automatically to fit it. In the searching process, both the system structure and its parameter values evolve simultaneously. Several experiments are conducted to evaluate our approach. The results show that our approach is as good as or better than other comparable methods. Copyright © 2011 John Wiley & Sons, Ltd.

[1]  V. Ramachandran,et al.  Hearing Colors, Tasting Shapes , 2003 .

[2]  E. F. Wolstenholme,et al.  A Case Study in System Dynamics Optimization , 1989 .

[3]  Sushil K. Sharma,et al.  Synthetic design of policy decisions in system dynamics models: A modal control theoretical approach , 1985 .

[4]  Rogelio Oliva,et al.  The greater whole: Towards a synthesis of system dynamics and soft systems methodology , 1998, Eur. J. Oper. Res..

[5]  Xin Yao,et al.  Evolving artificial neural networks , 1999, Proc. IEEE.

[6]  E. F. Wolstenholme,et al.  System dynamics and heuristic optimisation in defence analysis , 1987 .

[7]  Rogelio Oliva,et al.  Loop Eigenvalue Elasticity Analysis: Three Case Studies , 2006 .

[8]  Robert Hooke,et al.  `` Direct Search'' Solution of Numerical and Statistical Problems , 1961, JACM.

[9]  Janet K. Allen,et al.  Using response surfaces to improve the search for satisfactory behavior in system dynamics models , 2000 .

[10]  R G Coyle Simulation by repeated optimisation , 1999, J. Oper. Res. Soc..

[11]  Hannu Kivijärvi,et al.  Solving economic optimal control problems with system dynamics , 1986 .

[12]  Rogelio Oliva,et al.  Maps and models in system dynamics: a response to Coyle , 2001, System Dynamics Review.

[13]  R. E. Kalman,et al.  A New Approach to Linear Filtering and Prediction Problems , 2002 .

[14]  James R. Burns,et al.  Optimization Techniques Applied to the Forrester Model of the World , 1974, IEEE Trans. Syst. Man Cybern..

[15]  M. J. D. Powell,et al.  An efficient method for finding the minimum of a function of several variables without calculating derivatives , 1964, Comput. J..

[16]  R. G. Coyle,et al.  System Dynamics Modelling , 1996 .

[17]  Rogelio Oliva,et al.  Model calibration as a testing strategy for system dynamics models , 2003, Eur. J. Oper. Res..

[18]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[19]  Jim Duggan,et al.  Equation‐based policy optimization for agent‐oriented system dynamics models , 2008 .

[20]  W. Wonham On pole assignment in multi-input controllable linear systems , 1967 .

[21]  Jack P. C. Kleijnen,et al.  Sensitivity analysis and optimization of system dynamics models: Regression analysis and statistical design of experiments , 1995 .