Synthesizing Efficient Controllers

In many situations, we are interested in controllers that implement a good trade-off between conflicting objectives, e.g., the speed of a car versus its fuel consumption, or the transmission rate of a wireless device versus its energy consumption. In both cases, we aim for a system that uses its resources efficiently. In this paper, we show how to automatically construct efficient controllers. We provide a specification framework for controllers in probabilistic environments and show how to synthesize implementations from these specifications. We achieve this by a reduction to Markov Decision Processes (MDPs) with a novel objective function. We compute optimal strategies for these MDPs using three different methods: linear programming, fractional linear programming, and policy iteration. We implemented and compared the three algorithms and integrated the fastest one into the model checker PRISM.
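To make the policy iteration step concrete, the following is a minimal sketch on a toy two-state MDP. It is not the paper's algorithm: the paper optimizes a novel objective function that the abstract does not spell out, so this sketch substitutes the standard discounted-reward objective. The MDP, the action names `stay` and `move`, the discount factor, and all function names are hypothetical and chosen only for illustration.

```python
# Policy iteration for a discounted-reward MDP (textbook variant, used here
# as a stand-in for the paper's objective, which the abstract does not define).
# mdp[state][action] is a list of (probability, next_state, reward) triples.

def q_value(mdp, values, state, action, gamma):
    """Expected discounted return of taking `action` in `state`."""
    return sum(p * (r + gamma * values[nxt]) for p, nxt, r in mdp[state][action])


def evaluate(mdp, policy, gamma, tol=1e-10):
    """Iterative policy evaluation: repeat in-place updates until they stabilize."""
    values = {s: 0.0 for s in mdp}
    while True:
        delta = 0.0
        for s in mdp:
            new = q_value(mdp, values, s, policy[s], gamma)
            delta = max(delta, abs(new - values[s]))
            values[s] = new
        if delta < tol:
            return values


def policy_iteration(mdp, gamma=0.9):
    """Alternate evaluation and greedy improvement until the policy is stable."""
    policy = {s: sorted(mdp[s])[0] for s in mdp}  # arbitrary initial policy
    while True:
        values = evaluate(mdp, policy, gamma)
        stable = True
        for s in mdp:
            best = max(sorted(mdp[s]),
                       key=lambda a: q_value(mdp, values, s, a, gamma))
            # Switch only on a strict improvement to avoid cycling on ties.
            if (q_value(mdp, values, s, best, gamma)
                    > q_value(mdp, values, s, policy[s], gamma) + 1e-9):
                policy[s] = best
                stable = False
        if stable:
            return policy, values


# Hypothetical toy model: state 1 pays more per step than state 0,
# and moving from state 0 to state 1 succeeds only half of the time.
toy_mdp = {
    0: {"stay": [(1.0, 0, 1.0)],
        "move": [(0.5, 1, 0.0), (0.5, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 2.0)],
        "move": [(1.0, 0, 0.0)]},
}

policy, values = policy_iteration(toy_mdp, gamma=0.9)
print(policy)
print(values)
```

In this toy model the resulting strategy moves out of state 0 and stays in state 1, because the larger long-run reward in state 1 outweighs the reward given up while moving. Handling the paper's actual objective would require a different evaluation step; only the evaluate-then-improve structure carries over.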
