An Adaptive $Q$-Learning Algorithm Developed for Agent-Based Computational Modeling of Electricity Market

Balancing between exploration and exploitation with adaptation of the Q-learning (QL) parameters to the condition of dynamic uncertain environment has always been a significant subject of interest in the context of reinforcement learning. The peculiarities of the electricity market have provided such complex dynamic economic environment, and consequently have increased the requirement for advancement of the learning methods. In this economic system, the agent's market power plays a vital role in bidding decision-making problem. In order to improve the QL method, as main idea, adaptation of its parameters to the market power is proposed for making a good balance between exploration and exploitation. To implement this adaptation process, due to the fuzzy nature of human's decision-making process, a fuzzy system is designed to map each agent's market power into the QL parameters. Therefore, a fuzzy QL method is developed to model the power supplier's strategic bidding behavior in a computational electricity market. In the simulation framework, the QL algorithm selects the power supplier's bidding strategy according to the past experiences and the values of the parameters, which show the human's risk characteristic. The application of the proposed methodology for the power supplier in a multiarea power system shows the performance improvement in comparison to the QL with fixed parameters.

[1]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[2]  Amos Golan,et al.  Estimating Market Power and Strategies: Answers , 2007 .

[3]  Leigh Tesfatsion,et al.  Agent-Based Computational Economics: Growing Economies From the Bottom Up , 2002, Artificial Life.

[4]  Derek W. Bunn,et al.  Agent-based simulation-an application to the new electricity trading arrangements of England and Wales , 2001, IEEE Trans. Evol. Comput..

[5]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Li-Xin Wang,et al.  A Course In Fuzzy Systems and Control , 1996 .

[8]  Toshiyuki Sueyoshi,et al.  Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  S. Stoft Power System Economics: Designing Markets for Electricity , 2002 .

[10]  S. Stoft Power System Economics: Designing Markets for Electricity , 2002 .

[11]  Leigh Tesfatsion,et al.  Market power and efficiency in a computational electricity market with discriminatory double-auction pricing , 2001, IEEE Trans. Evol. Comput..

[12]  Isabel Praça,et al.  Mascem: A Multiagent System That Simulates Competitive Electricity Markets , 2003, IEEE Intell. Syst..

[13]  Abdelaziz Lahlou Multi-agent modelling of electricity markets , 2007 .

[14]  Toshiyuki Sueyoshi,et al.  Why Did California Electricity Crisis Occur? a Numerical Analysis Using Multiagent Intelligent Simulator Abstract—during the Summer (2000), Wholesale Electricity Fits of Competition Promised to Consumers. This Study Proposes a Use of a Multiagent Intelligent Simulator (mais) to Numerically Ex- Amine , 2022 .

[15]  Reda Alhajj,et al.  A novel approach to multiagent reinforcement learning: utilizing OLAP mining in the learning process , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[17]  Richard Green,et al.  A Review of the Monitoring of Market Power The Possible Roles of TSOs in Monitoring for Market Power Issues in Congested Transmission Systems , 2005 .

[18]  S. A. Soman,et al.  Application of Actor-Critic Learning Algorithm for Optimal Bidding Problem of a Genco , 2002, IEEE Power Engineering Review.

[19]  Venkata L. Raju Chinthalapati,et al.  Learning dynamic prices in MultiSeller electronic retail markets with price sensitive customers, stochastic demands, and inventory replenishments , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[20]  Daniel J. Veit,et al.  A Critical Survey of Agent-Based Wholesale Electricity Market Models , 2008 .

[21]  Damien Ernst,et al.  A comparison of Nash equilibria analysis and agent-based modelling for power markets , 2006 .

[22]  T. Das,et al.  A Reinforcement Learning Model to Assess Market Power Under Auction-Based Energy Pricing , 2007, IEEE Transactions on Power Systems.

[23]  George Q. Huang,et al.  Agent-based modeling of supply chains for distributed scheduling , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[24]  Fushuan Wen,et al.  Market power in generation markets , 2000 .

[25]  Jens Grossklags,et al.  Software agents and market (in) efficiency: a human trader experiment , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[26]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[27]  Reda Alhajj,et al.  Multiagent reinforcement learning using function approximation , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[28]  Amy R. Pritchett,et al.  Predicting Interactions Between Agents in Agent-Based Modeling and Simulation of Sociotechnical Systems , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[29]  Donald V. Bourcier,et al.  Locational Market Power Screening and Congestion Management: Experience and Suggestions , 2002, IEEE Power Engineering Review.

[30]  H. R. Mashhadi,et al.  Modeling the Supplier Agent's Risk Strategy based on Fuzzy Logic Combined with the Q-Learning Algorithm , 2006, 2006 International Conference on Computational Intelligence and Security.

[31]  Morteza Rahimiyan,et al.  Supplier's optimal bidding strategy in electricity pay-as-bid auction: Comparison of the Q-learning and a model-based approach , 2008 .

[32]  Shigeru Okuma,et al.  A Supplier Bidding Strategy Through Q-Learning Algorithm in Electricity Auction Markets , 2003 .