Prediction-Based Multi-Agent Reinforcement Learning in Inherently Non-Stationary Environments

Multi-agent reinforcement learning (MARL) is a widely researched technique for decentralised control in complex large-scale autonomous systems. Such systems often operate in environments that are continuously evolving and where agents’ actions are non-deterministic, so called inherently non-stationary environments. When there are inconsistent results for agents acting on such an environment, learning and adapting is challenging. In this article, we propose P-MARL, an approach that integrates prediction and pattern change detection abilities into MARL and thus minimises the effect of non-stationarity in the environment. The environment is modelled as a time-series, with future estimates provided using prediction techniques. Learning is based on the predicted environment behaviour, with agents employing this knowledge to improve their performance in realtime. We illustrate P-MARL’s performance in a real-world smart grid scenario, where the environment is heavily influenced by non-stationary power demand patterns from residential consumers. We evaluate P-MARL in three different situations, where agents’ action decisions are independent, simultaneous, and sequential. Results show that all methods outperform traditional MARL, with sequential P-MARL achieving best results.

[1]  Peter Whittle,et al.  Hypothesis Testing in Time Series Analysis. , 1951 .

[2]  P. Whittle Hypothesis testing in time series analysis , 1954 .

[3]  Peter R. Winters,et al.  Forecasting Sales by Exponentially Weighted Moving Averages , 1960 .

[4]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[5]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[6]  Richard M. Karp,et al.  Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[7]  Michael Athans,et al.  Survey of decentralized control methods for large scale systems , 1978 .

[8]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[9]  W. Cleveland LOWESS: A Program for Smoothing Scatterplots by Robust Locally Weighted Regression , 1981 .

[10]  K. Stout Cumulative Sum Charts , 1985 .

[11]  David E. Goldberg,et al.  Nonstationary Function Optimization Using Genetic Algorithms with Dominance and Diploidy , 1987, ICGA.

[12]  G. Gross,et al.  Short-term load forecasting , 1987, Proceedings of the IEEE.

[13]  Scott E. Fahlman,et al.  An empirical study of learning speed in back-propagation networks , 1988 .

[14]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[15]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[16]  Irma J. Terpenning,et al.  STL : A Seasonal-Trend Decomposition Procedure Based on Loess , 1990 .

[17]  Helen G. Cobb,et al.  An Investigation into the Use of Hypermutation as an Adaptive Operator in Genetic Algorithms Having Continuous, Time-Dependent Nonstationary Environments , 1990 .

[18]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[19]  Sebastian Thrun,et al.  Active Exploration in Dynamic Environments , 1991, NIPS.

[20]  Robert J. Marks,et al.  Electric load forecasting using an artificial neural network , 1991 .

[21]  D. Sofge THE ROLE OF EXPLORATION IN LEARNING CONTROL , 1992 .

[22]  Martin Brown,et al.  Neurofuzzy adaptive modelling and control , 1994 .

[23]  Paul J. Werbos,et al.  The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting , 1994 .

[24]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[25]  Mark Humphrys W-learning: Competition among selfish Q-learners , 1995 .

[26]  Moshe Tennenholtz,et al.  Adaptive Load Balancing: A Study in Multi-Agent Learning , 1994, J. Artif. Intell. Res..

[27]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[28]  Marek Kisiel-Dorohinicki,et al.  The Application of Evolution Process in Multi-Agent World to the Prediction System , 1996 .

[29]  Dominic Maratukulam,et al.  ANNSTLF-a neural-network-based electric load forecasting system , 1997, IEEE Trans. Neural Networks.

[30]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[31]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[32]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[33]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[34]  Zbigniew Michalewicz,et al.  Searching for optima in non-stationary environments , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[35]  Turner,et al.  A realizable renewable energy future , 1999, Science.

[36]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[37]  Martin A. Riedmiller,et al.  Reinforcement Learning for Cooperating and Communicating Reactive Agents in Electrical Power Grids , 2000, Balancing Reactivity and Social Deliberation in Multi-Agent Systems.

[38]  Manuela Veloso,et al.  An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .

[39]  Marco Wiering,et al.  Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .

[40]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[41]  Michael L. Littman,et al.  Friend-or-Foe Q-learning in General-Sum Games , 2001, ICML.

[42]  Bruce H. Krogh,et al.  Distributed model predictive control , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[43]  Carlos E. Pedreira,et al.  Neural networks for short-term load forecasting: a review and evaluation , 2001 .

[44]  Dit-Yan Yeung,et al.  Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making , 2001, Sequence Learning.

[45]  D. O A L O N S O,et al.  Learning in multi-agent systems , 2002 .

[46]  Hesham K. Alfares,et al.  Electric load forecasting: Literature survey and classification of methods , 2002, Int. J. Syst. Sci..

[47]  Mitsuo Kawato,et al.  Multiple Model-Based Reinforcement Learning , 2002, Neural Computation.

[48]  Peter J. Bentley,et al.  Towards an artificial immune system for network intrusion detection: an investigation of dynamic clonal selection , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[49]  Marko Bacic,et al.  Model predictive control , 2003 .

[50]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[51]  Yoav Shoham,et al.  Multi-Agent Reinforcement Learning:a critical survey , 2003 .

[52]  Michael P. Wellman,et al.  Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..

[53]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[54]  Darren M. Chitty,et al.  A Hybrid Ant Colony Optimisation Technique for Dynamic Vehicle Routing , 2004, GECCO.

[55]  Q. Henry Wu,et al.  Multi-agent learning for routing control within an Internet environment , 2004, Eng. Appl. Artif. Intell..

[56]  Nicholas R. Jennings,et al.  A Roadmap of Agent Research and Development , 2004, Autonomous Agents and Multi-Agent Systems.

[57]  Jacques Ferber,et al.  Environments for Multiagent Systems State-of-the-Art and Research Challenges , 2004, E4MAS.

[58]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[59]  Jeffrey S. Rosenschein,et al.  Best-response multiagent learning in non-stationary environments , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[60]  H. Fujita,et al.  A multi-agent approach to distribution system restoration , 2004, The 2004 47th Midwest Symposium on Circuits and Systems, 2004. MWSCAS '04..

[61]  V. Lo Brano,et al.  Forecasting daily urban electric load profiles using artificial neural networks , 2004 .

[62]  Gerhard Widmer,et al.  Learning in the presence of concept drift and hidden contexts , 2004, Machine Learning.

[63]  Franziska Klügl-Frohnmeyer,et al.  About the Role of the Environment in Multi-agent Simulations , 2004, E4MAS.

[64]  Kristina Lerman,et al.  Resource allocation in the grid using reinforcement learning , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[65]  B.F. Wollenberg,et al.  Toward a smart grid: power delivery for the 21st century , 2005, IEEE Power and Energy Magazine.

[66]  I. Kamwa,et al.  Causes of the 2003 major grid blackouts in North America and Europe, and recommended means to improve system dynamic performance , 2005, IEEE Transactions on Power Systems.

[67]  Mehdi Khosrow-Pour Encyclopedia of Information Science and Technology (5 Volumes) , 2005 .

[68]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[69]  B. De Moor,et al.  Short-term load forecasting, profile identification, and customer segmentation: a methodology based on periodic time series , 2005, IEEE Transactions on Power Systems.

[70]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[71]  Dina Q. Goldin,et al.  Indirect Interaction in Environments for Multi-agent Systems , 2005, E4MAS.

[72]  B. Chaib-draa,et al.  Multiagent Q-Learning : Preliminary Study on Dominance between the Nash and Stackelberg Equilibriums , 2005 .

[73]  Paulo Martins Engel,et al.  Dealing with non-stationary environments using context detection , 2006, ICML.

[74]  Douglas C. Schmidt,et al.  Ultra-Large-Scale Systems: The Software Challenge of the Future , 2006 .

[75]  David Wallace,et al.  Dynamic multi-objective optimization with evolutionary algorithms: a forward-looking approach , 2006, GECCO.

[76]  Jiri Ocenasek,et al.  Bayesian Optimization Algorithms for Dynamic Problems , 2006, EvoWorkshops.

[77]  Johann Dréo,et al.  An ant colony algorithm aimed at dynamic continuous optimization , 2006, Appl. Math. Comput..

[78]  Rajarshi Das,et al.  A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation , 2006, 2006 IEEE International Conference on Autonomic Computing.

[79]  A.L. Dimeas,et al.  Agent based control of Virtual Power Plants , 2007, 2007 International Conference on Intelligent Systems Applications to Power Systems.

[80]  N. Amjady,et al.  Short-Term Bus Load Forecasting of Power Systems by a New Hybrid Method , 2007, IEEE Transactions on Power Systems.

[81]  P. McSharry,et al.  Short-Term Load Forecasting Methods: An Evaluation Based on European Data , 2007, IEEE Transactions on Power Systems.

[82]  S.D.J. McArthur,et al.  Multi-Agent Systems for Power Engineering Applications—Part I: Concepts, Approaches, and Technical Challenges , 2007, IEEE Transactions on Power Systems.

[83]  Sanyou Zeng,et al.  Orthogonal Dynamic Hill Climbing Algorithm: ODHC , 2007, Evolutionary Computation in Dynamic and Uncertain Environments.

[84]  S.D.J. McArthur,et al.  Multi-Agent Systems for Power Engineering Applications—Part II: Technologies, Standards, and Tools for Building Multi-agent Systems , 2007, IEEE Transactions on Power Systems.

[85]  Julie A. McCann,et al.  A survey of autonomic computing—degrees, models, and applications , 2008, CSUR.

[86]  Average Annual Emissions and Fuel Consumption for Gasoline-Fueled Passenger Cars and Light Trucks -- Emission Facts (EPA-420-F-08-024) , 2008 .

[87]  K. Schneider,et al.  GridLAB-D: An open-source power systems modeling and simulation environment , 2008, 2008 IEEE/PES Transmission and Distribution Conference and Exposition.

[88]  A.L. Dimeas,et al.  Development of an agent based intelligent control system for microgrids , 2008, 2008 IEEE Power and Energy Society General Meeting - Conversion and Delivery of Electrical Energy in the 21st Century.

[89]  Cesare Alippi,et al.  Just-in-Time Adaptive Classifiers—Part II: Designing the Classifier , 2008, IEEE Transactions on Neural Networks.

[90]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[91]  Arthur P. Dempster,et al.  A Generalization of Bayesian Inference , 1968, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[92]  Y. Kumar,et al.  Multiobjective, Multiconstraint Service Restoration of Electric Power Distribution System With Priority Customers , 2008, IEEE Transactions on Power Delivery.

[93]  A. Druckman,et al.  Household energy consumption in the UK: A highly geographically and socio-economically disaggregated model , 2008 .

[94]  Yoav Shoham,et al.  Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[95]  Daniel J. Veit,et al.  A Critical Survey of Agent-Based Wholesale Electricity Market Models , 2008 .

[96]  Thomas E. Potok,et al.  A Simple Distributed Particle Swarm Optimization for Dynamic and Noisy Environments , 2008, NICSO.

[97]  L.H. Tsoukalas,et al.  From smart grids to an energy internet: Assumptions, architectures and requirements , 2008, 2008 Third International Conference on Electric Utility Deregulation and Restructuring and Power Technologies.

[98]  V. Lesser,et al.  A Multi-Agent Learning Approach to Online Distributed Resource Allocation , 2009, IJCAI.

[99]  Robi Polikar,et al.  Incremental Learning of Variable Rate Concept Drift , 2009, MCS.

[100]  Vinny Cahill,et al.  Multi-policy Optimization in Self-organizing Systems , 2009, SOAR.

[101]  Fabrício Olivetti de França,et al.  A dynamic artificial immune algorithm applied to challenging benchmarking problems , 2009, IEEE Congress on Evolutionary Computation.

[102]  G. Lambert-Torres,et al.  Particle Swarm Optimization applied to system restoration , 2009, 2009 IEEE Bucharest PowerTech.

[103]  Eduardo W. Basso,et al.  Reinforcement Learning in Non-Stationary Continuous Time and Space Scenarios , 2009 .

[104]  Louis Wehenkel,et al.  Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[105]  Koen Kok,et al.  Multi-agent coordination in the electricity grid, from concept towards market introduction , 2010, AAMAS.

[106]  S.K. Srivastava,et al.  Intelligent agent based auction by economic generation scheduling for microgrid operation , 2010, 2010 Innovative Smart Grid Technologies (ISGT).

[107]  Arindam Ghosh,et al.  Renewable energy sources and frequency regulation : survey and new perspectives , 2010 .

[108]  Vinny Cahill,et al.  Soilse: A decentralized approach to optimization of fluctuating urban traffic using Reinforcement Learning , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[109]  李君 NISSAN LEAF 或许,就在明天 , 2010 .

[110]  Martijn Brons,et al.  Plug-in Hybrid and Battery Electric Vehicles. Market penetration scenarios of electric drive vehicles , 2010 .

[111]  Farshid Keynia,et al.  Short-Term Load Forecast of Microgrids by a New Bilevel Prediction Strategy , 2010, IEEE Transactions on Smart Grid.

[112]  H. Farhangi,et al.  The path of the smart grid , 2010, IEEE Power and Energy Magazine.

[113]  M. F. Irzaq bin Khamis,et al.  Electricity forecasting for small scale power system using fuzzy logic , 2010, 2010 Conference Proceedings IPEC.

[114]  Houlei Gao,et al.  Multi-agent based fault location algorithm for smart distribution grid , 2010 .

[115]  Pengcheng Zhang,et al.  A novel multi-agent reinforcement learning approach for job scheduling in Grid computing , 2011, Future Gener. Comput. Syst..

[116]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[117]  Yung-Ruei Chang,et al.  Short-term load forecasting via fuzzy neural network with varied learning rates , 2011, 2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011).

[118]  Sarvapali D. Ramchurn,et al.  Agent-based control for decentralised demand side management in the smart grid , 2011, AAMAS.

[119]  Eduardo F. Morales,et al.  An Introduction to Reinforcement Learning , 2011 .

[120]  Thillainathan Logenthiran,et al.  Multi-agent system for energy resource scheduling of integrated microgrids in a distributed system , 2011 .

[121]  Shengxiang Yang,et al.  Memory-Based Immigrants for Ant Colony Optimization in Changing Environments , 2011, EvoApplications.

[122]  Trung Thanh Nguyen,et al.  Continuous dynamic optimisation using evolutionary algorithms , 2011 .

[123]  Patrick P. K. Chan,et al.  Multiple classifier system for short term load forecast of Microgrid , 2011, 2011 International Conference on Machine Learning and Cybernetics.

[124]  Tom Holvoet,et al.  Decentralized coordination of plug-in hybrid vehicles for imbalance reduction in a smart grid , 2011, AAMAS.

[125]  Peter Vrancx,et al.  Game Theory and Multi-agent Reinforcement Learning , 2012, Reinforcement Learning.

[126]  Vinny Cahill,et al.  Autonomic multi-policy optimization in pervasive systems: Overview and evaluation , 2012, TAAS.

[127]  Sarvapali D. Ramchurn,et al.  Putting the 'smarts' into the smart grid , 2012, Commun. ACM.

[128]  Alfredo Núñez,et al.  Load profile generator and load forecasting for a renewable based microgrid using Self Organizing Maps and neural networks , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[129]  E. Shimoda,et al.  Operation planning and load prediction for microgrid using thermal demand estimation , 2012, 2012 IEEE Power and Energy Society General Meeting.

[130]  Ramachandra Kota,et al.  Cooperative Virtual Power Plant Formation Using Scoring Rules , 2012, AAAI.

[131]  Hamidreza Zareipour,et al.  Electricity Price and Demand Forecasting in Smart Grids , 2012, IEEE Transactions on Smart Grid.

[132]  Helge Langseth,et al.  EFFECTS OF SCALE ON LOAD PREDICTION ALGORITHMS , 2013 .

[133]  Pablo Hernandez-Leal,et al.  Learning against non-stationary opponents , 2013 .

[134]  Panagiotis D. Christofides,et al.  Distributed model predictive control: A tutorial review and future research directions , 2013, Comput. Chem. Eng..

[135]  L. Pazvakawambwa,et al.  Forecasting methods and applications. , 2013 .

[136]  M. H. Nehrir,et al.  Comprehensive Real-Time Microgrid Power Management and Control With Distributed Agents , 2013, IEEE Transactions on Smart Grid.

[137]  Vinny Cahill,et al.  Multi-agent residential demand response based on load forecasting , 2013, 2013 1st IEEE Conference on Technologies for Sustainability (SusTech).

[138]  Ken Nagasaka,et al.  Multiobjective Intelligent Energy Management for a Microgrid , 2013, IEEE Transactions on Industrial Electronics.

[139]  Bin Liu,et al.  Multi-Agent Based Hierarchical Hybrid Control for Smart Microgrid , 2013, IEEE Transactions on Smart Grid.

[140]  Ufuk Topcu,et al.  Optimal decentralized protocol for electric vehicle charging , 2011, IEEE Transactions on Power Systems.

[141]  Siobhán Clarke,et al.  Residential electrical demand forecasting in very small scale: An evaluation of forecasting methods , 2013, 2013 2nd International Workshop on Software Engineering Challenges for the Smart Grid (SE4SG).

[142]  Siobhán Clarke,et al.  Analysis of Approaches to Coordinated Charging of Electric Vehicles on the Distribution Grid , 2013, International Conference on Smart Grids and Green IT Systems.

[143]  Tom Holvoet,et al.  A Scalable Three-Step Approach for Demand Side Management of Plug-in Hybrid Vehicles , 2013, IEEE Transactions on Smart Grid.

[144]  Nikolas Hill,et al.  Powering Ahead: The future of low-carbon cars and fuels , 2013 .

[145]  Tom Holvoet,et al.  A comparison of two GIV mechanisms for providing ancillary services at the University of Delaware , 2013, 2013 IEEE International Conference on Smart Grid Communications (SmartGridComm).

[146]  Suryanarayana Doolla,et al.  Multiagent-Based Distributed-Energy-Resource Management for Intelligent Microgrids , 2013, IEEE Transactions on Industrial Electronics.

[147]  K. Aberer,et al.  Individual, Aggregate, and Cluster-based Aggregate Forecasting of Residential Demand , 2014 .

[148]  Emmanuel Hadoux,et al.  Sequential Decision-Making under Non-stationary Environments via Sequential Change-point Detection , 2014 .

[149]  Pedro Faria,et al.  Distributed intelligent management of microgrids using a multi-agent simulation platform , 2014, 2014 IEEE Symposium on Intelligent Agents (IA).

[150]  Bill Rose,et al.  Microgrids , 2018, Smart Grids.

[151]  T. Pinto,et al.  Multiagent System Architecture for Short-term Operation of Integrated Microgrids , 2014 .

[152]  Jörg Hähner,et al.  Reinforcement Learning for Coverage Optimization Through PTZ Camera Alignment in Highly Dynamic Environments , 2014, ICDSC.

[153]  Ratnesh K. Sharma,et al.  Improving Sustainability of Hybrid Energy Systems Part II: Managing Multiple Objectives With a Multiagent System , 2014, IEEE Transactions on Sustainable Energy.

[154]  Jan van Dalen,et al.  Agent-coordinated virtual power plants of electric vehicles , 2014, AAMAS.

[155]  Siobhán Clarke,et al.  A hybrid approach to very small scale electrical demand forecasting , 2014, ISGT 2014.

[156]  Benjamin Rosman,et al.  Context-based online policy instantiation for multiple tasks and changing environments , 2014 .

[157]  Maria L. Gini,et al.  Fast adaptive learning in repeated stochastic games by game abstraction , 2014, AAMAS.

[158]  Ratnesh K. Sharma,et al.  Improving Sustainability of Hybrid Energy Systems Part I: Incorporating Battery Round-Trip Efficiency and Operational Cost Factors , 2014, IEEE Transactions on Sustainable Energy.

[159]  J. M. Maestre,et al.  Distributed Model Predictive Control: An Overview and Roadmap of Future Research Opportunities , 2014, IEEE Control Systems.

[160]  Liuchen Chang,et al.  Multiagent-Based Hybrid Energy Management System for Microgrids , 2014, IEEE Transactions on Sustainable Energy.

[161]  Wolfgang Ketter,et al.  Learning to schedule electric vehicle charging given individual customer preferences , 2014, AAMAS.

[162]  Jaime Lloret,et al.  A Survey on Electric Power Demand Forecasting: Future Trends in Smart Grids, Microgrids and Smart Buildings , 2014, IEEE Communications Surveys & Tutorials.

[163]  Anil Pahwa,et al.  Game theoretic model of energy trading strategies at equilibrium in microgrids , 2014, 2014 North American Power Symposium (NAPS).

[164]  Ana L. C. Bazzan Beyond Reinforcement Learning and Local View in Multiagent Systems , 2014, KI - Künstliche Intelligenz.

[165]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[166]  Siobhán Clarke,et al.  A dynamic forecasting method for small scale residential electrical demand , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[167]  S. X. Chen,et al.  Multi-Agent System for Distributed Management of Microgrids , 2015, IEEE Transactions on Power Systems.

[168]  Gordon G. Parker,et al.  Survey of multi-agent systems for microgrid control , 2015, Eng. Appl. Artif. Intell..

[169]  Scott Backhaus,et al.  DC Microgrids Scoping Study. Estimate of Technical and Economic Benefits , 2015 .

[170]  Karl Tuyls,et al.  Evolutionary Dynamics of Multi-Agent Learning: A Survey , 2015, J. Artif. Intell. Res..

[171]  Bo Zhao,et al.  An MAS based energy management system for a stand-alone microgrid at high altitude , 2015 .