Performance Loss Bound for State Aggregation in a Class of Supply Demand Matching Systems
暂无分享,去创建一个
[1] Ricard Gavaldà,et al. Monotone Proofs of the Pigeon Hole Principle , 2001, Math. Log. Q..
[2] Dimitri P. Bertsekas,et al. Approximate Dynamic Programming , 2017, Encyclopedia of Machine Learning and Data Mining.
[3] John N. Tsitsiklis,et al. Feature-based methods for large scale dynamic programming , 2004, Machine Learning.
[4] Peter Stone,et al. State Abstraction Discovery from Irrelevant State Variables , 2005, IJCAI.
[5] Benjamin Van Roy. Performance Loss Bounds for Approximate Value Iteration with State Aggregation , 2006, Math. Oper. Res..
[6] D. Bertsekas. Approximate policy iteration: a survey and some new methods , 2011 .
[7] B. Krogh,et al. State aggregation in Markov decision processes , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..
[8] Xi-Ren Cao,et al. Stochastic learning and optimization - A sensitivity-based approach , 2007, Annu. Rev. Control..
[9] Zhiyuan Ren,et al. A time aggregation approach to Markov decision processes , 2002, Autom..
[10] Michael L. Littman,et al. Near Optimal Behavior via Approximate State Abstraction , 2016, ICML.
[11] Xuan Zhang,et al. Decentralized EV-Based Charging Optimization With Building Integrated Wind Energy , 2019, IEEE Transactions on Automation Science and Engineering.
[12] Junjie Wu,et al. A Q-Learning Method for Scheduling Shared EVs Under Uncertain User Demand and Wind Power Supply , 2018, 2018 IEEE Conference on Control Technology and Applications (CCTA).
[13] Qing-Shan Jia,et al. On State Aggregation to Approximate Complex Value Functions in Large-Scale Markov Decision Processes , 2011, IEEE Transactions on Automatic Control.
[14] Junjie Wu,et al. On State Aggregation in a Class of Cyber Physical Energy Systems , 2018, 2018 37th Chinese Control Conference (CCC).
[15] Thomas J. Walsh,et al. Towards a Unified Theory of State Abstraction for MDPs , 2006, AI&M.
[16] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[17] Abhijit Gosavi,et al. Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .
[18] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[19] D. Bertsekas. Reinforcement Learning and Optimal ControlA Selective Overview , 2018 .
[20] Junjie Wu,et al. Event-Based HVAC Control—A Complexity-Based Approach , 2018, IEEE Transactions on Automation Science and Engineering.