Online abstraction with MDP homomorphisms for Deep Learning
暂无分享,去创建一个
[1] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[2] Alicia P. Wolfe. Defining Object Types and Options Using MDP Homomorphisms , 2006 .
[3] Atsuto Maki,et al. A systematic study of the class imbalance problem in convolutional neural networks , 2017, Neural Networks.
[4] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Manfred Huber,et al. Learning to generalize and reuse skills using approximate partial policy homomorphisms , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.
[6] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[7] A. Barto,et al. An algebraic approach to abstraction in reinforcement learning , 2004 .
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] A. Azzouz. 2011 , 2020, City.
[10] John Platt,et al. Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .
[11] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[12] Thomas A. Funkhouser,et al. Dilated Residual Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] R. Bellman. A Markovian Decision Process , 1957 .
[14] Satinder P. Singh,et al. Transfer via soft homomorphisms , 2009, AAMAS.
[15] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.
[16] Kilian Q. Weinberger,et al. On Calibration of Modern Neural Networks , 2017, ICML.
[17] Balaraman Ravindran. Approximate Homomorphisms : A framework for non-exact minimization in Markov Decision Processes , 2022 .
[18] Vishal Soni,et al. Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains , 2006, AAAI.
[19] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[20] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[21] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[22] Alicia P. Wolfe,et al. Decision Tree Methods for Finding Reusable MDP Homomorphisms , 2006, AAAI.
[23] Doina Precup,et al. Bounding Performance Loss in Approximate MDP Homomorphisms , 2008, NIPS.
[24] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..