暂无分享,去创建一个
Michael L. Littman | Jun Ki Lee | Emma Tosch | Sam Witty | Akanksha Atrey | David D. Jensen | M. Littman | David D. Jensen | Emma Tosch | Akanksha Atrey | Jun Ki Lee | Sam Witty
[1] Dileep George,et al. Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics , 2017, ICML.
[2] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[3] Ivan Laptev,et al. Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[4] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[5] Marlos C. Machado,et al. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents , 2017, J. Artif. Intell. Res..
[6] P. Alam. ‘A’ , 2021, Composites Engineering: An A–Z Guide.
[7] Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
[8] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[9] Sandy H. Huang,et al. Adversarial Attacks on Neural Network Policies , 2017, ICLR.
[10] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[11] Ronald E. Parr,et al. A Novel Benchmark Methodology and Data Repository for Real-life Reinforcement Learning , 2009 .
[12] Peter Stone,et al. The Impact of Determinism on Learning Atari 2600 Games , 2015, AAAI Workshop: Learning for General Competency in Video Games.
[13] Moshe Dor,et al. אבן, and: Stone , 2017 .
[14] Samy Bengio,et al. A Study on Overfitting in Deep Reinforcement Learning , 2018, ArXiv.
[15] Yoshua Bengio,et al. A Closer Look at Memorization in Deep Networks , 2017, ICML.
[16] Shimon Whiteson,et al. Protecting against evaluation overfitting in empirical reinforcement learning , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[17] Sham M. Kakade,et al. On the sample complexity of reinforcement learning. , 2003 .
[18] P. Alam. ‘S’ , 2021, Composites Engineering: An A–Z Guide.
[19] Tsuyoshi Murata,et al. {m , 1934, ACML.
[20] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[21] P. Alam. ‘G’ , 2021, Composites Engineering: An A–Z Guide.
[22] Silvio Savarese,et al. Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).