Scaling-up Knowledge for a Cognizant Robot
暂无分享,去创建一个
[1] W. Grey Walter,et al. AN ELECTRO‐MECHANICAL »ANIMAL«1 , 1950 .
[2] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .
[3] J. Pearce. Animal Learning and Cognition: An Introduction , 1997 .
[4] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[5] Michael R. James,et al. Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.
[6] R. Greenspan,et al. Cognitive consonance: complex brain functions in the fruit fly and its relatives , 2004, Trends in Neurosciences.
[7] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[8] Shalabh Bhatnagar,et al. Fast gradient-descent methods for temporal-difference learning with linear function approximation , 2009, ICML '09.
[9] William Whittaker,et al. Autonomous driving in urban environments: Boss and the Urban Challenge , 2008, J. Field Robotics.
[10] Richard S. Sutton,et al. GQ(lambda): A general gradient algorithm for temporal-difference prediction learning with eligibility traces , 2010, Artificial General Intelligence.
[11] R. Sutton,et al. GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces , 2010 .
[12] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.