Behavior Acquisition in Partially Observable Environments by Autonomous Segmentation of the Observation Space

[1]  Hiroshi Kawano Three-Dimensional Obstacle Avoidance of Blimp-Type Unmanned Aerial Vehicle Flying in Unknown and Non-Uniform Wind Disturbance , 2007, J. Robotics Mechatronics.

[2]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[3]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[4]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[5]  Tsukasa Ogasawara,et al.  Self-Partitioning State Space for Behavior Acquisition of Vision-Based Mobile Robots , 2001, J. Robotics Mechatronics.

[6]  Ron Sun,et al.  Self-segmentation of sequences: automatic formation of hierarchies of sequential behaviors , 2000, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Sebastian Thrun,et al.  Monte Carlo POMDPs , 1999, NIPS.

[8]  Rolf Pfeifer,et al.  Understanding intelligence , 2020, Inequality by Design.

[9]  Akira Hayashi,et al.  A Reinforcement Learning Algorithm in Partially Observable Environments Using Short-Term Memory , 1998, NIPS.

[10]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[11]  Jürgen Schmidhuber,et al.  HQ-Learning , 1997, Adapt. Behav..

[12]  Andrew McCallum,et al.  Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State , 1995, ICML.

[13]  Long Ji Lin,et al.  Reinforcement Learning of Non-Markov Decision Processes , 1995, Artif. Intell..

[14]  Michael L. Littman,et al.  Memoryless policies: theoretical limitations and practical results , 1994 .

[15]  Michael I. Jordan,et al.  Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.

[16]  Pattie Maes,et al.  Behavior-based artificial intelligence , 1993 .

[17]  Tom M. Mitchell,et al.  Reinforcement learning with hidden states , 1993 .

[18]  Lonnie Chrisman,et al.  Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach , 1992, AAAI.

[19]  Long Lin,et al.  Memory Approaches to Reinforcement Learning in Non-Markovian Domains , 1992 .