Integrating distributed Bayesian inference and reinforcement learning for sensor management
暂无分享,去创建一个
Shimon Whiteson | Gregor Pavlin | Bram Bakker | Corrado Grappiolo | B. Bakker | G. Pavlin | Corrado Grappiolo | Shimon Whiteson
[1] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..
[2] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[3] Craig Boutilier,et al. Value-Directed Belief State Approximation for POMDPs , 2000, UAI.
[4] Joelle Pineau,et al. Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.
[5] Yang Xiang,et al. PROBABILISTIC REASONING IN MULTIAGENT SYSTEMS: A GRAPHICAL MODELS APPROACH, by Yang Xiang, Cambridge University Press, Cambridge, 2002, xii + 294 pp., ISBN 0-521-81308-5 (Hardback, £45.00). , 2002, Robotica.
[6] Bram Bakker,et al. Reinforcement Learning with Long Short-Term Memory , 2001, NIPS.
[7] Andrew McCallum,et al. Instance-Based Utile Distinctions for Reinforcement Learning , 1995 .
[8] M. Spaan. Cooperative Active Perception using POMDPs , 2008 .
[9] Judea Pearl,et al. Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.
[10] R. Simmons,et al. Probabilistic Navigation in Partially Observable Environments , 1995 .
[11] R. A. Adrian. Sensor management , 1993, [1993 Proceedings] AIAA/IEEE Digital Avionics Systems Conference.
[12] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[13] Marinus Maris,et al. A multi-agent systems approach to distributed bayesian information fusion , 2010, Inf. Fusion.
[14] Wendi Heinzelman,et al. Sensor management , 2004 .
[15] Reid G. Simmons,et al. Probabilistic Robot Navigation in Partially Observable Environments , 1995, IJCAI.
[16] Milos Hauskrecht,et al. Value-Function Approximations for Partially Observable Markov Decision Processes , 2000, J. Artif. Intell. Res..
[17] Toygar Karadeniz,et al. FDMS with Q-Learning: A Neuro-Fuzzy Approach to Partially Observable Markov Decision Problems , 2004 .
[18] Frédéric Dambreville,et al. The cross-entropy method for solving a variety of hierarchical search problems , 2007, 2007 10th International Conference on Information Fusion.
[19] Gregor Pavlin,et al. A modular approach to adaptive Bayesian information fusion , 2007, 2007 10th International Conference on Information Fusion.