Learning Robust Policies for Object Manipulation with Robot Swarms
Marius Schnaubelt | Kevin Daun | Gerhard Neumann | Gregor H. W. Gebhardt
[1] Kunikazu Kobayashi, et al. A reinforcement learning system for swarm behaviors, 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).
[2] Sylvain Martel, et al. Using a swarm of self-propelled natural microrobots in the form of flagellated bacteria to perform complex micro-assembly tasks, 2010, 2010 IEEE International Conference on Robotics and Automation.
[3] Radhika Nagpal, et al. Collective transport of complex objects by simple robots: theory and experiments, 2013, AAMAS.
[4] Masashi Furukawa, et al. An actor-critic approach for learning cooperative behaviors of multiagent seesaw balancing problems, 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.
[5] Lynne E. Parker, et al. Multiple Mobile Robot Systems, 2008, Springer Handbook of Robotics.
[6] Marius Schnaubelt, et al. Learning to Assemble Objects with a Robot Swarm, 2017, AAMAS.
[7] Zoubin Ghahramani, et al. Sparse Gaussian Processes using Pseudo-inputs, 2005, NIPS.
[8] Johannes Fürnkranz, et al. Model-Free Preference-Based Reinforcement Learning, 2016, AAAI.
[9] Alcherio Martinoli, et al. Multi-robot learning with particle swarm optimization, 2006, AAMAS '06.
[10] Le Song, et al. A Hilbert Space Embedding for Distributions, 2007, Discovery Science.
[11] Yasemin Altun, et al. Relative Entropy Policy Search, 2010.
[12] Yoram Koren, et al. Potential field methods and their inherent limitations for mobile robot navigation, 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.
[13] Karol Myszkowski, et al. Adaptive Logarithmic Mapping For Displaying High Contrast Scenes, 2003, Comput. Graph. Forum.
[14] Justin A. Boyan, et al. Least-Squares Temporal Difference Learning, 1999, ICML.
[15] Yi Wu, et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, 2017, NIPS.
[16] Ai Poh Loh, et al. Model-based contextual policy search for data-efficient generalization of robot skills, 2017, Artif. Intell.
[17] Aaron Becker, et al. Object manipulation and position control using a swarm with global inputs, 2016, 2016 IEEE International Conference on Automation Science and Engineering (CASE).
[18] Alcherio Martinoli, et al. Parallel learning in heterogeneous multi-robot swarms, 2007, 2007 IEEE Congress on Evolutionary Computation.
[19] O. Khatib, et al. Real-Time Obstacle Avoidance for Manipulators and Mobile Robots, 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.
[20] Radhika Nagpal, et al. Kilobot: A low cost robot with scalable operations designed for collective behaviors, 2014, Robotics Auton. Syst.
[21] Radhika Nagpal, et al. Kilobot: A low cost scalable robot system for collective behaviors, 2012, 2012 IEEE International Conference on Robotics and Automation.
[22] Robert L. Stevenson, et al. Dynamic range improvement through multiple exposures, 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).
[23] Jan Peters, et al. Non-parametric Policy Search with Limited Information Loss, 2017, J. Mach. Learn. Res.
[24] Nils J. Nilsson, et al. A Formal Basis for the Heuristic Determination of Minimum Cost Paths, 1968, IEEE Trans. Syst. Sci. Cybern.
[25] N. Aronszajn. Theory of Reproducing Kernels, 1950.
[26] Matthew W. Hoffman, et al. Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization, 2011, EWRL.
[27] Michael Rubenstein, et al. Massive uniform manipulation: Controlling large populations of simple robots with a common input signal, 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.