Cyber-Human Approach For Learning Human Intention And Shape Robotic Behavior Based On Task Demonstration
暂无分享,去创建一个
Vinicius G. Goecks | William D. Nothwang | Gregory M. Gremillion | Hannah C. Lehman | G. Gremillion | W. Nothwang | Hannah C. Lehman
[1] George D. C. Cavalcanti,et al. Combining dissimilarity spaces for text categorization , 2017, Inf. Sci..
[2] Jitendra Malik,et al. Combining self-supervised learning and imitation for vision-based rope manipulation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[3] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[4] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[5] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[6] E. Morales,et al. Human Interaction for Effective Reinforcement Learning , 2013 .
[7] W. Bradley Knox,et al. Learning from human-generated reward , 2012 .
[8] Ahmad Hakimi,et al. Ideal Gas Optimization Algorithm , 2017 .
[9] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[10] Teresa Bernarda Ludermir,et al. Optimization of the weights and asymmetric activation function family of neural network for time series forecasting , 2013, Expert Syst. Appl..
[11] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[12] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[13] Ashish Kapoor,et al. AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles , 2017, FSR.
[14] Kevin Barraclough,et al. I and i , 2001, BMJ : British Medical Journal.
[15] Andrea Lockerd Thomaz,et al. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance , 2006, AAAI.
[16] Carl E. Rasmussen,et al. Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[17] Claudia-Adina Dragos,et al. Online identification of evolving Takagi-Sugeno-Kang fuzzy models for crane systems , 2014, Appl. Soft Comput..
[18] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[19] Peter Stone,et al. Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance , 2015, Artif. Intell..
[20] W. Marsden. I and J , 2012 .
[21] Peter Stone,et al. Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces , 2017, AAAI.