Paper information - Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

DYLAN J. A. BRENNEIS, DeepMind Technologies, Ltd., Canada
ADAM S. R. PARKER, University of Alberta, Canada
MICHAEL BRADLEY JOHANSON, DeepMind Technologies, Ltd., Canada
ANDREW BUTCHER, DeepMind Technologies, Ltd., Canada
ELNAZ DAVOODI, DeepMind Technologies, Ltd., Canada
LESLIE ACKER, DeepMind Technologies, Ltd., Canada
MATTHEW M. BOTVINICK, DeepMind Technologies, Ltd., United Kingdom
JOSEPH MODAYIL, DeepMind Technologies, Ltd., Canada
ADAM WHITE, DeepMind & University of Alberta & Alberta Machine Intelligence Institute, Canada
PATRICK M. PILARSKI, DeepMind & University of Alberta & Alberta Machine Intelligence Institute, Canada

[1] Elliot A. Ludvig, et al. From eye-blinks to state construction: Diagnostic benchmarks for online representation learning, 2020.

[2] Patrick M. Pilarski, et al. Learned human-agent decision-making, communication and joint action in a virtual reality environment, 2019, ArXiv.

[3] Iyad Rahwan, et al. Cooperating with machines, 2017, Nature Communications.

[4] Angeliki Lazaridou, et al. Emergent Multi-Agent Communication in the Deep Learning Era, 2020, ArXiv.

[5] Richard S. Sutton, et al. Prediction Driven Behavior: Learning Predictions that Drive Fixed Responses, 2014, AAAI 2014.

[6] Evan F. Risko, et al. Cognitive Offloading, 2016, Trends in Cognitive Sciences.

[7] Jessie Y. C. Chen, et al. A Meta-Analysis of Factors Affecting Trust in Human-Robot Interaction, 2011, Hum. Factors.

[8] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.

[9] E. Kehoe, et al. Fundamental Behavioral Methods and Findings in Classical Conditioning, 2002.

[10] Pattie Maes, et al. Agents that reduce work and information overload, 1994, CACM.

[11] Maya Cakmak, et al. Power to the People: The Role of Humans in Interactive Machine Learning, 2014, AI Mag.

[12] Patrick M. Pilarski, et al. Adaptive artificial limbs: a real-time approach to prediction and anticipation, 2013, IEEE Robotics & Automation Magazine.

[13] G. Ritchie, et al. Signalling signalhood and the emergence of communication, 2008, Cognition.

[14] Craig Sherstan, et al. Towards Prosthetic Arms as Wearable Intelligent Robots, 2015.

[15] Craig Sherstan, et al. Application of real-time machine learning to myoelectric prosthesis control: A case series in adaptive switching, 2016, Prosthetics and Orthotics International.

[16] Richard S. Sutton, et al. Pavlovian control of intraspinal microstimulation to produce over-ground walking, 2019, bioRxiv.

[17] Patrick M. Pilarski, et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction, 2011, AAMAS.

[18] Bo He, et al. Human-Centered Reinforcement Learning: A Survey, 2019, IEEE Transactions on Human-Machine Systems.

[19] Jonathon W. Sensinger, et al. Embodied Cooperation to Promote Forgiving Interactions With Autonomous Machines, 2021, Frontiers in Neurorobotics.

[20] Patrick M. Pilarski, et al. Exploring the Impact of Machine-Learned Predictions on Feedback from an Artificial Limb, 2019, 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR).

[21] Patrick M. Pilarski, et al. Machine learning and unlearning to autonomously switch between the functions of a myoelectric arm, 2016, 2016 6th IEEE International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[22] Dean V. Buonomano, et al. The Neural Basis of Timing: Distributed Mechanisms for Diverse Functions, 2018, Neuron.

[23] Richard S. Sutton, et al. Multi-timescale nexting in a reinforcement learning robot, 2011, Adapt. Behav.

[24] T. Scott-Phillips. Speaking Our Minds: Why human communication is different, and how language evolved to make it special, 2014.

[25] Jason P. Gallivan, et al. Three-dimensional reach trajectories as a probe of real-time decision-making between multiple competing targets, 2014, Front. Neurosci.