论文信息 - Optimal action sequence generation for assistive agents in fixed horizon tasks

Optimal action sequence generation for assistive agents in fixed horizon tasks

Agents providing assistance to humans are faced with the challenge of automatically adjusting the level of assistance to ensure optimal performance. In this work, we argue that identifying the right level of assistance consists in balancing positive assistance outcomes and some (domain-dependent) measure of cost associated with assistive actions. Towards this goal, we contribute a general mathematical framework for structured tasks where an agent playing the role of a ‘provider’—e.g., therapist, teacher—assists a human ‘receiver’—e.g., patient, student. We specifically consider tasks where the provider agent needs to plan a sequence of actions over a fixed time horizon, where actions are organized along a hierarchy with increasing success probabilities, and some associated costs. The goal of the provider is to achieve a success with the lowest expected cost possible. We present OAssistMe, an algorithm that generates cost-optimal action sequences given the action parameters, and investigate several extensions of it, motivated by different potential application domains. We provide an analysis of the algorithms, including proofs for a number of properties of optimal solutions that, we show, align with typical human provider strategies. Finally, we instantiate our theoretical framework in the context of robot-assisted therapy tasks for children with Autism Spectrum Disorder (ASD). In this context, we present methods for determining action parameters based on a survey of domain experts and real child-robot interaction data. Our contributions unlock increased levels of flexibility for agents introduced in a variety of assistive contexts.

[1] Ki-Hun Cho,et al. Is robot-assisted therapy effective in upper extremity recovery in early stage stroke? —a systematic literature review , 2017, Journal of physical therapy science.

[2] Changchun Liu,et al. Affect-sensitive assistive intervention technologies for children with autism: An individual-specific approach , 2008, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[3] Patrick Olivier,et al. People, sensors, decisions: Customizable and adaptive technologies for assistance in healthcare , 2012, TIIS.

[4] D Feil-Seifer,et al. Socially Assistive Robotics , 2011, IEEE Robotics & Automation Magazine.

[5] Gita Reese Sukthankar,et al. Tractable POMDP representations for intelligent tutoring systems , 2013, TIST.

[6] Pierre-Yves Oudeyer,et al. Online Optimization of Teaching Sequences with Multi-Armed Bandits , 2014, EDM.

[7] Tom Routen,et al. Intelligent Tutoring Systems , 1996, Lecture Notes in Computer Science.

[8] Cristina Conati,et al. Empirically building and evaluating a probabilistic model of user affect , 2009, User Modeling and User-Adapted Interaction.

[9] Zdenko Kovacic,et al. Robot-assisted Autism Spectrum Disorder Diagnostics using POMDPs , 2017, HRI.

[10] Oguzhan Alagoz,et al. Markov Decision Processes: A Tool for Sequential Decision Making under Uncertainty , 2010, Medical decision making : an international journal of the Society for Medical Decision Making.

[11] Eric Horvitz,et al. Agents With Beliefs: Reflections on Bayesian Methods for User Modeling , 1997 .

[12] Siddhartha S. Srinivasa,et al. Human-Robot Mutual Adaptation in Shared Autonomy , 2017, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI.

[13] Karl J. Friston,et al. Computational Phenotyping in Psychiatry: A Worked Example , 2016, eNeuro.

[14] E. Koechlin,et al. The Importance of Falsification in Computational Cognitive Modeling , 2017, Trends in Cognitive Sciences.

[15] Клинические дисциплины. Autism Diagnostic Observation Schedule , 2010 .

[16] Nancy K. Lamport,et al. Activity Analysis: Application to Occupation , 2005 .

[17] H. Head. Aphasia and kindred disorders of speech , 1926 .

[18] Milos Hauskrecht,et al. Planning treatment of ischemic heart disease with partially observable Markov decision processes , 2000, Artif. Intell. Medicine.

[19] Karl J. Friston,et al. Human Neuroscience Hypothesis and Theory Article an Aberrant Precision Account of Autism , 2022 .

[20] Tiffany Barnes,et al. Toward Automatic Hint Generation for Logic Proof Tutoring Using Historical Student Data , 2008, Intelligent Tutoring Systems.

[21] Ana Paiva,et al. Developing Learning Scenarios to Foster Children's Handwriting Skills with the Help of Social Robots , 2017, HRI.

[22] Francisco S. Melo,et al. An Optimization Approach for Structured Agent-Based Provider/Receiver Tasks , 2019, AAMAS.

[23] B. Denton,et al. Robust Markov Decision Processes for Medical Treatment Decisions , 2015 .

[24] N. Sarkar,et al. Can Robotic Interaction Improve Joint Attention Skills? , 2015, Journal of autism and developmental disorders.

[25] Kenneth R. Koedinger,et al. Data-Driven Hint Generation in Vast Solution Spaces: a Self-Improving Python Programming Tutor , 2017, International Journal of Artificial Intelligence in Education.

[26] Iolanda Leite,et al. Long-term interactions with empathic social robots , 2015, SIGAI.

[27] G. Arbanas. Diagnostic and Statistical Manual of Mental Disorders (DSM-5) , 2015 .

[28] Sarel van Vuuren,et al. A Virtual Therapist for Speech and Language Therapy , 2014, IVA.

[29] Freek E. Hoebeek,et al. SLC26A11 (KBAT) in Purkinje Cells Is Critical for Inhibitory Transmission and Contributes to Locomotor Coordination , 2016, eNeuro.

[30] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[31] Shaobo Huang,et al. How to train your DragonBot: Socially assistive robots for teaching children about nutrition through play , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[32] Tom Ziemke,et al. How to Build a Supervised Autonomous System for Robot-Enhanced Therapy for Children with Autism Spectrum Disorder , 2017, Paladyn J. Behav. Robotics.

[33] Andrew J. Schaefer,et al. Modeling Medical Treatment Using Markov Decision Processes , 2005 .

[34] Chih-Wei Chang,et al. A Robot as a Teaching Assistant in an English Class , 2006, Sixth IEEE International Conference on Advanced Learning Technologies (ICALT'06).

[35] Pierre Baldi,et al. Bayesian surprise attracts human attention , 2005, Vision Research.

[36] Maja J. Matarić,et al. Surprise! Predicting Infant Visual Attention in a Socially Assistive Robot Contingent Learning Paradigm , 2019, 2019 28th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN).

[37] Craig W. Linebaugh,et al. Cueing Hierarchies and Word Retrieval: A Therapy Program , 1977 .

[38] A. Mihailidis,et al. Assistive technology for cognitive rehabilitation: State of the art , 2004 .

[39] Albert A. Rizzo,et al. Virtual humans for assisted health care , 2008, PETRA '08.

[40] Kurt VanLehn,et al. A Comparison of Decision-Theoretic, Fixed-Policy and Random Tutorial Action Selection , 2006, Intelligent Tutoring Systems.

[41] Pauline Gibbons,et al. Scaffolding language, scaffolding learning , 2002 .

[42] Mohamed Chetouani,et al. A multimodal and multilevel system for robotics treatment of autism in children , 2016, DAA '16.

[43] Robert C Wilson,et al. Ten simple rules for the computational modeling of behavioral data , 2019, eLife.

[44] Rosemary Luckin,et al. Artificial Intelligence in Education, Building Technology Rich Learning Contexts That Work, Proceedings of the 13th International Conference on Artificial Intelligence in Education, AIED 2007, July 9-13, 2007, Los Angeles, California, USA , 2007, AIED.

[45] Stuart J. Russell,et al. Partially Observable Sequential Decision Making for Problem Selection in an Intelligent Tutoring System , 2011, EDM.

[46] B. Scassellati,et al. Robots for use in autism research. , 2012, Annual review of biomedical engineering.

[47] Subbarao Kambhampati,et al. What can Automated Planning do for Intelligent Tutoring Systems ? , 2018 .