Human-centered planning for effective task autonomy

Increasingly available mobile devices (e.g., mobile robots, smart phones) are becoming more intelligent in their ability to autonomously perform tasks for users. However, when deployed in complex human environments, these devices still face many sensing, reasoning, and actuation limitations. To overcome these limitations, we propose symbiotic relationships, in which the device can request help from humans in the environment while it performs tasks for them. Because the devices are performing tasks for humans, humans have an incentive to help the devices complete their tasks effectively. However, they may not always be available or willing to help. We introduce human-centered planning, in which devices model and reason about humans in the environment, in addition to their own state and goals, to determine how to act and whether, whom, and how to ask for help. The thesis first contributes, through user studies, an understanding of what to model about humans in the environment and how to model it. We evaluate whether attributes such as availability and interruptibility affect willingness to help. Then, we contribute to the understanding of how to ask humans for help to increase the accuracy of their responses. We show that providing humans with device context, classification prediction and uncertainty, and additional feedback all increase the accuracy of human responses to device questions. We also contribute algorithms to learn these models both through surveys and online while the device is performing tasks. The thesis then introduces human-centered conditional, deliberative, and replanning algorithms that use these models of humans. We contribute conditional plans that include asking actions to enable devices to perform tasks that they could not otherwise perform. We then contribute a human-centered deliberative planner that a robot uses to determine which navigational path to take to minimize its uncertainty and maximize the likelihood of finding available human helpers.
Finally, we contribute a replanning algorithm for a robot to determine which helper to ask to travel to a particular location, such as an elevator or a kitchen, to help it there. Through extensive experiments and deployments, in particular with a mobile service robot, this thesis shows that human-centered algorithms that trade off task performance against the costs of asking and interrupting human helpers increase functionality while maintaining usability.
