Inferring Intent for Novice Human-in-the-Loop Iterative Learning Control

This paper studies human-in-the-loop iterative learning control for output-trajectory tracking of linear time-invariant (LTI) systems. Note that knowledge of the intended output is necessary to ascertain the tracking error and find the corrective input needed to reduce the error at each iteration step. In human-in-the-loop systems, the controller does not have direct access to the intended output, and therefore, the tracking error needs to be inferred from human actions. A challenge, however, in inferring the intent and the tracking error is that the human-response dynamics affects the relation between the tracking error (known to the human) and the resulting human action. The main contribution of this paper is to infer the tracking error by inverting a model of the human response and using it for iteratively controlling general LTI systems. This is enabled by modifying the controlled-system dynamics perceived by the human to be of a form for which the human-response model is known. An advantage of the proposed approach is that output tracking is achieved in spite of (potentially) imperfect corrective actions of the novice human user. Results of human-in-the-loop tracking experiments with nine human subjects show that the proposed approach can estimate the human intent (to within 97% accuracy). Moreover, the iterative approach substantially reduces the tracking error (about 93%) when compared with the tracking error achieved by the human alone.

[1]  Gregory D. Hager,et al.  Motion generation of robotic surgical tasks: Learning from expert demonstrations , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[2]  Maarten Steinbuch,et al.  Learning-based identification and iterative learning control of direct-drive robots , 2005, IEEE Transactions on Control Systems Technology.

[3]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[4]  A. Terry Bahill,et al.  Smooth pursuit eye movements in response to predictable target motions , 1983, Vision Research.

[5]  Qingze Zou,et al.  Iterative control of dynamics-coupling-caused errors in piezoscanners during high-speed AFM operation , 2005, IEEE Transactions on Control Systems Technology.

[6]  Suguru Arimoto,et al.  Bettering operation of Robots by learning , 1984, J. Field Robotics.

[7]  Eric Rogers,et al.  Iterative Learning Control for Electrical Stimulation and Stroke Rehabilitation , 2015 .

[8]  Rahul B. Warrier,et al.  Iterative Learning From Novice Human Demonstrations for Output Tracking , 2016, IEEE Transactions on Human-Machine Systems.

[9]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[10]  Jarrett Webb,et al.  Beginning Kinect Programming with the Microsoft Kinect SDK , 2012, Apress.

[11]  A. Tickner,et al.  Structured and Blank Backgrounds in a Pursuit Tracking Task , 1970 .

[12]  Pieter Abbeel,et al.  Superhuman performance of surgical tasks by robots using iterative learning from human-guided demonstrations , 2010, 2010 IEEE International Conference on Robotics and Automation.

[13]  Qingze Zou,et al.  Preview-Based Stable-Inversion for Output Tracking of Linear Systems , 1999 .

[14]  Bo Yu,et al.  Interaction Dynamics in Oscillator and Human-in-the-loop Systems. , 2014 .

[15]  Lie Tang,et al.  Multiaxis Contour Control—the State of the Art , 2013, IEEE Transactions on Control Systems Technology.

[16]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[17]  Daniel H. Grollman,et al.  Dogged Learning for Robots , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[18]  Kenn Richard Oldham,et al.  Convergence and Energy Analysis for Iterative Adaptive ON-OFF Control of Piezoelectric Microactuators , 2014, IEEE Transactions on Control Systems Technology.

[19]  Qingze Zou,et al.  A review of feedforward control approaches in nanopositioning for high-speed spm , 2009 .

[20]  Rahul B. Warrier,et al.  Inverse control for inferring intent in novice human-in-the-loop iterative learning , 2016, 2016 American Control Conference (ACC).

[21]  Andrew G. Alleyne,et al.  Bumpless Transfer Filter for Exogenous Feedforward Signals , 2014, IEEE Transactions on Control Systems Technology.

[22]  Duane T. McRuer,et al.  A Review of Quasi-Linear Pilot Models , 1967 .

[23]  Masayoshi Tomizuka,et al.  Dual-Stage Iterative Learning Control for MIMO Mismatched System With Application to Robots With Joint Elasticity , 2014, IEEE Transactions on Control Systems Technology.

[24]  E. R. F. W. Grossman,et al.  The Information-Capacity of the Human Motor-System in Pursuit Tracking , 1960 .

[25]  Santosh Devasia,et al.  Output tracking with nonhyperbolic and near nonhyperbolic internal dynamics: helicopter hover control , 1997, Proceedings of the 1997 American Control Conference (Cat. No.97CH36041).

[26]  David L. Kleinman,et al.  An optimal control model of human response part I: Theory and validation , 1970 .

[27]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[28]  Ronald A. Hess Effects of Time Delays on Systems Subject to Manual Control , 1984 .

[29]  Chi-Tsong Chen,et al.  Linear System Theory and Design , 1995 .

[30]  Daxue Liu,et al.  Self-Learning Cruise Control Using Kernel-Based Least Squares Policy Iteration , 2014, IEEE Transactions on Control Systems Technology.