Learning and validation of human control strategies

In this thesis, we apply machine learning techniques and statistical analysis towards the learning and validation of human control strategy (HCS) models. This work has potential impact in a number of applications ranging from space telerobotics and real-time training to the Intelligent Vehicle Highway System (IVHS) and human-machine interfacing. We specifically focus on the following two important questions: (1) how to efficiently and effectively model human control strategies from real-time human control data and (2) how to validate the performance of the learned HCS models in the feedback control loop. To these ends, we propose two discrete-time modeling frameworks, one for continuous and another for discontinuous human control strategies. For the continuous case, we propose and develop an efficient neural-network learning architecture that combines flexible cascade neural networks with extended Kalman filtering. This learning architecture demonstrates convergence to better local minima in many fewer epochs than alternative, competing neural network learning regimes. For the discontinuous case, we propose and develop a statistical framework that models control actions by individual statistical models. A stochastic selection criterion, based on the posterior probabilities for each model, then selects a particular control action at each time step. Next, we propose and develop a stochastic similarity measure—based on Hidden Markov Model (HMM) analysis—that compares dynamic, stochastic control trajectories. We derive important properties for this similarity measure, and then, by quantifying the similarity between model-generated control trajectories and corresponding human control data, apply this measure towards validating the learned models of human control strategy. The degree of similarity (or dissimilarity) between a model and its training data indicates how appropriate a specific modeling approach is within a specific context. Throughout, the learning and validation methods proposed herein are tested on human control data, collected through a dynamic, graphic driving simulator that we have developed for this purpose. In addition, we analyze actual driving data collected through the Navlab project at Carnegie Mellon University.

[1]  Hans Henrik Thodberg,et al.  Improving Generalization of Neural Networks Through Pruning , 1991, Int. J. Neural Syst..

[2]  Yangsheng Xu,et al.  Human skill transfer: neural networks as learners and teachers , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[3]  Rüdiger Dillmann,et al.  What Can Robots Learn from Humans , 1996 .

[4]  Sukhan Lee,et al.  Skill learning from observations , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[5]  Charles M. Rader,et al.  Fast transforms: Algorithms, analyses, applications , 1984 .

[6]  Yangsheng Xu,et al.  On the fidelity of human skill models , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[7]  A. Guez,et al.  A trainable neuromorphic controller , 1988 .

[8]  V. Gullapalli,et al.  Acquiring robot skills via reinforcement learning , 1994, IEEE Control Systems.

[9]  Sheng Liu,et al.  Transfer of human skills to neural net robot controllers , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[10]  Geir Hovland,et al.  Skill acquisition from human demonstration using a hidden Markov model , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[11]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[12]  Timur Ash,et al.  Dynamic node creation in backpropagation networks , 1989 .

[13]  Pradeep K. Khosla,et al.  Towards gesture-based programming: shape from motion primordial learning of sensorimotor primitives , 1997, Robotics Auton. Syst..

[14]  S. Schaal,et al.  Robot juggling: implementation of memory-based learning , 1994, IEEE Control Systems.

[15]  Yangsheng Xu,et al.  Human control strategy: abstraction, verification, and replication , 1997 .

[16]  Todd Jochem,et al.  Rapidly Adapting Machine Vision for Automated Vehicle Steering , 1996, IEEE Expert.

[17]  E. Fix,et al.  Modeling human performance with neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[18]  Blake Hannaford,et al.  Hidden Markov Model Analysis of Force/ Torque Information in Telemanipulation , 1989, ISER.

[19]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[20]  Yangsheng Xu,et al.  Cascade neural networks with node-decoupled extended Kalman filtering , 1997, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation'.

[21]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[22]  Giovanna Castellano,et al.  An empirical comparison of node pruning methods for layered feedforward neural networks , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[23]  Martial Hebert,et al.  On 3D shape similarity , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Daphna Weinshall,et al.  Distance Metric Between 3D Models and 2D Images for Recognition and Classification , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Wayne Iba,et al.  Modeling the Acquisition and Improvement of Motor Skkills , 1991, ML.

[26]  S Z Qin,et al.  Comparison of four neural net learning methods for dynamic system identification , 1992, IEEE Trans. Neural Networks.

[27]  Denis Royston Towill,et al.  Modelling the Helmsman in a Ship Steering System Using Fuzzy Sets , 1988 .

[28]  Pietro Burrascano A pruning technique maximizing generalization , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[29]  Yangsheng Xu,et al.  Human action learning via hidden Markov model , 1997, IEEE Trans. Syst. Man Cybern. Part A.

[30]  Geoffrey E. Hinton,et al.  Experiments on Learning by Back Propagation. , 1986 .

[31]  Richard A. Volz,et al.  Learning force sensory patterns and skills front human demonstration , 1997, Proceedings of International Conference on Robotics and Automation.

[32]  Yangsheng Xu,et al.  On discontinuous human control strategies , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[33]  H. Harry Asada,et al.  Skill acquisition from human experts through pattern processing of teaching data , 1989, Proceedings, 1989 International Conference on Robotics and Automation.

[34]  Parag H. Batavia,et al.  Driver Adaptive Warning Systems , 1998 .

[35]  Junji Yamato,et al.  Recognizing human action in time-sequential images using hidden Markov model , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  Marco Saerens,et al.  Classification of temporal trajectories by continuous-time recurrent nets , 1994, Neural Networks.

[37]  Dean Pomerleau,et al.  Reliability estimation for neural network based autonomous driving , 1994, Robotics Auton. Syst..

[38]  Yoshio Hirose,et al.  Backpropagation algorithm which varies the number of hidden units , 1989, International 1989 Joint Conference on Neural Networks.

[39]  H. Hatwal,et al.  Some Inverse Solutions to an Automobile Path-Tracking Problem with Input Control of Steering and Brakes , 1986 .

[40]  W.C. Collier,et al.  Smart cars, smart highways , 1994, IEEE Spectrum.

[41]  N. V. Bhat,et al.  determining model structure for neural models by network stripping , 1992 .

[42]  Yangsheng Xu,et al.  Neural network approach to control system identification with variable activation functions , 1994, Proceedings of 1994 9th IEEE International Symposium on Intelligent Control.

[43]  Scott E. Fahlman,et al.  An empirical study of learning speed in back-propagation networks , 1988 .

[44]  Albert Y. Zomaya,et al.  Toward generating neural network structures for function approximation , 1994, Neural Networks.

[45]  Ken-ichi Funahashi,et al.  On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.

[46]  Ronald A. Hess,et al.  A Model of Driver Steering Control Behavior for Use in Assessing Vehicle Handling Qualities , 1993 .

[47]  Yamashita,et al.  Backpropagation algorithm which varies the number of hidden units , 1989 .

[48]  J. Sitte,et al.  A cartpole experiment benchmark for trainable controllers , 1993, IEEE Control Systems.

[49]  Tariq Samad Neurocontrol: concepts and applications , 1992, [Proceedings] 1992 IEEE International Conference on Systems, Man, and Cybernetics.

[50]  Jos Nijhuis,et al.  Neurocontrol for lateral vehicle guidance , 1993, IEEE Micro.

[51]  Yangsheng Xu,et al.  Hidden Markov model approach to skill learning and its application to telerobotics , 1993, IEEE Trans. Robotics Autom..

[52]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[53]  Ulrich Kramer,et al.  On the application of fuzzy sets to the analysis of the system driver-vehicle-environment , 1985, Autom..

[54]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[55]  Eric B. Bartlett,et al.  Dynamic node architecture learning: An information theoretic approach , 1994, Neural Networks.

[56]  Thomas B. Sheridan,et al.  Space teleoperation through time delay: review and prognosis , 1993, IEEE Trans. Robotics Autom..

[57]  Duane T. McRuer,et al.  Human dynamics in man-machine systems , 1980, Autom..

[58]  James S. Albus,et al.  New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)1 , 1975 .

[59]  Ramesh C. Jain,et al.  Similarity measures for image databases , 1995, Electronic Imaging.

[60]  C G Gingrich,et al.  Modeling human operators using neural networks. , 1992, ISA transactions.

[61]  T. Fukuda,et al.  Acquisition of human skills for robotic systems , 1991, Proceedings of the 1991 IEEE International Symposium on Intelligent Control.

[62]  S N Roscoe,et al.  FLIGHTDECK PERFORMANCE: THE HUMAN FACTOR (1ST ED. ) , 1990 .

[63]  A. Pentland,et al.  Toward augmented control systems , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[64]  Nathan Delson,et al.  Robot programming by human demonstration: adaptation and inconsistency in constrained motion , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[65]  Ales Ude,et al.  Trajectory generation from noisy positions of object features for teaching robot paths , 1993, Robotics Auton. Syst..

[66]  Haim J. Wolfson,et al.  On shape similarity , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[67]  Chuen-Chien Lee FUZZY LOGIC CONTROL SYSTEMS: FUZZY LOGIC CONTROLLER - PART I , 1990 .

[68]  Stephen M. Omohundro,et al.  Bumptrees for Efficient Function, Constraint and Classification Learning , 1990, NIPS.

[69]  Michael Kaiser,et al.  Transfer of Elementary Skills via Human-Robot Interaction , 1997, Adapt. Behav..

[70]  Sheng Liu,et al.  Programming deburring robots based on human demonstration with direct burr size measurement , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[71]  Richard S. Sutton,et al.  Learning and Sequential Decision Making , 1989 .

[72]  L. R. Rabiner,et al.  A probabilistic distance measure for hidden Markov models , 1985, AT&T Technical Journal.

[73]  John Moody,et al.  Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[74]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[75]  Wayne L. Shebilske,et al.  Video Games, Training, and Investigating Complex Skills , 1992 .

[76]  Yeung Yam,et al.  Two performance measures for evaluating human control strategy , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[77]  Michael C. Mozer,et al.  Skeletonization: A Technique for Trimming the Fat from a Network via Relevance Assessment , 1988, NIPS.

[78]  Kumpati S. Narendra,et al.  Identification and control of dynamical systems using neural networks , 1990, IEEE Trans. Neural Networks.

[79]  J. Spall Multivariate stochastic approximation using a simultaneous perturbation gradient approximation , 1992 .

[80]  Shoji Kurakake,et al.  Human Action Recognition Using HMM with Category-Separated Vector Quantization , 1994 .

[81]  Ulrich Krämer Brief paper: On the application of fuzzy sets to the analysis of the system driver-vehicle-environment , 1985 .

[82]  E. Fix Neural network based human performance modeling , 1990, IEEE Conference on Aerospace and Electronics.

[83]  Richard S. Sutton,et al.  Neural networks for control , 1990 .

[84]  Stefan Schaal,et al.  Memory-based robot learning , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[85]  Yangsheng Xu,et al.  Stochastic similarity for validating human control strategy models , 1998, IEEE Trans. Robotics Autom..

[86]  Yeung Yam,et al.  Optimization of human control strategy with simultaneously perturbed stochastic approximation , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[87]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[88]  Peter J. Gawthrop,et al.  Neural networks for control systems - A survey , 1992, Autom..

[89]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[90]  Mingui Sun,et al.  Measurement of signal similarity using the maxima of the wavelet transform , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[91]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[92]  李幼升,et al.  Ph , 1989 .

[93]  Michael Werman,et al.  Similarity and Affine Invariant Distances Between 2D Point Sets , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[94]  C. Atkeson,et al.  Prioritized Sweeping : Reinforcement Learning withLess Data and Less Real , 1993 .

[95]  Stefanos Kollias,et al.  An adaptive least squares algorithm for the efficient training of artificial neural networks , 1989 .

[96]  A. Kundu,et al.  Transient sonar signal classification using hidden Markov models and neural nets , 1994 .

[97]  Andrew W. Moore,et al.  Learning to recognize time series: combining ARMA models with memory-based learning , 1997, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation'.

[98]  Dean A. Pomerleau,et al.  Neural Network Perception for Mobile Robot Guidance , 1993 .

[99]  L. Glass,et al.  Oscillation and chaos in physiological control systems. , 1977, Science.

[100]  H. Harry Asada,et al.  Hybrid linguistic/numeric control of deburring robots based on human skills , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[101]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[102]  Lee A. Feldkamp,et al.  Decoupled extended Kalman filter training of feedforward layered networks , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[103]  Sharad Singhal,et al.  Training Multilayer Perceptrons with the Extende Kalman Algorithm , 1988, NIPS.

[104]  Rich Caruana,et al.  Removing the Genetics from the Standard Genetic Algorithm , 1995, ICML.

[105]  H. Harry Asada,et al.  Transferring Manipulative Skills to Robots-Representation and Acquisition of Tool Manipulative Skill , 1992 .

[106]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[107]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[108]  B. Anderson,et al.  Optimal Filtering , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[109]  R D Ervin,et al.  WORLDWIDE IVHS ACTIVITIES: A COMPARATIVE OVERVIEW , 1992 .

[110]  S. Lee,et al.  Learning expert systems for robot fine motion control , 1988, Proceedings IEEE International Symposium on Intelligent Control 1988.

[111]  M. Boninsegna,et al.  Similarity measures in computer vision , 1994, Pattern Recognit. Lett..

[112]  Daniel Gopher,et al.  The Transfer of Skill from a Computer Game Trainer to Actual Flight , 1992 .

[113]  Richard A. Davis,et al.  Time Series: Theory and Methods , 2013 .

[114]  Fuzzy Logic in Control Systems : Fuzzy Logic , 2022 .

[115]  Celestine A. Ntuen,et al.  Adaptive learning of human motion by a telerobot using a neural network model as a teacher , 1994 .

[116]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[117]  Masafumi Hagiwara,et al.  Removal of hidden units and weights for back propagation networks , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[118]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[119]  Yangsheng Xu,et al.  Learning and Transfer of Human Real-Time Control Strategies , 1997, J. Adv. Comput. Intell. Intell. Informatics.

[120]  Jeff Schneider,et al.  Robot Skill Learning Through Intelligent Experimentation , 1995 .

[121]  L. R. Rabiner,et al.  Some properties of continuous hidden Markov model representations , 1985, AT&T Technical Journal.