Metric selection for evaluating human supervisory control of unmanned vehicles

Broad metric classes were proposed in the literature in order to facilitate metric selection for evaluating human-autonomous vehicle interaction. However, there still lacks a systematic method for selecting an efficient set of metrics from the many metrics available. We previously identified a list of evaluation criteria that can help determine the quality of a metric, and generated a list of potential metric costs and benefits. Depending on research objectives and limitations, these costs and benefits can have different weights of importance. Through an experiment with subject matter experts, we investigated which metric characteristics human factors practitioners consider to be important in evaluating human supervisory control of unmanned vehicles. We also tested two different multi-criteria decision making methods to help practitioners assign subjective weights to the cost/benefit criteria. The majority of participants rated the evaluation criteria used in both tools as very useful. However, the majority of participants' metric selections before using the methods were the same as the suggestions provided by the methods. Since determining weights of metric importance is an inherently subjective process, even with objective computational tools, the real value of using such a tool may be reminding human factors practitioners of the important experimental criteria and relationships between these criteria that should be considered when designing an experiment.

[1]  P. Pina Identifying Generalizable Metric Classes to Evaluate Human-Robot Teams , 2008 .

[2]  V. Gawron Human Performance, Workload, and Situational Awareness Measures Handbook , 2008 .

[3]  J. C. Byers,et al.  Comparison of Four Subjective Workload Rating Scales , 1992 .

[4]  Yoji Akao,et al.  Quality Function Deployment : Integrating Customer Requirements into Product Design , 1990 .

[5]  Kim J. Vicente,et al.  Attention Allocation within the Abstraction Hierarchy , 1997, Int. J. Hum. Comput. Stud..

[6]  T. Saaty Fundamentals of Decision Making and Priority Theory With the Analytic Hierarchy Process , 2000 .

[7]  F. Thomas Eggemeier,et al.  Workload assessment methodology. , 1986 .

[8]  John G. Casali,et al.  A Validated Rating Scale for Global Mental Workload Measurement Applications , 1983 .

[9]  Thomas B. Sheridan,et al.  Telerobotics, Automation, and Human Supervisory Control , 2003 .

[10]  Nancy J. Cooke,et al.  Advances in Measuring Team Cognition , 2003 .

[11]  F. T. Eggemeier,et al.  The Effect of Delayed Report on Subjective Ratings of Mental Workload , 1983 .

[12]  Jacob W. Crandall,et al.  Identifying Predictive Metrics for Supervisory Control of Multiple Robots , 2007, IEEE Transactions on Robotics.

[13]  Jean Scholtz,et al.  Common metrics for human-robot interaction , 2006, HRI '06.

[14]  G. Oehlert A first course in design and analysis of experiments , 2000 .

[15]  Dominik Aronsky,et al.  Tracking Workload in the Emergency Department , 2006, Hum. Factors.

[16]  Jean Scholtz,et al.  Evaluation of human-robot interaction awareness in search and rescue , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[17]  M. Bohanec,et al.  The Analytic Hierarchy Process , 2004 .

[18]  Robin R. Murphy,et al.  Review of Human Studies Methods in HRI and Recommendations , 2010, Int. J. Soc. Robotics.

[19]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[20]  Alphonse Chapanis,et al.  Research techniques in human engineering , 1959 .

[21]  Birsen Donmez,et al.  Selecting Metrics to Evaluate Human Supervisory Control Applications , 2008 .

[22]  Birsen Donmez,et al.  Evaluation criteria for human-automation performance metrics , 2008, PerMIS.

[23]  Jean Scholtz,et al.  Theory and evaluation of human robot interactions , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[24]  Stan Schenkerman,et al.  Inducement of Nonexistent Order by the Analytic Hierarchy Process , 1997 .

[25]  Dan R. Olsen,et al.  Metrics for Evaluating Human-Robot Interactions , 2003 .

[26]  Linda Ng Boyle,et al.  Safety implications of providing real-time feedback to distracted drivers. , 2007, Accident; analysis and prevention.

[27]  R. D. Holder,et al.  Some Comments on the Analytic Hierarchy Process , 1990 .

[28]  Birsen Donmez,et al.  Auditory Decision Aiding in Supervisory Control of Multiple Unmanned Aerial Vehicles , 2009, Hum. Factors.

[29]  S. Hart,et al.  Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research , 1988 .

[30]  L. Kaufman,et al.  Handbook of Perception and Human Performance. Volume 2. Cognitive Processes and Performance , 1994 .

[31]  Michael A. Vidulich,et al.  Testing a Subjective Metric of Situation Awareness , 1991 .

[32]  F. Thomas Eggemeier,et al.  Workload Measurement in System Design and Evaluation , 1985 .

[33]  Mica R. Endsley,et al.  Designing for Situation Awareness : An Approach to User-Centered Design , 2003 .