Project Ernestine: Validating a GOMS Analysis for Predicting and Explaining Real-World Task Performance

Project Ernestine served a pragmatic as well as a scientific goal: to compare the worktimes of telephone company toll and assistance operators on two different workstations and to validate a GOMS analysis for predicting and explaining real-world performance. Contrary to expectations, GOMS predicted and the data confirmed that performance with the proposed workstation was slower than with the current one. Pragmatically, this increase in performance time translates into a cost of almost $2 million a year to NYNEX. Scientifically, the GOMS models predicted performance with exceptional accuracy. The empirical data provided us with three interesting results: proof that the new workstation was slower than the old one, evidence that this difference was not constant but varied with call category, and (in a trial that spanned 4 months and collected data on 72,450 phone calls) proof that performance on the new workstation stabilized after the first month. The GOMS models predicted the first two results and explained all three. In this article, we discuss the process and results of model building as well as the design and outcome of the field trial. We assess the accuracy of GOMS predictions and use the mechanisms of the models to explain the empirical results. Last, we demonstrate how the GOMS models can be used to guide the design of a new workstation and evaluate design decisions before they are implemented.

[1]  David E. Kieras,et al.  The Acquisition and Performance of Text-Editing Skill: A Cognitive Complexity Analysis , 1990, Hum. Comput. Interact..

[2]  O. L. Davies,et al.  Statistical Methods. 6th Edition. , 1968 .

[3]  Allen Newell,et al.  Computer text-editing: An information-processing analysis of a routine cognitive skill , 1980, Cognitive Psychology.

[4]  Arthur Conan Sir Doyle,et al.  Sherlock Holmes: The Complete Novels and Stories , 1991 .

[5]  Shelly Dews,et al.  Natural dialog in a time-sensitive setting: a study of telephone operators , 1992, CHI '92.

[6]  G. W. Snedecor Statistical Methods , 1964 .

[7]  Thomas P. Moran,et al.  The evaluation of text editors: methodology and empirical results. , 1983, CACM.

[8]  G. Keppel,et al.  Design and Analysis: A Researcher's Handbook , 1976 .

[9]  P. Fitts The information capacity of the human motor system in controlling the amplitude of movement. , 1954, Journal of experimental psychology.

[10]  Marilyn Tremaine,et al.  Skilled financial planning: the cost of translating ideas into action , 1989, CHI '89.

[11]  Bonnie E. John Extensions of GOMS analyses to expert performance requiring perception of dynamic visual and auditory information , 1990, CHI '90.

[12]  Allen Newell,et al.  The Prospects for Psychological Science in Human-Computer Interaction , 1985, Hum. Comput. Interact..

[13]  O. J. Murphy,et al.  Characteristic time intervals in telephonic conversation , 1938 .

[14]  F. H. Best New transmission measuring systems for telephone circuit maintenance , 1938 .

[15]  J. Stevens Applied Multivariate Statistics for the Social Sciences , 1986 .

[16]  Allen Newell,et al.  Cumulating the science of HCI: from s-R compatibility to transcription typing , 1989, CHI '89.

[17]  Michael E. Atwood,et al.  The precis of Project Ernestine or an overview of a validation of GOMS , 1992, CHI.

[18]  Bonnie E. John,et al.  GOMS analysis for parallel activities , 1994, CHI '94.

[19]  Bonnie E. John,et al.  Toward an Engineering Model of Stimulus-Response Compatibility , 1990 .

[20]  Allen Newell,et al.  A theory of stimulus-response compatibility applied to human-computer interaction , 1985, CHI '85.

[21]  Allen and Rosenbloom Paul S. Newell,et al.  Mechanisms of Skill Acquisition and the Law of Practice , 1993 .

[22]  Bing S. Yao,et al.  Benchmark analysis of database architectures: a case study , 1986 .

[23]  G. Vining,et al.  Data Analysis: A Model-Comparison Approach , 1989 .

[24]  Gary M. Olson,et al.  The growth of cognitive modeling in human-computer interaction since GOMS , 1990 .

[25]  Bonnie Elizabeth John Contributions to engineering models of human-computer interaction. (volumes i and ii) , 1988 .

[26]  Michael E. Atwood,et al.  GOMS meets the phone company: Analytic modeling applied to real-world problems , 1990, INTERACT.

[27]  Allen Newell,et al.  The keystroke-level model for user performance time with interactive systems , 1980, CACM.

[28]  Daniel R Benigni Benchmark analysis of database architictures , 1985 .

[29]  Rory Stuart,et al.  A speech compression proposal for directory assistance operators: GOMS predictions , 1993, CHI '93.

[30]  Gary M. Olson,et al.  The Growth of Cognitive Modeling in Human-Computer Interaction Since GOMS , 1990, Hum. Comput. Interact..