Combining Concurrent Think-Aloud Protocols and Eye-Tracking Observations: An Analysis of Verbalizations and Silences

Research problem: Concurrent think-aloud (CTA) protocols are one of the dominant approaches of usability testing. However, there is still debate about the validity of the method, partly focusing on the usefulness and exhaustiveness of participants' verbalizations. The rise of eye-tracking technology sheds new light on this discussion, as participants' working processes can now be observed in more detail. Research questions: (1) What kinds of verbalizations do participants produce, and how do they relate to the information that can be directly observed using eye tracking? (2) What do eye movements reveal about cognitive processes at times when participants stop verbalizing? Literature review: Our study replicates an earlier study by Cooke (2010), who used a combination of CTA protocols and eye tracking in a small sample with experienced and highly educated participants to investigate the validity of CTA. Cooke's results suggest that the additional value of participants' verbalizations is limited: at least 77% of the verbalizations referred to things that could be easily observed with eye tracking. Methodology: We conducted a study in which 60 participants with different characteristics performed tasks on informational websites. During their task performance, they verbalized their thoughts, and simultaneously their eye movements were measured. The resulting think-aloud protocols were divided in verbalization units, which were coded into content types. Silences were registered, and eye movements during these silences were analyzed. Results and discussion: We found a different distribution of verbalization types than Cooke (2010) reported, with far more verbalizations where participants formulated doubts, judgments on the website, or expressions of frustration. In our study, verbalizations provided a substantial contribution in addition to the directly observable user problems. We measured a rather high percentage of silences (27%), during which participants most often were scanning pages for information. During these silences, interesting observations could be made about users' processes and obstacles on the website. The implication of our study is that we now have a better understanding of the types of verbalizations that a CTA evaluation might generate. Further, we know that relevant usability observations can be made during silences. A limitation is that we do not know yet the influence of specific characteristics of the evaluation setting on the types of verbalizations and silences. Future research should focus on the influence of evaluation settings on the outcomes of an evaluation, in particular, the influence of characteristics of the participants who are involved in the study.

[1]  Joseph H. Goldberg,et al.  Chapter 23 – Eye Tracking in Usability Evaluation: A Practitioner's Guide , 2003 .

[2]  Fred Paas,et al.  Uncovering cognitive processes: Different techniques that can contribute to cognitive load research and instruction , 2009, Comput. Hum. Behav..

[3]  L. Cooke,et al.  Using eye tracking to address limitations in think-aloud protocol , 2005, IPCC 2005. Proceedings. International Professional Communication Conference, 2005..

[4]  Linden J. Ball,et al.  An Eye Movement Analysis of Web Page Usability , 2002 .

[5]  Ted Boren,et al.  Thinking aloud: reconciling theory and practice , 2000 .

[6]  K. Rayner Eye movements in reading and information processing: 20 years of research. , 1998, Psychological bulletin.

[7]  Stephanie Wilson,et al.  Identifying web usability problems from eye-tracking data , 2007, BCS HCI.

[8]  Menno D. T. de Jong,et al.  Does think aloud work?: how do we know? , 2006, CHI Extended Abstracts.

[9]  Tommy Strandvall,et al.  Eye Tracking in Human-Computer Interaction and Usability Research , 2009, INTERACT.

[10]  Linden J. Ball,et al.  Applying the post-experience eye-tracked protocol (PEEP) method in usability testing. , 2006 .

[11]  Zhiwei Guan,et al.  The validity of the stimulated retrospective think-aloud method as measured by eye tracking , 2006, CHI.

[12]  F. Paas,et al.  Uncovering the problem-solving process: cued retrospective reporting versus concurrent and retrospective reporting. , 2005, Journal of experimental psychology. Applied.

[13]  Menno D.T. de Jong,et al.  Users’ Abilities to Review Web Site Pages , 2012 .

[14]  L. van Waes,et al.  Thinking aloud as a method for testing the usability of Websites: the influence of task variation on the evaluation of hypertext , 2000 .

[15]  Morten Hertzum,et al.  Scrutinising usability evaluation: does thinking aloud affect behaviour and mental workload? , 2009, Behav. Inf. Technol..

[16]  Anna L. Cox,et al.  The Role of Mouse Movements in Interactive Search , 2006 .

[17]  S. Monsell Task switching , 2003, Trends in Cognitive Sciences.

[18]  Timothy D. Wilson,et al.  Telling more than we can know: Verbal reports on mental processes. , 1977 .

[19]  Huub van den Bergh,et al.  Measuring the quality of governmental websites in a controlled versus an online setting with the 'Website Evaluation Questionnaire' , 2012, Gov. Inf. Q..

[20]  Menno D. T. de Jong,et al.  Evaluating municipal websites: A methodological comparison of three think-aloud variants , 2009, Gov. Inf. Q..

[21]  Linden J. Ball,et al.  In Search of Salience: A Response-time and Eye-movement Analysis of Bookmark Recognition , 2004, BCS HCI.

[22]  M.D.T. de Jong,et al.  Constructive Interaction: An Analysis of Verbal Interaction in a Usability Setting , 2006, IEEE Transactions on Professional Communication.

[23]  K. A. Ericsson,et al.  Protocol analysis: Verbal reports as data, Rev. ed. , 1993 .

[24]  Menno D. T. de Jong,et al.  Employing think-aloud protocols and constructive interaction to test the usability of online library catalogues: a methodological comparison , 2004, Interact. Comput..

[25]  Joseph H. Goldberg,et al.  Computer interface evaluation using eye movements: methods and constructs , 1999 .

[26]  M. Just,et al.  Eye fixations and cognitive processes , 1976, Cognitive Psychology.

[27]  Tingting Zhao,et al.  Keep talking: an analysis of participant utterances gathered using two concurrent think-aloud methods , 2010, NordiCHI.

[28]  N. Ummelen,et al.  Measuring reading behavior in policy documents: a comparison of two instruments , 2000 .

[29]  Leo Lentz,et al.  Retrospective think-aloud method: using eye movements as an extra cue for participants' verbalizations , 2011, CHI.

[30]  Victoria A. Bowers Concurrent versus Retrospective Verbal Protocol for Comparing Window Usability , 1990 .

[31]  Peter Jan Schellens,et al.  Evaluation of an Informational Web Site: Three Variants of the Think-aloud Method Compared , 2007 .

[32]  Elizabeth D. Murphy,et al.  Think-aloud protocols: a comparison of three think-aloud protocols for use in testing data-dissemination web sites for usability , 2010, CHI.

[33]  Linden J. Ball,et al.  Cueing retrospective verbal reports in usability testing through eye-movement replay , 2007, BCS HCI.

[34]  Menno D. T. de Jong,et al.  Retrospective vs. concurrent think-aloud protocols: testing the usability of an online library catalogue , 2003, Behav. Inf. Technol..

[35]  Kasper Hornbæk,et al.  What do usability evaluators do in practice?: an explorative study of think-aloud testing , 2006, DIS '06.

[36]  William P. Eveland,et al.  Examining Information Processing on the World Wide Web Using Think Aloud Protocols , 2000 .

[37]  Mark A. Neerincx,et al.  Support concepts for Web navigation: a cognitive engineering approach , 2001, WWW '01.

[38]  Aulikki Hyrskykari,et al.  Gaze Path Stimulation in Retrospective Think-Aloud , 2008 .

[39]  Lynne Cooke,et al.  Assessing Concurrent Think-Aloud Protocol as a Usability Test Method: A Technical Communication Approach , 2010, IEEE Transactions on Professional Communication.

[40]  E. Krahmer,et al.  Thinking about thinking aloud: a comparison of two verbal protocols for usability testing , 2004, IEEE Transactions on Professional Communication.