Evaluating website quality: Five studies on user-focused evaluation methods

The benefits of evaluating websites with potential users are widely acknowledged. Several methods can be used to evaluate website quality from the user's perspective. In current practice, however, many evaluations are conducted with inadequate methods that lack research-based validation. This thesis aims to provide more insight into evaluation methodology and to contribute to a higher standard of website evaluation in practice.

A first way to evaluate website quality is to measure users' opinions. This is often done with questionnaires, which gather opinions in a cheap, fast, and easy way. However, many questionnaires lack a solid statistical basis and a justification of the chosen quality dimensions and questions. We therefore developed the Website Evaluation Questionnaire (WEQ), which was specifically designed for the evaluation of governmental websites. In a study in online and laboratory settings, the WEQ proved to be a valid and reliable instrument.

A way to gather more specific user opinions is to invite participants to review website pages. Participants provide their comments by clicking a feedback button, marking a problematic segment, and formulating their feedback. There has been debate about the extent to which users are able to provide relevant feedback. The results of our studies showed that participants were able to provide useful feedback: they signalled many relevant problems that were indeed experienced by users who needed to find information on the website.

Website quality can also be measured during participants' task performance. A frequently used method is the concurrent think-aloud method (CTA), in which participants verbalize their thoughts while performing tasks. There have been doubts about the usefulness and exhaustiveness of participants' verbalizations. We therefore combined CTA with eye tracking in order to examine which cognitive processes participants do and do not verbalize. The results showed that the participants' verbalizations provided substantial information in addition to the directly observable user problems. There was also a rather high percentage of silence (27% of task time) during which interesting observations could be made about users' processes and obstacles. A thorough evaluation should therefore combine verbalizations with (eye-tracking) observations.

In a retrospective think-aloud (RTA) evaluation, participants verbalize their thoughts afterwards while watching a recording of their performance. A problem with RTA is that participants do not always remember the thoughts they had during task performance. We therefore complemented the dynamic screen replay of participants' actions (pages visited and mouse movements) with a dynamic gaze replay of their eye movements. Contrary to our expectations, no differences were found between the two conditions.

It is not possible to draw conclusions about the single best method. The value of a specific method is strongly influenced by the goals and context of an evaluation. Moreover, the outcomes of an evaluation depend not only on the method, but also on other choices made during the evaluation, such as participant selection, tasks, and the subsequent analysis.
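The abstract does not detail how the WEQ's reliability was established; as a purely illustrative aside, the internal consistency of a questionnaire scale is commonly summarized with Cronbach's alpha. The sketch below computes it for hypothetical item scores on a single quality dimension (all names and data are invented for illustration, not taken from the WEQ studies):

```python
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """Cronbach's alpha for a (respondents x items) score matrix."""
    k = items.shape[1]                         # number of items in the scale
    item_vars = items.var(axis=0, ddof=1)      # per-item sample variances
    total_var = items.sum(axis=1).var(ddof=1)  # variance of the sum score
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Hypothetical 5-point ratings from six respondents on a four-item scale.
scores = np.array([
    [4, 5, 4, 4],
    [3, 3, 4, 3],
    [5, 5, 5, 4],
    [2, 3, 2, 3],
    [4, 4, 5, 4],
    [3, 4, 3, 3],
])
print(f"alpha = {cronbach_alpha(scores):.2f}")  # values >= .70 are often deemed acceptable
```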

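Similarly, the 27% silence figure in the CTA study is simply the proportion of session time not covered by any verbalization. A minimal sketch of that bookkeeping, assuming hypothetical timestamped utterance intervals from one recorded session, could look like this:

```python
# Hypothetical utterance intervals (start, end) in seconds within one session.
utterances = [(0.0, 12.5), (15.0, 40.0), (48.0, 90.0), (110.0, 176.5)]
session_length = 200.0  # total task time in seconds (invented for illustration)

def silence_share(intervals, total):
    """Fraction of the session not covered by any utterance interval."""
    # Merge overlapping intervals so simultaneous talk is not double-counted.
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    spoken = sum(end - start for start, end in merged)
    return 1 - spoken / total

print(f"silent: {silence_share(utterances, session_length):.0%}")
```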