Automated comprehensive evaluation approach for user interface satisfaction based on concurrent think-aloud method

Abstract The concurrent think-aloud protocol (CTA) is an effective method for collecting abundant product comments related to user satisfaction during the execution of evaluation tasks. However, manual analysis of these audio comments is time-consuming and labor-intensive. This paper aims to propose an approach for automated comprehensive evaluation of user interface (UI) satisfaction. It takes advantage of text mining and sentiment analysis (SA) techniques instead of manual analysis in order to assess user comments collected by the CTA. Based on the results of the SA, the proposed approach makes use of the analytic hierarchy process (AHP) method to evaluate the overall satisfaction and support developers for UI design improvements. In order to enhance the objectivity of evaluation, a sentiment matrix originating from text mining and SA on user comments is used to replace the criteria and the relative weights of the AHP method which were previously defined by experts. A comparison between the questionnaire survey method and the proposed approach in the empirical study suggested that the latter can efficiently evaluate UI satisfaction with high accuracy and provide designers abundant and specific information directly related to defects in design. It is argued that the proposed approach could be used as an automated framework for handling any type of comments.

[1]  Ted Boren,et al.  Thinking aloud: reconciling theory and practice , 2000 .

[2]  T. Saaty Relative measurement and its generalization in decision making why pairwise comparisons are central in mathematics for the measurement of intangible factors the analytic hierarchy/network process , 2008 .

[3]  Zülal Güngör,et al.  The usability analysis with heuristic evaluation and analytic hierarchy process , 2009 .

[4]  Jean Vanderdonckt,et al.  Automated Web Evaluation by Guideline Review , 2005, J. Web Eng..

[5]  Mogamat Razeen Davids,et al.  Effect of improving the usability of an e-learning resource: a randomized trial. , 2014, Advances in physiology education.

[6]  David Pinelle,et al.  Heuristic evaluation for games: usability principles for video game design , 2008, CHI.

[7]  Jakob Grue Simonsen,et al.  Extracting usability and user experience information from online user reviews , 2013, CHI.

[8]  Yildiz Esra Albayrak,et al.  Using analytic hierarchy process (AHP) to improve human performance: An application of multiple criteria decision making problem , 2004, J. Intell. Manuf..

[9]  Nicolette de Keizer,et al.  The value of Retrospective and Concurrent Think Aloud in formative usability testing of a physician data query tool , 2015, J. Biomed. Informatics.

[10]  Marti A. Hearst,et al.  The state of the art in automating usability evaluation of user interfaces , 2001, CSUR.

[11]  Ma Ying,et al.  A Novel Chinese Text Subject Extraction Method Based on Character Co-occurrence , 2003 .

[12]  T. Hase,et al.  Design Method of UI of AV Remote Controller Based on AHP , 2008, 2008 Digest of Technical Papers - International Conference on Consumer Electronics.

[13]  Jean Vanderdonckt,et al.  Advance human–machine interface automatic evaluation , 2013, Universal Access in the Information Society.

[14]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[15]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[16]  K. A. Ericsson,et al.  Protocol Analysis: Verbal Reports as Data , 1984 .

[17]  Mehrdad Sabetzadeh,et al.  Automated Checking of Conformance to Requirements Templates Using Natural Language Processing , 2015, IEEE Transactions on Software Engineering.

[18]  Elizabeth D. Murphy,et al.  Think-aloud protocols: a comparison of three think-aloud protocols for use in testing data-dissemination web sites for usability , 2010, CHI.

[19]  Tang Da-quan Word’s semantic similarity computation method based on Hownet , 2010 .

[20]  Michel R. V. Chaudron,et al.  A cognitive perspective on developer comprehension of software design documentation , 2011, SIGDOC '11.

[21]  Erik Frekjmr,et al.  Measuring Usability: Are Effectiveness, Efficiency, and Satisfaction Really Correlated? , 2000 .

[22]  H. Simon,et al.  Protocol Analysis: Verbal Reports as Data , 1986 .

[23]  Thomas L. Saaty,et al.  DECISION MAKING WITH THE ANALYTIC HIERARCHY PROCESS , 2008 .

[24]  John Dowell,et al.  A framework for human factors evaluation , 1991 .

[25]  Yongtae Park,et al.  Review-based measurement of customer satisfaction in mobile service: Sentiment analysis and VIKOR approach , 2014, Expert Syst. Appl..

[26]  E. Anderson Customer Satisfaction and Word of Mouth , 1998 .

[27]  Paul A. Pavlou,et al.  Can online reviews reveal a product's true quality?: empirical findings and analytical modeling of Online word-of-mouth communication , 2006, EC '06.

[28]  Ming Li,et al.  An approach of product usability evaluation based on Web mining in feature fatigue analysis , 2014, Comput. Ind. Eng..

[29]  Steven Heim The Resonant Interface: HCI Foundations for Interaction Design , 2007 .

[30]  Morten Hertzum,et al.  Scrutinising usability evaluation: does thinking aloud affect behaviour and mental workload? , 2009, Behav. Inf. Technol..

[31]  Stefano Federici,et al.  Web usability evaluation with screen reader users: implementation of the partial concurrent thinking aloud technique , 2010, Cognitive Processing.

[32]  Beverly Freeman,et al.  Triggered think-aloud protocol: using eye tracking to improve usability test moderation , 2011, CHI.

[33]  Lynne Cooke,et al.  Assessing Concurrent Think-Aloud Protocol as a Usability Test Method: A Technical Communication Approach , 2010, IEEE Transactions on Professional Communication.

[34]  Valentina Grigoreanu,et al.  Informal cognitive walkthroughs (ICW): paring down and pairing up for an agile world , 2013, CHI.