Crowdsourcing-Based Web Accessibility Evaluation with Golden Maximum Likelihood Inference

Web accessibility evaluation examines how well websites comply with accessibility guidelines which help people with disabilities to perceive, navigate and contribute to the Web. This demanding task usually requires manual assessment by experts with many years of training and experience. However, not enough experts are available to carry out the increasing number of evaluation projects while non-experts often have different opinions about the presence of accessibility barriers. Addressing these issues, we introduce a crowdsourcing system with a novel truth inference algorithm to derive reliable and accurate assessments from conflicting opinions of evaluators. Extensive evaluation on 23,901 complex tasks assessed by 50 people with and without disabilities shows that our approach outperforms state of the art approaches. In addition, we conducted surveys to identify frequent barriers that people with disabilities are facing in their daily lives and the difficulty to access Web pages when they encounter these barriers. The frequencies and severities of barriers correlate with their derived importance in our evaluation project.

[1]  Hyun-Chul Kim,et al.  Bayesian Classifier Combination , 2012, AISTATS.

[2]  Matthew King,et al.  Managing usability for people with disabilities in a large Web presence , 2005, IBM Syst. J..

[3]  Chris Callison-Burch,et al.  Shared task: crowdsourced accessibility elicitation of Wikipedia articles , 2010, HLT-NAACL 2010.

[4]  Sergio Luján Mora,et al.  Introduction to Web Accessibility , 2011 .

[5]  Bin Bi,et al.  Iterative Learning for Reliable Crowdsourcing Systems , 2012 .

[6]  Jiajun Bu,et al.  Web Accessibility Evaluation in a Crowdsourcing-Based System with Expertise-Based Decision Strategy , 2018, W4A.

[7]  Xianghua Ding,et al.  Socially Embedded Work: A Study of Wheelchair Users Performing Online Crowd Work in China , 2017, CSCW.

[8]  C. Riddle Disability and Health , 2017 .

[9]  Meredith Ringel Morris,et al.  Accessible Crowdwork?: Understanding the Value in and Challenge of Microtask Employment for People with Disabilities , 2015, CSCW.

[10]  Digital Education Strategies,et al.  Introduction to Web Accessibility , 2019 .

[11]  Luís Carriço,et al.  Web not for all: a large scale study of web accessibility , 2010, W4A.

[12]  Gregg C. Vanderheiden,et al.  Web Content Accessibility Guidelines (WCAG) 2.0 , 2008 .

[13]  Markel Vigo,et al.  Quantitative metrics for measuring web accessibility , 2007, W4A '07.

[14]  Chun Chen,et al.  An optimal sampling method for web accessibility quantitative metric , 2015, W4A.

[15]  Guoliang Li,et al.  Truth Inference in Crowdsourcing: Is the Problem Solved? , 2017, Proc. VLDB Endow..

[16]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[17]  Jiajun Bu,et al.  Reliability Aware Web Accessibility Experience Metric , 2018, W4A.

[18]  Markel Vigo,et al.  Exploring the relationship between web accessibility and user experience , 2016, Int. J. Hum. Comput. Stud..

[19]  Xiaoming Zeng EVALUATION AND ENHANCEMENT OF WEB CONTENT ACCESSIBILITY FOR PERSONS WITH DISABILITIES , 2004 .

[20]  Gianluca Demartini,et al.  ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking , 2012, WWW.

[21]  David M. Pennock,et al.  Methods for Sampling Pages Uniformly from the World Wide Web , 2001 .

[22]  Joav Merrick,et al.  Disability and health. , 2004, International journal of adolescent medicine and health.

[23]  Murat Demirbas,et al.  Crowdsourcing for Multiple-Choice Question Answering , 2014, AAAI.

[24]  W. Frontera The world report on disability. , 2012, American journal of physical medicine & rehabilitation.

[25]  Bo Zhao,et al.  Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation , 2014, SIGMOD Conference.

[26]  Jiajun Bu,et al.  A task assignment strategy for crowdsourcing-based web accessibility evaluation system , 2017, W4A.

[27]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[28]  Yeliz Yesilada,et al.  The Expertise Effect on Web Accessibility Evaluation Methods , 2011, Hum. Comput. Interact..

[29]  A. P. Dawid,et al.  Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm , 1979 .