Adapting Web content for low-literacy readers by using lexical elaboration and named entities labeling

This paper presents an approach for helping low-literacy readers access online information. “Educational FACILITA” is a Web content adaptation tool that provides innovative features and follows more intuitive interaction models with respect to accessibility. In particular, we propose an interaction model and a Web application that apply two natural language processing tasks, lexical elaboration and named entity labeling, to improve Web accessibility. We report the results of a pilot usability study carried out with low-literacy users. The preliminary results show that “Educational FACILITA” improves the comprehension of text elements, although the assistance mechanisms can also confuse users when they introduce word sense ambiguity, that is, when the list of synonyms gathered for a complex word mixes several of its meanings. This finding points to a future solution in which the correct sense of a complex word in its sentence is identified, addressing this pervasive characteristic of natural languages. The pilot study also found that experienced computer users consider the tool more useful than novice computer users do.
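To make the ambiguity problem concrete, the following is a minimal sketch of lexical elaboration without sense disambiguation. It is not the tool's implementation: the lexicon, the English example words, and the function names `elaborate` and `is_ambiguous` are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical toy lexicon (an assumption, not data from the paper):
# each complex word maps to a list of senses, and each sense is a list
# of simpler synonyms.
TOY_LEXICON = {
    "acquire": [["get", "buy"]],
    "charge": [["price", "fee"], ["accusation"], ["attack"]],
}


def elaborate(text, lexicon=TOY_LEXICON):
    """Append simpler synonyms in brackets after each word found in the lexicon."""

    def add_synonyms(match):
        word = match.group(0)
        senses = lexicon.get(word.lower())
        if not senses:
            return word
        # No sense disambiguation: synonyms from every sense are merged,
        # which is how unrelated meanings end up in one list.
        synonyms = [s for sense in senses for s in sense]
        return f"{word} [{', '.join(synonyms)}]"

    return re.sub(r"[A-Za-z]+", add_synonyms, text)


def is_ambiguous(word, lexicon=TOY_LEXICON):
    """Return True when the lexicon lists more than one sense for the word."""
    return len(lexicon.get(word.lower(), [])) > 1


print(elaborate("The charge was dropped."))
```

In this sketch, the elaborated sentence lists "price", "fee", "accusation" and "attack" side by side for "charge", even though only one sense fits the sentence. A sense-aware version would select a single sense list before building the bracketed gloss.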
