Accessible skimming: faster screen reading of web pages

In our information-driven web-based society, we are all gradually falling ""victims"" to information overload [5].However, while sighted people are finding ways to sift through information faster, Internet users who are blind are experiencing an even greater information overload. These people access computers and Internet using screen-reader software, which reads the information on a computer screen sequentially using computer-generated speech. While sighted people can learn how to quickly glance over the headlines and news articles online to get the gist of information, people who are blind have to use keyboard shortcuts to listen through the content narrated by a serial audio interface. This interface does not give them an opportunity to know what content to skip and what to listen to. So, they either listen to all of the content or listen to the first part of each sentence or paragraph before they skip to the next one. In this paper, we propose an automated approach to facilitate non-visual skimming of web pages. We describe the underlying algorithm, outline a non-visual skimming interface, and report on the results of automated experiments, as well as on our user study with 23 screen-reader users. The results of the experiments suggest that we have been moderately successful in designing a viable algorithm for automatic summarization that could be used for non-visual skimming. In our user studies, we confirmed that people who are blind could read and search through online articles faster and were able to understand and remember most of what they have read with our skimming system. Finally, all 23 participants expressed genuine interest in using non-visual skimming in the future.

[1]  Leming Shi,et al.  Effect of training-sample size and classification difficulty on the accuracy of genomic predictors , 2010, Breast Cancer Research.

[2]  Mark Last,et al.  Graph-Based Keyword Extraction for Single-Document Summarization , 2008, COLING 2008.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Richard E. Ladner,et al.  WebinSitu: a comparative analysis of blind and sighted browsing behavior , 2007, Assets '07.

[5]  I. V. Ramakrishnan,et al.  What's new?: making web page updates accessible , 2008, Assets '08.

[6]  Paul S. Jacobs,et al.  To Parse or Not to Parse: Relation-Driven Text Skimming , 1990, COLING.

[7]  Stephen W. Wharton An analysis of the effects of sample size on classification performance of a histogram based cluster analysis procedure , 1984, Pattern Recognit..

[8]  I. V. Ramakrishnan,et al.  The HearSay non-visual web browser , 2007, W4A '07.

[9]  I. V. Ramakrishnan,et al.  Assistive web browsing with touch interfaces , 2010, ASSETS '10.

[10]  Mary Zajicek,et al.  A Web navigation tool for the blind , 1998, Assets '98.

[11]  I. V. Ramakrishnan,et al.  Why read if you can skim: towards enabling faster screen reading , 2012, W4A.

[12]  Gerhard Fischer,et al.  Information access in complex, poorly structured information spaces , 1991, CHI '91.

[13]  I. V. Ramakrishnan,et al.  Csurf: a context-driven non-visual web-browser , 2007, WWW '07.

[14]  Klaus Zechner,et al.  Fast Generation of Abstracts from General Domain Text Corpora by Extracting Relevant Sentences , 1996, COLING.

[15]  Vibhu O. Mittal,et al.  OCELOT: a system for summarizing Web pages , 2000, SIGIR '00.

[16]  Hal Berghel,et al.  Cyberspace 2000: dealing with information overload , 1997, CACM.

[17]  Sadaoki Furui,et al.  A Statistical Approach to Automatic Speech Summarization , 2003, EURASIP J. Adv. Signal Process..

[18]  Jihoon Yang,et al.  Extracting sentence segments for text summarization: a machine learning approach , 2000, SIGIR '00.

[19]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[20]  D. Laskin Dealing with information overload. , 1994, Journal of oral and maxillofacial surgery : official journal of the American Association of Oral and Maxillofacial Surgeons.

[21]  Ii Gerald Francis Dejong Skimming stories in real time: an experiment in integrated understanding. , 1979 .

[22]  I. V. Ramakrishnan,et al.  More than meets the eye: a survey of screen-reader browsing strategies , 2010, W4A.

[23]  Jade Goldstein-Stewart,et al.  Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.

[24]  Cécile Paris,et al.  Automatically summarising Web sites: is there a way around it? , 2000, CIKM '00.

[25]  Dragomir R. Radev,et al.  Introduction to the Special Issue on Summarization , 2002, CL.

[26]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[27]  Simon Harper,et al.  Gist summaries for visually impaired surfers , 2005, Assets '05.

[28]  I. V. Ramakrishnan,et al.  Hearsay: a new generation context-driven multi-modal assistive web browser , 2010, WWW '10.

[29]  I. V. Ramakrishnan,et al.  Mixture model based label association techniques for web accessibility , 2010, UIST.

[30]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[31]  John Sweller,et al.  Cognitive Load During Problem Solving: Effects on Learning , 1988, Cogn. Sci..