Purely vision-based segmentation of web pages for assistive technology

We use a novel vision-based method to analyze the layout of a web page.Our method produces a hierarchical segmentation of the page reflecting its structure.Vision-based methods are not sensitive to implementation language or complexity.The visual presentation of a page provides rich information about semantic structure.This structure can help create modified presentations for users with assistive needs. We propose a system for analyzing the structure of a web page based on purely visual information, rather than on implementation details. This is advantageous because regardless of the complexity of the underlying implementation, the web page is designed to be easily interpreted visually. Our method produces a hierarchical segmentation reflecting the visual structure of the rendered page. This rich information about the presentation of the web page can be used by other systems which produce alternate presentations more suitable for users with visual or cognitive disabilities.

[1]  Carole A. Goble,et al.  Screen Readers Cannot See Ontology Based Semantic Annotation for Visually Impaired Web Travellers , 2004 .

[2]  David Doermann,et al.  Handbook of Document Image Processing and Recognition , 2014, Springer London.

[3]  A. A. Zlatopolsky Automated document segmentation , 1994, Pattern Recognit. Lett..

[4]  Frank Y. Shih,et al.  A document segmentation, classification and recognition system , 1992, Proceedings of the Second International Conference on Systems Integration.

[5]  Hassan F. Eldirdiery,et al.  Web Document Segmentation for Better Extraction of Information: A Review , 2015 .

[6]  I. V. Ramakrishnan,et al.  The HearSay non-visual web browser , 2007, W4A '07.

[7]  Masahiro Watanabe,et al.  Improving Accessibility Through the Visual Structure of Web Contents , 2007, HCI.

[8]  Wei-Ying Ma,et al.  Learning important models for web page blocks based on layout and content analysis , 2004, SKDD.

[9]  Richard E. Ladner,et al.  WebInSight:: making web images accessible , 2006, Assets '06.

[10]  Vicki L. Hanson,et al.  Web accessibility: a broader view , 2004, WWW '04.

[11]  Thomas S. Tullis,et al.  Older Adults and the Web: Lessons Learned from Eye-Tracking , 2007, HCI.

[12]  Vicki L. Hanson Cognition, Age, and Web Browsing , 2009, HCI.

[13]  Antonis A. Argyros,et al.  Vision-Based SLAM and Moving Objects Tracking for the Perceptual Support of a Smart Walker Platform , 2014, ECCV Workshops.

[14]  Barbara S. Chaparro,et al.  Evaluating websites for older adults: adherence to ‘senior-friendly’ guidelines and end-user performance , 2008, Behav. Inf. Technol..

[15]  I. V. Ramakrishnan,et al.  Csurf: a context-driven non-visual web-browser , 2007, WWW '07.

[16]  Harry Hochheiser,et al.  Revisiting breadth vs. depth in menu structures for blind users of screen readers , 2010, Interact. Comput..

[17]  G. W. Furnas,et al.  Generalized fisheye views , 1986, CHI '86.

[18]  David H. Marimont,et al.  A probabilistic framework for edge detection and scale selection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[19]  Guillermo Sapiro,et al.  Edges as Outliers: Anisotropic Smoothing Using Local Image Statistics , 1999, Scale-Space.

[20]  Wei-Ying Ma,et al.  Detecting web page structure for adaptive viewing on small form factor devices , 2003, WWW '03.

[21]  Richard E. Ladner,et al.  Accessmonkey: a collaborative scripting framework for web users and developers , 2007, W4A '07.

[22]  I. V. Ramakrishnan,et al.  More than meets the eye: a survey of screen-reader browsing strategies , 2010, W4A.

[23]  Wei-Ying Ma,et al.  VIPS: a Vision-based Page Segmentation Algorithm , 2003 .

[24]  Alasdair King,et al.  Personalising web page presentation for older people , 2006, Interact. Comput..

[25]  Francesca Cesarini,et al.  Structured document segmentation and representation by the modified X-Y tree , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[26]  Xing Xie,et al.  Adapting Web pages for small-screen devices , 2005, IEEE Internet Computing.

[27]  Allan D. Jepson,et al.  Qualitative probabilities for image interpretation , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[28]  Yuanzhen Li,et al.  Feature congestion: a measure of display clutter , 2005, CHI.

[29]  Clayton Lewis Issues in Web Presentation for Cognitive Accessibility , 2011, HCI.

[30]  Wee Sun Lee,et al.  Understanding the function of web elements for mobile content delivery using random walk models , 2005, WWW '05.

[31]  Aaron Andersen,et al.  Improving the outcomes of students with cognitive and learning disabilities: phase I development for a web accessibility tool , 2007, Assets '07.

[32]  M. Talbot,et al.  Trajectory capture in frontal plane geometry for visually impaired , 2006 .

[33]  Ping Zhong,et al.  Detecting Web Content Function Using Generalized Hidden Markov Model , 2006, 2006 5th International Conference on Machine Learning and Applications (ICMLA'06).

[34]  Eric Saund,et al.  A perceptually-supported sketch editor , 1994, UIST '94.

[35]  Michael Werman,et al.  A Linear Time Histogram Metric for Improved SIFT Matching , 2008, ECCV.

[36]  Hironobu Takagi,et al.  Annotation-based transcoding for nonvisual web access , 2000, Assets '00.

[37]  Ruslan R. Fayzrakhmanov,et al.  A versatile model for web page representation, information extraction and content re-packaging , 2011, DocEng '11.

[38]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[39]  Apostolos Antonacopoulos,et al.  Page Segmentation Using the Description of the Background , 1998, Comput. Vis. Image Underst..

[40]  David Sloan,et al.  Understanding the role of age and fluid intelligence in information search , 2012, ASSETS '12.

[41]  Simeon Keates,et al.  Improving Web accessibility through an enhanced open-source browser , 2005, IBM Syst. J..

[42]  Junji Maeda,et al.  Site-wide annotation: reconstructing existing pages to be accessible , 2002, Assets '02.

[43]  Michael Werman,et al.  Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[44]  Karyn Moffatt,et al.  Older-adult HCI: why should we care? , 2013, INTR.

[45]  Hanhwe Kim,et al.  Spatial metaphors and disorientation in hypertext browsing , 1995, Behav. Inf. Technol..

[46]  Morgan Dixon,et al.  Prefab: implementing advanced behaviors using pixel-based reverse engineering of interface structure , 2010, CHI.

[47]  Jacob O. Wobbrock,et al.  In the shadow of misperception: assistive technology use and social interactions , 2011, CHI.

[48]  Michael Cormier,et al.  A Robust Vision-Based Framework for Screen Readers , 2014, ECCV Workshops.

[49]  Shari Trewin,et al.  Cognitive impairments and Web 2.0 , 2009, Universal Access in the Information Society.

[50]  Walter S. Lasecki,et al.  RegionSpeak: Quick Comprehensive Spatial Descriptions of Complex Images for Blind Users , 2015, CHI.

[51]  Yeliz Yesilada,et al.  Barriers common to mobile and disabled web users , 2011, Interact. Comput..

[52]  Alison Lee Scaffolding visually cluttered web pages to facilitate accessibility , 2004, AVI.

[53]  Constantine Stephanidis,et al.  Universal Access in Human-Computer Interaction - Addressing Diversity , 2009 .

[54]  Peter G. Fairweather How older and younger adults differ in their approach to problem solving on a complex website , 2008, Assets '08.

[55]  Sidney S. Fels,et al.  A visual recipe book for persons with language impairments , 2005, CHI.

[56]  Alex Mihailidis,et al.  An Intelligent Powered Wheelchair for Users with Dementia: Case Studies with NOAH (Navigation and Obstacle Avoidance Help) , 2012, AAAI Fall Symposium: Artificial Intelligence for Gerontechnology.

[57]  Guido Bologna,et al.  Descending Stairs Detection with Low-Power Sensors , 2014, ECCV Workshops.

[58]  Ian Williams,et al.  A performance evaluation of statistical tests for edge detection in textured images , 2014, Comput. Vis. Image Underst..

[59]  Michael Brooks,et al.  Accessible online content creation by end users , 2013, CHI.