Understanding and improving navigation within electronic documents

Electronic documents form an integral part of the modern computer age—virtually all personal computers have the ability to create, store and display their content. A connection to the Internet provides users with an almost endless source of documents, be they web-pages, word-processor files or emails. However, the entire contents of an electronic document are often too large to be usefully presented on a user’s screen, at a single point in time. This issue is usually overcome by placing the content inside a scrolling environment. The view onto the document is then modified by directly adjusting a scrollbar or by employing tools such as the mousewheel or paging keys. Applications may also provide methods for adjusting the document’s zoom and page layout. The scrollbar has seen widespread adoption, becoming the default tool used to visualise large information spaces. Despite its extensive deployment, researchers have little knowledge on how this and related navigation tools are used in an everyday work environment. A characterisation of users’ actions would allow designers to identify common behaviours and areas of inefficiency as they strive to improve navigation techniques. To fill this knowledge gap, this thesis aims to understand and improve navigation within desktop-based electronic documents. This goal is achieved using a five step process. First, the literature is used to explore document navigation tasks and the tools currently available to support electronic document navigation. Second, a software tool called AppMonitor, that logs users’ navigation actions, was developed. Third, AppMonitor was deployed in a longitudinal study to characterise document navigation actions in Microsoft Word and Adobe Reader. Forth, to compliment this study, two task-centric observations of electronic document navigation were performed, to probe the reasons for navigation tool selection. Finally, the Footprints Scrollbar was developed to improve one common aspect of navigation—within-document revisitation. To begin, two areas of current knowledge in this domain are overviewed: paper and electronic document navigation and electronic document navigation tools. The literature review produced five categories of document navigation tasks: ‘overviewing and browsing’, ‘reading’, ‘annotating and writing’, ‘searching’ and ‘revisitation’. In a similar fashion, electronic document navigation tools were reviewed and divided into eight categories: core navigation tools (those commonly found in today’s navigation systems), input devices, scrollbar augmentations, content-aware navigation aids, visualisations that provide multiple document views, indirect manipulation techniques, zooming tools and revisitation tools. The literature lacked evidence of an understanding of how these current document navigation tools are used. To aid the gathering of empirical data on tool use, the AppMonitor tool was developed. It records user actions in unmodified Windows applications—specifically for this research, Microsoft Word and Adobe Reader. It logs low-level interactions such as “left mouse button pressed” and “Ctrl-f pressed” as well as high level ‘logical’ actions such as menu selections and scrollbar manipulations. It requires no user input to perform these tasks, allowing study participants to continue with their everyday work. To collect data to form a characterisation of document navigation actions, 14 participants installed AppMonitor on their computer for 120 days. This study found that users primarily employ the mousewheel, scrollbar thumb and paging keys for navigation. Further, many advanced navigation tools that are lauded for their efficiency, including bookmarks and search tools, are rarely used. The longitudinal study provided valuable insights into the use of navigation tools. To understand the reasons behind this tool use, two task-centric observations of electronic document navigation were conducted. The first asked participants to perform a series of specific navigation tasks while AppMonitor logged their actions. The second was performed as a series of interactive sessions, where users performed a particular task and were then probed on their tool choice. These two studies found that many users are not aware of the advanced navigation tools that could significantly improve their navigation efficiency. Finally, the characterisations highlighted within-document revisitation as a commonly performed task, with current tools that support this action rarely used. To address this problem, the analysis, design and evaluation of a Footprints Scrollbar is presented. It places marks inside the scrollbar trough and provides shortcuts to aid users return to previously visited locations. The Footprints Scrollbar was significantly faster and subjectively preferred over a standard scrollbar for revisitation tasks. To summarise, this thesis contributes a literature review of document navigation and electronic document navigation tools; the design and implementation of AppMonitor—a tool to monitor user actions in unmodifiedWindows applications; a longitudinal study describing the navigation actions users perform; two taskcentric studies examining why actions are performed; and the Footprints Scrollbar, a tool to aid within-document revisitation tasks.

[1]  D. Gordon Rohman,et al.  Pre-Writing the Stage of Discovery in the Writing Process , 1965 .

[2]  A. Kornhauser The Human Problems of an Industrial Civilization. , 1934 .

[3]  Pierre Dragicevic,et al.  Strategies for accelerating on-line learning of hotkeys , 2007, CHI.

[4]  Yangzhou Du,et al.  Allowing camera tilts for document navigation in the standard GUI: a discussion and an experiment , 2006, AVI '06.

[5]  Elaine Wherry,et al.  Scroll ring performance evaluation , 2003, CHI Extended Abstracts.

[6]  Joseph Moses Juran,et al.  Quality-control handbook , 1951 .

[7]  Carl Gutwin,et al.  Improving revisitation in fisheye views with visit wear , 2005, CHI.

[8]  Olivier Chapuis,et al.  Quantifying degree of goal directedness in document navigation: application to the evaluation of the perspective-drag technique , 2007, CHI.

[9]  Max Mühlhäuser,et al.  Digital paper bookmarks: collaborative structuring, indexing and tagging of paper documents , 2008, CHI Extended Abstracts.

[10]  J. W. Hunt,et al.  An Algorithm for Differential File Comparison , 2008 .

[11]  Edward Cutrell,et al.  Quantitative analysis of scrolling techniques , 2002, CHI.

[12]  Jonathan Trevor,et al.  Anchored conversations: chatting in the context of a document , 2000, CHI.

[13]  Parisa Eslambolchilar,et al.  Multimodal feedback for tilt controlled speed dependent automatic zooming , 2004 .

[14]  Brad A. Myers,et al.  Two-handed input using a PDA and a mouse , 2000, CHI.

[15]  Robert W. Reeder,et al.  WebLogger: A Data Collection Tool for Web-use Studies , 2000 .

[16]  Carl Gutwin,et al.  Workspace awareness support with radar views , 1996, CHI Conference Companion.

[17]  Kenton O'Hara,et al.  Student readers' use of library documents: implications for library technologies , 1998, CHI.

[18]  Michael A. Shepherd,et al.  The impact of task on the usage of web browser navigation mechanisms , 2006, Graphics Interface.

[19]  Frank Linton,et al.  OWL: A Recommender System for Organization-Wide Learning , 2000, J. Educ. Technol. Soc..

[20]  Joanna McGrenere,et al.  A comparison of static, adaptive, and adaptable menus , 2004, CHI.

[21]  Ivan E. Sutherland,et al.  Sketchpad a Man-Machine Graphical Communication System , 1899, Outstanding Dissertations in the Computer Sciences.

[22]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[23]  R. Darin Ellis,et al.  Listener: A tool for client-side investigation of hypermedia navigation behavior , 1998 .

[24]  Mikael Wiberg,et al.  Scrollpad: tangible scrolling with mobile devices , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[25]  François Guimbretière,et al.  Flipper: a new method of digital document navigation , 2005, CHI EA '05.

[26]  Ben Shneiderman,et al.  Designing The User Interface , 2013 .

[27]  Allison Woodruff,et al.  Popout prism: adding perceptual principles to overview+detail document interfaces , 2002, CHI.

[28]  Yves Guiard,et al.  Multiscale pointing: facilitating pan-zoom coordination , 2002, CHI Extended Abstracts.

[29]  George Buchanan,et al.  Investigating Document Triage on Paper and Electronic Media , 2007, ECDL.

[30]  Andy Cockburn,et al.  Comparing speed-dependent automatic zooming with traditional scroll, pan and zoom methods , 2004 .

[31]  Xiangshi Ren,et al.  An Empirical Evaluation of Seven Mice for Scrolling Tasks , 2007, 2007 International Conference on Mechatronics and Automation.

[32]  Kasper Hornbæk,et al.  Reading of electronic documents: the usability of linear, fisheye, and overview+detail interfaces , 2001, CHI.

[33]  Jock D. Mackinlay,et al.  The document lens , 1993, UIST '93.

[34]  Ian H. Witten,et al.  Seeking information in realistic books: a user study , 2008, JCDL '08.

[35]  James D. Hollan,et al.  Pad++: a zooming graphical interface for exploring alternate interface physics , 1994, UIST '94.

[36]  Shumin Zhai,et al.  View size and pointing difficulty in multi-scale navigation , 2004, AVI.

[37]  Mikkel Rønne Jakobsen,et al.  Evaluating a fisheye view of source code , 2006, CHI.

[38]  Bongshin Lee,et al.  Fishnet, a fisheye web browser with search term popouts: a comparative evaluation with overview and linear view , 2004, AVI.

[39]  Victor Kaptelinin,et al.  Transient visual cues for scrolling: an empirical study , 2002, CHI Extended Abstracts.

[40]  Andy Cockburn,et al.  Tuning and testing scrolling interfaces that automatically zoom , 2005, CHI.

[41]  Joachim Pouderoux,et al.  TangiMap: a tangible interface for visualization of large documents on handheld computers , 2005, Graphics Interface.

[42]  Nicholas Chen,et al.  Navigation techniques for dual-display e-book readers , 2008, CHI.

[43]  Ben Shneiderman,et al.  Split menus: effectively using selection frequency to organize menus , 1994, TCHI.

[44]  Ivan E. Sutherland,et al.  Sketch pad a man-machine graphical communication system , 1964, DAC.

[45]  Stuart K. Card,et al.  Evaluation of mouse, rate-controlled isometric joystick, step keys, and text keys, for text selection on a CRT , 1987 .

[46]  Douglas C. Engelbart,et al.  Display-Selection Techniques for Text Manipulation , 1967 .

[47]  Eelco Herder,et al.  Web page revisitation revisited: implications of a long-term click-stream study of browser usage , 2007, CHI.

[48]  Kellogg S. Booth,et al.  The design and evaluation of multiple interfaces : a solution for complex software , 2002 .

[49]  Mark H. Chignell,et al.  Information archiving with bookmarks: personal Web space construction and organization , 1998, CHI.

[50]  Pattie Maes,et al.  Footprints: history-rich tools for information foraging , 1999, CHI '99.

[51]  B. Boguraev Dynamic presentation of document content for rapid on-line skimming , 1998, AAAI 1998.

[52]  Suya You,et al.  Large document, small screen: a camera driven scroll and zoom control for mobile devices , 2008, I3D '08.

[53]  John Rieman,et al.  The diary study: a workplace-oriented research tool to guide laboratory efforts , 1993, INTERCHI.

[54]  Yves Guiard,et al.  Pan-zoom coordination in multi-scale pointing , 2001, CHI Extended Abstracts.

[55]  Wilfred Beeching Century of the Typewriter , 1974 .

[56]  Terry Winograd,et al.  Gaze-enhanced scrolling techniques , 2007, UIST.

[57]  Antero Taivalsaari,et al.  Generalized and stationary scrolling , 1999, UIST '99.

[58]  Susan L. Franzel,et al.  Guided search: an alternative to the feature integration model for visual search. , 1989, Journal of experimental psychology. Human perception and performance.

[59]  Steven K. Feiner,et al.  Content-aware scrolling , 2006, UIST.

[60]  Olivier Chapuis,et al.  Touchstone: exploratory design of experiments , 2007, CHI.

[61]  Victor Kaptelinin,et al.  Window navigation with and without animation: a comparison of scroll bars, zoom, and fisheye view , 1997, CHI Extended Abstracts.

[62]  Saul Greenberg,et al.  Revisitation patterns in World Wide Web navigation , 1997, CHI.

[63]  Pieter Jan Stappers,et al.  Flicking through page-based documents with thumbnail sliders and electronic dog-ears , 2000, CHI Extended Abstracts.

[64]  D. Gentner,et al.  A framework for a cognitive theory of writing , 1980 .

[65]  Sriram Subramanian,et al.  Multi-flick: an evaluation of flick-based scrolling techniques for pen interfaces , 2008, CHI.

[66]  Catherine C. Marshall,et al.  Turning the page on navigation , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[67]  Clayton Lewis,et al.  TASK-CENTERED USER INTERFACE DESIGN A Practical Introduction , 2006 .

[68]  George Buchanan,et al.  Improving navigation interaction in digital documents , 2008, JCDL '08.

[69]  Saul Greenberg,et al.  How people revisit web pages: empirical findings and implications for the design of history systems , 1997, Int. J. Hum. Comput. Stud..

[70]  J. Hayes,et al.  A Cognitive Process Theory of Writing , 1981, College Composition & Communication.

[71]  Sriram Subramanian,et al.  Augmenting the mouse with pressure sensitive input , 2007, CHI.

[72]  Masood Masoodian,et al.  DeepDocument: use of a multi-layered display to provide context awareness in text editing , 2004, AVI.

[73]  Nicholas J. Belkin,et al.  Reading time, scrolling and interaction: exploring implicit sources of user preferences for relevance feedback , 2001, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[74]  Adam Ginsburg,et al.  A viewer for PostScript documents , 1996, UIST '96.

[75]  Shumin Zhai,et al.  Improving Browsing Performance: A study of four input devices for scrolling and pointing tasks , 1997, INTERACT.

[76]  Roy Want,et al.  Squeeze me, hold me, tilt me! An exploration of manipulative user interfaces , 1998, CHI.

[77]  Kenton O'Hara,et al.  A comparison of reading paper and on-line documents , 1997, CHI.

[78]  Urmila Kukreja,et al.  RUI: Recording user input from interfaces under Windows and Mac OS X , 2006, Behavior research methods.

[79]  S. Hart,et al.  Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research , 1988 .

[80]  Judy Pearsall,et al.  The Concise Oxford Dictionary , 1999 .

[81]  Shari Trewin InputLogger: General-purpose logging of keyboard and mouse events on an Apple Macintosh , 1998 .

[82]  Albrecht Schmidt,et al.  Knowing the User's Every Move – User Activity Tracking for Website Usability Evaluation and Implicit Interaction , 2006 .

[83]  Parisa Eslambolchilar,et al.  Tilt-Based Automatic Zooming and Scaling in Mobile Devices - A State-Space Implementation , 2004, Mobile HCI.

[84]  Ravin Balakrishnan,et al.  Zliding: fluid zooming and sliding for high precision parameter manipulation , 2005, UIST.

[85]  Andrew Dillon,et al.  Reading from paper versus screens: a critical review of the empirical literature , 1992 .

[86]  Ben Shneiderman,et al.  The alphaslider: a compact and rapid selector , 1994, CHI Conference Companion.

[87]  Tek-Jin Nam,et al.  Inflatable mouse: volume-adjustable mouse with air-pressure-sensitive input and haptic feedback , 2008, CHI.

[88]  Benjamin B. Bederson,et al.  A review of overview+detail, zooming, and focus+context interfaces , 2009, CSUR.

[89]  T. K. Landauer,et al.  Selection from alphabetic and numeric menu trees using a touch screen: breadth, depth, and width , 1985, CHI '85.

[90]  Abigail Sellen,et al.  A comparison of input devices in element pointing and dragging tasks , 1991, CHI.

[91]  Sarah A. Douglas,et al.  Testing pointing device performance and user assessment with the ISO 9241, Part 9 standard , 1999, CHI '99.

[92]  Richard M. Stallman,et al.  Gnu Emacs Manual , 1996 .

[93]  Andy Cockburn,et al.  Comparing Automatic and Manual Zooming Methods for Acquiring Off-screen Targets , 2005, BCS HCI.

[94]  G. W. Furnas,et al.  Generalized fisheye views , 1986, CHI '86.

[95]  Ian H. Witten,et al.  Supporting Command Reuse: Empirical Foundations and Principles , 1993, Int. J. Man Mach. Stud..

[96]  E. Lunzer,et al.  The Effective Use of Reading. , 1980 .

[97]  James D. Hollan,et al.  Direct Manipulation Interfaces , 1985, Hum. Comput. Interact..

[98]  Kenton O'Hara,et al.  A diary study of work-related reading: design implications for digital reading devices , 1998, CHI.

[99]  Allison Woodruff,et al.  Using thumbnails to search the Web , 2001, CHI.

[100]  Bryce Allen,et al.  Cognitive and task influences on Web searching behavior , 2002, J. Assoc. Inf. Sci. Technol..

[101]  Olivier Chapuis,et al.  Shakespeare's complete works as a benchmark for evaluating multiscale document navigation techniques , 2006, BELIV '06.

[102]  D. T. Lawton,et al.  Streaming thumbnails: combining low resolution navigation and RSVP displays , 2000, CHI Extended Abstracts.

[103]  Andy Cockburn,et al.  What do web users do? An empirical analysis of web use , 2001, Int. J. Hum. Comput. Stud..

[104]  Staffan Björk,et al.  An alternative to scrollbars on small screens , 1999, CHI EA '99.

[105]  Yu Qi,et al.  ActiveScrollbar: A Scroll Bar with Direct Scale Ratio Control , 2007, HCI.

[106]  Jamey Graham,et al.  The reader's helper: a personalized document reading environment , 1999, CHI '99.

[107]  Benjamin B. Bederson,et al.  Benefits of animated scrolling , 2005, CHI Extended Abstracts.

[108]  Eelco Herder,et al.  Off the beaten tracks: exploring three aspects of web navigation , 2006, WWW '06.

[109]  Mark Witkowski,et al.  Image presentation in space and time: errors, preferences and eye-gaze activity , 2004, AVI.

[110]  Sari A. Laakso,et al.  Improved scroll bars , 2000, CHI Extended Abstracts.

[111]  Yves Guiard,et al.  Navigation as multiscale pointing: extending Fitts' model to very high precision tasks , 1999, CHI '99.

[112]  Carl Gutwin,et al.  A predictive model of menu performance , 2007, CHI.

[113]  Mark Claypool,et al.  Implicit interest indicators , 2001, IUI '01.

[114]  Monica M. C. Schraefel,et al.  The radial scroll tool: scrolling support for stylus- or touch-based document navigation , 2004, UIST '04.

[115]  J. F. Bartlett,et al.  Rock 'n' Scroll Is Here to Stay , 2000 .

[116]  Andy Cockburn,et al.  WebView: A Graphical Aid for Revisiting Web Pages , 1999 .

[117]  Susan T. Dumais,et al.  Large scale analysis of web revisitation patterns , 2008, CHI.

[118]  Takeo Igarashi,et al.  Speed-dependent automatic zooming for browsing large documents , 2000, UIST '00.

[119]  George Buchanan,et al.  The myth of find: user behaviour and attitudes towards the basic search feature , 2008, JCDL '08.

[120]  Ben Shneiderman,et al.  Determining Causes and Severity of End-User Frustration , 2004, Int. J. Hum. Comput. Interact..

[121]  Nicholas J. Belkin,et al.  Display time as implicit feedback: understanding task effects , 2004, SIGIR '04.

[122]  Alan Dix Hands Across the Screen - why scrollbars are on the right and other stories , 1998 .

[123]  Donald Byrd,et al.  A scrollbar-based visualization for document navigation , 1999, DL '99.

[124]  Ian Oakley,et al.  Tilt and Feel: Scrolling with Vibrotactile Display , 2004 .

[125]  Staffan Björk Hierarchical flip zooming: enabling parallel exploration of hierarchical visualizations , 2000, AVI '00.

[126]  George Buchanan Rapid document navigation for information triage support , 2007, JCDL '07.

[127]  Toshiyuki Masui,et al.  Elastic graphical interfaces to precise data manipulation , 1995, CHI 95 Conference Companion.

[128]  Andreas Dieberger,et al.  Exploratory navigation in large multimedia documents using Context Lenses , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[129]  W. Buxton,et al.  A study in two-handed input , 1986, CHI '86.

[130]  Ken Perlin,et al.  Pad: an alternative approach to the computer interface , 1993, SIGGRAPH.

[131]  James D. Hollan,et al.  Edit wear and read wear , 1992, CHI.

[132]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.

[133]  P. Fitts The information capacity of the human motor system in controlling the amplitude of movement. , 1954, Journal of experimental psychology.

[134]  Michael D. Byrne,et al.  The tangled Web we wove: a taskonomy of WWW use , 1999, CHI '99.

[135]  Benjamin B. Bederson,et al.  SlideBar: Analysis of a linear input device , 2004, Behav. Inf. Technol..

[136]  Mary Czerwinski,et al.  Code Thumbnails: Using Spatial Memory to Navigate Source Code , 2006, Visual Languages and Human-Centric Computing (VL/HCC'06).

[137]  Richard Catrambone,et al.  Beyond the scrollbar: an evolution and evaluation of alternative navigation techniques , 1999, Proceedings 1999 IEEE Symposium on Visual Languages.

[138]  John F. Hughes,et al.  Navigating documents with the virtual scroll ring , 2004, UIST '04.

[139]  I. Scott MacKenzie,et al.  Fitts' Law as a Research and Design Tool in Human-Computer Interaction , 1992, Hum. Comput. Interact..

[140]  Jeffrey Heer,et al.  WebQuilt : A Proxy-based Approach to Remote Web Usability Testing , 2001 .

[141]  Richard Chimera Value bars: an information visualization and navigation tool for multi-attribute listings , 1992, CHI '92.

[142]  Ramana Rao,et al.  Bridging the paper and electronic worlds: the paper user interface , 1993, INTERCHI.

[143]  Albrecht Schmidt,et al.  Tracking the interaction of users with AJAX applications for usability testing , 2007, CHI.

[144]  Tue Haste Andersen A simple movement time model for scrolling , 2005, CHI EA '05.

[145]  Kori Inkpen Quinn,et al.  Challenges of Capturing Natural Web-Based User Behaviors , 2008, Int. J. Hum. Comput. Interact..

[146]  Carl Gutwin,et al.  The usability of transparent overview layers , 1998, CHI Conference Summary.

[147]  Gustav Öquist,et al.  Eye movement study of reading text on a mobile phone using paging, scrolling, leading, and RSVP , 2007, MUM.

[148]  Ronald Baecker,et al.  The user-centered iterative design of collaborative writing software , 1993, INTERCHI.

[149]  Mary Czerwinski,et al.  Display space usage and window management operation comparisons between single monitor and multiple monitor users , 2004, AVI.

[150]  Carl Gutwin,et al.  Faster document navigation with space-filling thumbnails , 2006, CHI.

[151]  Scott E. Hudson,et al.  PaperLink: a technique for hyperlinking from real paper to electronic content , 1997, CHI.

[152]  Jean-Daniel Fekete,et al.  OrthoZoom scroller: 1D multi-scale navigation , 2006, CHI.

[153]  B. Schneirdeman,et al.  Designing the User Interface: Strategies for Effective Human-Computer Interaction , 1998 .

[154]  C. Alder,et al.  Investigating the human-computer interface using the Datalogger , 1996 .

[155]  Daniela Karin Rosner,et al.  Studying paper use to inform the design of personal and portable technology , 2008, CHI Extended Abstracts.

[156]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[157]  Kori Inkpen Quinn,et al.  Web browsing today: the impact of changing contexts on user activity , 2005, CHI EA '05.

[158]  Gavriel Salvendy,et al.  Human Interface and the Management of Information. Methods, Techniques and Tools in Information Design, Symposium on Human Interface 2007, Held as Part of HCI International 2007, Beijing, China, July 22-27, 2007, Proceedings Part I , 2007, HCI.

[159]  Dan Venolia Facile 3D direct manipulation , 1993, CHI '93.

[160]  I. Silo Scroll Display: pointing device for palmtop computers , 1998, Proceedings. 3rd Asia Pacific Computer Human Interaction (Cat. No.98EX110).

[161]  K. O'hara Towards a Typology of Reading Goals , 1996 .

[162]  Andrew Dillon,et al.  Visual search and reading tasks using ClearType and regular displays: two experiments , 2006, CHI.

[163]  James E. Pitkow,et al.  Characterizing Browsing Strategies in the World-Wide Web , 1995, Comput. Networks ISDN Syst..