Web Search, Web Tutorials & Software Applications: Characterizing and Supporting the Coordinated Use of Online Resources for Performing Work in Feature-Rich Software

Web search and other online resources serve an integral role in how people learn and use feature-rich software (e.g., Adobe Photoshop) on a daily basis. Users depend on web resources both as a first line of technical support, and as a means for coping with system complexity. For example, people rely on web resources to learn new tasks, to troubleshoot problems, or to remind themselves of key task details. When users rely on web resources to support their work, their interactions are distributed over three user environments: (1) the search engine, (2) retrieved documents, and (3) the application’s user interface. As users interact with these environments, their actions generate a rich set of signals that characterize how the population thinks about and uses software systems “in the wild,” on a day-to-day basis. This dissertation presents three works that successively connect and associate signals and artifacts across these environments, thereby generating novel insights about users and their tasks, and enabling powerful new end-user tools and services. These three projects are as follows: Characterizing usability through search (CUTS): The CUTS system demonstrates that aggregate logs of web search queries can be leveraged to identify common tasks and potential usability problems faced by the users of any publicly available interactive system. For example, in 2011 I examined query data for the Firefox web browser. Automated analysis uncovered approximately 150 variations of the query “Firefox how to get the menu bar back”, with queries issued once every 32 minutes on average. Notably, this analysis did not depend on direct access to query logs. Instead, query suggestions services and online advertising valuations were leveraged to approximate aggregate query data. Nevertheless, these data proved to be timely, to have a high degree of ecological validity, and to be arguably less prone to self-selection bias than data gathered via traditional usability methods. Query-feature graphs (QF-Graphs): Query-feature graphs are structures that map high-level descriptions of a user’s goals to the specific features and commands relevant to achieving those goals in software. QF-graphs address an important instance of the more general vocabulary mismatch problem. For example, users of the GIMP photo manipulation software often want to “make a picture black and white”, and fail to recognize the relevance of the applicable commands, which include: “desaturate”, and “channel mixer”. The key insights for building QF-graphs are that: (1) queries concisely express the user’s goal in the user’s own words, and (2) retrieved tutorials likely include both query terms, as well as terminology from the application’s interface (e.g., the names of commands). QF-graphs are generated by mining these co-occurrences across thousands of query-tutorial pairings. InterTwine: InterTwine explores interaction possibilities that arise when software applications, web search, and online support materials are directly integrated into a single productivity system. With InterTwine, actions in the web browser directly impact how information is presented in a software application, and vice versa. For example, when a user opens a web tutorial in their browser, the application’s menus and tooltips are

[1]  Qiang Yang,et al.  Query enrichment for web-query classification , 2006, TOIS.

[2]  Eelco Herder,et al.  Web page revisitation revisited: implications of a long-term click-stream study of browser usage , 2007, CHI.

[3]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[4]  Adam Fourney,et al.  Mining online software tutorials: challenges and open problems , 2014, CHI Extended Abstracts.

[5]  Ziv Bar-Yossef,et al.  Mining search engine query logs via suggestion sampling , 2008, Proc. VLDB Endow..

[6]  Marti A. Hearst,et al.  The state of the art in automating usability evaluation of user interfaces , 2001, CSUR.

[7]  Björn Hartmann,et al.  HyperSource: bridging the gap between source and code-related web sites , 2011, CHI.

[8]  Meredith Ringel Morris,et al.  Enhancing Technical Q&A Forums with CiteHistory , 2013, ICWSM.

[9]  Michael S. Bernstein,et al.  Eddi: interactive topic-based browsing of social status streams , 2010, UIST.

[10]  Adam Fourney,et al.  InterTwine: creating interapplication information scent to support coordinated use of software , 2014, UIST.

[11]  Kellogg S. Booth,et al.  Graphically enhanced keyboard accelerators for GUIs , 2010, Graphics Interface.

[12]  Björn Hartmann,et al.  ShowMeHow: translating user interface instructions between applications , 2011, UIST.

[13]  Susan T. Dumais,et al.  Examining Repetition in User Search Behavior , 2007, ECIR.

[14]  Rob Miller,et al.  Translating keyword commands into executable code , 2006, UIST.

[15]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[16]  Jeffrey Nichols,et al.  Interpreting Written How-To Instructions , 2009, IJCAI.

[17]  Wanda Pratt,et al.  Understanding quantified-selfers' practices in collecting and exploring personal data , 2014, CHI.

[18]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[19]  Carolyn Watters,et al.  A field study characterizing Web-based information-seeking tasks , 2007 .

[20]  Ryen W. White,et al.  Diagnoses, Decisions, and Outcomes: Web Search as Decision Support for Cancer , 2015, WWW.

[21]  Ryen W. White,et al.  Exploring Time-Dependent Concerns about Pregnancy and Childbirth from Search Logs , 2015, CHI.

[22]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[23]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[24]  Fabrizio Silvestri,et al.  Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data , 2006, TOIS.

[25]  Koji Yatani,et al.  Analysis of Adjective-Noun Word Pair Extraction Methods for Online Review Summarization , 2011, IJCAI.

[26]  Krzysztof Z. Gajos,et al.  Crowdsourcing step-by-step information extraction to enhance existing how-to videos , 2014, CHI.

[27]  Susan T. Dumais,et al.  Large scale analysis of web revisitation patterns , 2008, CHI.

[28]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[29]  Adam Fourney,et al.  Characterizing the usability of interactive applications through query log analysis , 2011, CHI.

[30]  Robin Jeffries,et al.  Backtracking Events as Indicators of Usability Problems in Creation-Oriented Applications , 2012, TCHI.

[31]  Jakob Grue Simonsen,et al.  Extracting usability and user experience information from online user reviews , 2013, CHI.

[32]  Philip J. Guo,et al.  Two studies of opportunistic programming: interleaving web foraging, learning, and writing code , 2009, CHI.

[33]  Qiang Yang,et al.  Q2C@UST: our winning solution to query classification in KDDCUP 2005 , 2005, SKDD.

[34]  Matthew Kay,et al.  Ingimp: introducing instrumentation to an end-user open source application , 2008, CHI.

[35]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[36]  Andrea Bunt,et al.  Task-centric interfaces for feature-rich software , 2014, OZCHI.

[37]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[38]  Scott Counts,et al.  Unraveling abstinence and relapse: smoking cessation reflected in social media , 2014, CHI.

[39]  Jodi Forlizzi,et al.  A stage-based model of personal informatics systems , 2010, CHI.

[40]  Scott R. Klemmer,et al.  Example-centric programming: integrating web search into the development environment , 2010, CHI.

[41]  D. Norman,et al.  User Centered System Design: New Perspectives on Human-Computer Interaction , 1988 .

[42]  A. J. Ko Mining whining in support forums with frictionary , 2012, CHI EA '12.

[43]  Morgan Dixon,et al.  Prefab: implementing advanced behaviors using pixel-based reverse engineering of interface structure , 2010, CHI.

[44]  Robin Jeffries,et al.  Undo and erase events as indicators of usability problems , 2009, CHI.

[45]  Crystale Purvis Cooper,et al.  Cancer Internet Search Activity on a Major Search Engine, United States 2001-2003 , 2005, Journal of medical Internet research.

[46]  Soo-Min Kim,et al.  Automatic Identification of Pro and Con Reasons in Online Reviews , 2006, ACL.

[47]  Jimmy J. Lin,et al.  Quantitative evaluation of passage retrieval algorithms for question answering , 2003, SIGIR.

[48]  Ya Xu,et al.  Computers and iphones and mobile phones, oh my!: a logs-based comparison of search users on different devices , 2009, WWW '09.

[49]  Wagner Meira,et al.  Rank-preserving two-level caching for scalable search engines , 2001, SIGIR '01.

[50]  Christian Stefansen,et al.  GOOGLE DISEASE TRENDS: AN UPDATE , 2013 .

[51]  Peter Pirolli,et al.  Information Foraging , 2009, Encyclopedia of Database Systems.

[52]  D. Sornette,et al.  Stretched exponential distributions in nature and economy: “fat tails” with characteristic scales , 1998, cond-mat/9801293.

[53]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[54]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[55]  Don Norman,et al.  The next UI breakthrough: command lines , 2007, INTR.

[56]  Benjamin J. Lafreniere,et al.  Task-Centric User Interfaces , 2014 .

[57]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[58]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[59]  Anne Aula,et al.  How does search behavior change as search becomes more difficult? , 2010, CHI.

[60]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[61]  N. Hoffart Basics of Qualitative Research: Techniques and Procedures for Developing Grounded Theory , 2000 .

[62]  Ryen W. White,et al.  Toward Enhanced Pharmacovigilance Using Patient-Generated Data on the Internet , 2014, Clinical pharmacology and therapeutics.

[63]  Themis Palpanas,et al.  Survey on mining subjective data on the web , 2011, Data Mining and Knowledge Discovery.

[64]  Cathleen Wharton,et al.  Cognitive Walkthroughs: A Method for Theory-Based Evaluation of User Interfaces , 1992, Int. J. Man Mach. Stud..

[65]  Michael J. Paul Search and Breast Cancer : On Disruptive Shifts of Attention over Life Histories of an Illness , 2014 .

[66]  Tim Weninger,et al.  Text Extraction from the Web via Text-to-Tag Ratio , 2008, 2008 19th International Workshop on Database and Expert Systems Applications.

[67]  Falk Scholer,et al.  User performance versus precision measures for simple search tasks , 2006, SIGIR.

[68]  Adam Fourney,et al.  "Then click ok!": extracting references to interface elements in online documentation , 2012, CHI.

[69]  Gierad Laput,et al.  CommandSpace: modeling the relationships between tasks, descriptions and features , 2014, UIST.

[70]  Tovi Grossman,et al.  Searching for software learning resources using application context , 2011, UIST.

[71]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[72]  B. Sparrow,et al.  Google Effects on Memory: Cognitive Consequences of Having Information at Our Fingertips , 2011, Science.

[73]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[74]  Charles L. A. Clarke,et al.  Characterizing large-scale use of a direct manipulation application in the wild , 2010, Graphics Interface.

[75]  Ed H. Chi,et al.  Using information scent to model user information needs and actions and the Web , 2001, CHI.

[76]  Mika Käki,et al.  Information search and re-access strategies of experienced web users , 2005, WWW '05.

[77]  Eser Kandogan,et al.  Field studies of computer system administrators: analysis of system management tools and practices , 2004, CSCW.

[78]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[79]  Tovi Grossman,et al.  Community enhanced tutorials: improving tutorials with multiple demonstrations , 2013, CHI.

[80]  Charles L. A. Clarke,et al.  Relevance ranking for one to three term queries , 1997, Inf. Process. Manag..

[81]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[82]  Tovi Grossman,et al.  Patina: dynamic heatmaps for visualizing application usage , 2013, CHI.

[83]  Ryen W. White,et al.  Web-scale pharmacovigilance: listening to signals from the crowd , 2013, J. Am. Medical Informatics Assoc..

[84]  Koji Yatani,et al.  Review spotlight: a user interface for summarizing user-generated reviews using adjective-noun word pairs , 2011, CHI.

[85]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[86]  David F. Redmiles,et al.  Extracting usability information from user interface events , 2000, CSUR.

[87]  Charles L. A. Clarke,et al.  In the Mood to Click? Towards Inferring Receptiveness to Search Advertising , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[88]  Charles L. A. Clarke,et al.  Exploiting redundancy in question answering , 2001, SIGIR '01.

[89]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[90]  Andrea Bunt,et al.  Exploring Personalized Command Recommendations based on Information Found in Web Documentation , 2015, IUI.

[91]  Scott E. Hudson,et al.  Dynamic detection of novice vs. skilled use without a task model , 2007, CHI.

[92]  Gierad Laput,et al.  Tutorial-based interfaces for cloud-enabled applications , 2012, UIST.

[93]  Matthew Richardson,et al.  Learning about the world through long-term query logs , 2008, TWEB.

[94]  Jalal Mahmud,et al.  Towards automatic functional test execution , 2012, IUI '12.

[95]  Adam Fourney,et al.  Query-feature graphs: bridging user vocabulary and system functionality , 2011, UIST '11.

[96]  Ryen W. White,et al.  From cookies to cooks: insights on dietary patterns via analysis of web usage logs , 2013, WWW.

[97]  Desney S. Tan,et al.  Phosphor: explaining transitions in the user interface using afterglow effects , 2006, UIST.

[98]  Luke S. Zettlemoyer,et al.  Reading between the Lines: Learning to Map High-Level Instructions to Commands , 2010, ACL.

[99]  Songbo Tan,et al.  A survey on sentiment detection of reviews , 2009, Expert Syst. Appl..

[100]  Jakob Nielsen,et al.  Heuristic evaluation of user interfaces , 1990, CHI '90.

[101]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[102]  Tovi Grossman,et al.  CommunityCommands: command recommendations for software applications , 2009, UIST '09.

[103]  D. Wegner Transactive Memory: A Contemporary Analysis of the Group Mind , 1987 .

[104]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[105]  Andrea Bunt,et al.  AdaptableGIMP: designing a socially-adaptable interface , 2011, UIST '11 Adjunct.

[106]  Mira Dontcheva,et al.  Pause-and-play: automatically linking screencast video tutorials with applications , 2011, UIST.

[107]  Keith Vander Linden,et al.  Automatically Eliciting Task Models from Written Task Narratives , 2002, CADUI.

[108]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[109]  Susan T. Dumais,et al.  Personalizing search via automated analysis of interests and activities , 2005, SIGIR '05.

[110]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.