Collaborative exploratory search for information filtering and large‐scale information triage

Modern information seekers face dynamic streams of large‐scale heterogeneous data that are both intimidating and overwhelming. They need a strategy to filter this barrage of massive data sets, and to find all of the information responding to their information needs, despite the pressures imposed by schedules and budgets. In this applied research, we present an exploratory search strategy that allows professional information seekers to efficiently and effectively triage all of the data. We demonstrate that exploratory search is particularly useful for information filtering and large‐scale information triage, regardless of the language of the data, and regardless of the particular industry, whether finance, medical, business, government, information technology, news, or legal. Our strategy reduces a dauntingly large volume of information into a manageable, high‐precision data set, suitable for focused reading. This strategy is interdisciplinary, integrating concepts from information filtering, information triage, and exploratory search. Key aspects include advanced search software, interdisciplinary paired search, asynchronous collaborative search, attention to linguistic phenomena, and aggregated search results in the form of a search matrix or search grid. We present the positive results of a task‐oriented evaluation in a real‐world setting, discuss these results from a qualitative perspective, and share future research areas.

[1]  Charles L. A. Clarke,et al.  Information Retrieval - Implementing and Evaluating Search Engines , 2010 .

[2]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[3]  Konstantinos A. Meintanis,et al.  Recognizing user interest and document value from reading and organizing activities in document triage , 2006, IUI '06.

[4]  Donald T. Hawkins,et al.  Online Bibliographic Search Strategy Development. , 1982 .

[5]  Gina Venolia Backstory: A Search Tool for Software Developers Supporting Scalable Sensemaking , 2008 .

[6]  Frank M. Shipman,et al.  Spatial hypertext and the practice of information triage , 1997, HYPERTEXT '97.

[7]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[8]  Bert R. Boyce,et al.  Online information retrieval concepts, principles, and techniques , 1987, J. Am. Soc. Inf. Sci..

[9]  Chirag Shah,et al.  Role-based results redistribution for collaborative information retrieval , 2010, Inf. Process. Manag..

[10]  Mark Buchanan Learning from bacteria , 2008 .

[11]  George Buchanan,et al.  Improving skim reading for document triage , 2008, IIiX.

[12]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[13]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..

[14]  Ronald P. Carver,et al.  Reading Rate: Theory, Research, and Practical Implications. , 1992 .

[15]  Meredith Ringel Morris,et al.  Co-located collaborative web search: understanding status quo practices , 2009, CHI Extended Abstracts.

[16]  Catherine N. Ball,et al.  Reliable Electronic Text: The Elusive Prerequisite for a Host of Human Language Technologies , 2010 .

[17]  Frank M. Shipman,et al.  Supporting document triage via annotation-based multi-application visualizations , 2010, JCDL '10.

[18]  George Buchanan,et al.  An Empirical Study of User Navigation during Document Triage , 2009, ECDL.

[19]  Gary Marchionini,et al.  Information Seeking in Electronic Environments , 1995 .

[20]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[21]  Pamela Effrein Sandstrom,et al.  Information Foraging Theory: Adaptive Interaction with Information , 2010, J. Assoc. Inf. Sci. Technol..

[22]  Charles T. Meadow,et al.  Basics of online searching , 1981 .

[23]  Paul Hugh Cleverley,et al.  Exploratory information searching in the enterprise: A study of user satisfaction and task performance , 2017, J. Assoc. Inf. Sci. Technol..

[24]  Stephen J. Payne,et al.  Division of labour in collaborative information seeking:Current approaches and future directions , 2013 .

[25]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[26]  Peter Pirolli,et al.  Information Foraging , 2009, Encyclopedia of Database Systems.

[27]  Meredith Ringel Morris,et al.  CoSense: enhancing sensemaking for collaborative web search , 2009, CHI.

[28]  Robert M. Rolfe,et al.  Exploratory analysis of highly heterogeneous document collections , 2013, KDD.

[29]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[30]  Jannica Heinström,et al.  Looking for Information: A Survey of Research on Information Seeking, Needs and Behavior , 2013, J. Documentation.

[31]  Frank M. Shipman,et al.  Effects of Display Configurations on Document Triage , 2005, INTERACT.

[32]  Vasant Dhar,et al.  Intelligent information triage , 2001, SIGIR '01.

[33]  Paul M. Herceg,et al.  Methods for Evaluating Text Extraction Toolkits: An Exploratory Investigation , 2015 .

[34]  George Buchanan,et al.  Investigating Document Triage on Paper and Electronic Media , 2007, ECDL.

[35]  Ryen W. White Interactions with Search Systems , 2016 .

[36]  Dan Morris,et al.  SearchBar: a search-centric web history for task resumption and information re-finding , 2008, CHI.

[37]  P. Pirolli,et al.  The Sensemaking Process and Leverage Points for Analyst Technology as Identified Through Cognitive Task Analysis , 2007 .

[38]  Penelope Campbell,et al.  Looking for Information: A Survey of Research on Information Seeking, Needs, and Behavior (3rd ed.) , 2013 .

[39]  Nikhil Sharma,et al.  Artifact usefulness and usage in sensemaking handoffs , 2009, ASIST.