Creating Sparks: Comparing Search Results Using Discriminatory Search Term Word Co-Occurrence to Facilitate Serendipity in the Enterprise

Categories or tags in faceted search interfaces that represent an information item rarely convey the unexpected or non-obvious associated concepts buried within search results. No prior research has been identified that assesses whether facets generated from discriminative search term word co-occurrence can act as catalysts for insightful and serendipitous encounters during exploratory search. In this study, 53 scientists from two organisations interacted with semi-interactive stimuli, and 74% expressed a large or moderate desire to use such techniques within their workplace. Participants showed preferences for certain algorithms and for colour coding, and insightful and serendipitous encounters were identified. These techniques appear to offer a significant improvement over the existing approaches used within the study organisations, providing further evidence that insightful and serendipitous encounters can be facilitated in the search user interface. This research has implications for organisational learning, knowledge discovery and exploratory search interface design.
