Facilitating the discovery of open government datasets through an exploratory data search interface

The primary area of investigation for this paper is the process of open data discovery, specifically, how novice users search for open government datasets and how this process can be improved. The problem of search on open data portals has featured consistently in Canadian Open Government consultations. A literature review of open data initiatives and processes reveals that open data search is a browsing or investigative task rather than a factual lookup or subject search task. Researchers have proposed exploratory search interfaces as alternatives to traditional search interfaces to support learning and investigative tasks in the information retrieval domain. This paper elaborates on a study conducted to evaluate the usefulness of a visualization-based exploratory data search interface to help novice users discover relevant datasets. A special feature of this interface is the use of variable names in addition to dataset description for search. Today, data search systems on open data portals tend to rely on the text contained in metadata and dataset descriptions to facilitate keyword search. However, the variable names contain important information about the content and structure of a file. This study also probed the role that variable names could play in search, specifically as a means of facilitating exploration and relevance assessment.

[1]  Marti A. Hearst Search User Interfaces , 2009 .

[2]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[3]  Ben Shneiderman,et al.  Integrating statistics and visualization: case studies of gaining clarity during exploratory data analysis , 2008, CHI.

[4]  Xiaojun Yuan,et al.  Seeking information with an information visualization system: a study of cognitive styles , 2011, Inf. Res..

[5]  Ben Shneiderman,et al.  From Keyword Search to Exploration: Designing Future Search Interfaces for the Web , 2010, Found. Trends Web Sci..

[6]  Gary Marchionini,et al.  Accessing government statistical information , 2005, Computer.

[7]  Tingting Jiang,et al.  Information architecture: Exploratory search in different information architectures , 2008 .

[8]  James D. Foley,et al.  ResultMaps: Visualization for Search Interfaces , 2009, IEEE Transactions on Visualization and Computer Graphics.

[9]  Marti A. Hearst Clustering versus faceted categories for information exploration , 2006, Commun. ACM.

[10]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[11]  M. Sheelagh T. Carpendale,et al.  Fluid Views: a zoomable search environment , 2012, AVI.

[12]  Tim Davies,et al.  Open data, democracy and public sector reform. A look at open government data use from data.gov.uk , 2010 .

[13]  Robert G. Capra,et al.  Designing exploratory search tasks for user studies of information seeking support systems , 2009, JCDL '09.

[14]  Ben Shneiderman,et al.  Designing a Metadata -Driven Visual Information Browser for Federal Statistics , 2003, DG.O.

[15]  Diane Kelly,et al.  Methods for Evaluating Interactive Information Retrieval Systems with Users , 2009, Found. Trends Inf. Retr..

[16]  Luanne Freund,et al.  Assigning search tasks designed to elicit exploratory search behaviors , 2012, HCIR '12.