Provenance, End-User Trust and Reuse: An Empirical Investigation

Provenance theorists and practitioners assume that provenance is essential for trust in and reuse of data. However, little empirical research has been conducted to more closely examine this assumption. This qualitative study explores how provenance affects end-users’ trust in and reuse of data. Toward this end, the authors conducted semistructured interviews with 17 proteomics researchers who interact with data from ProteomeCommons.org, a large scientific data repository. Empirical findings from this study suggest that provenance does help end-users gauge the trustworthiness of data and build their confidence in reusing data. However, provenance also needs to be accompanied by other kinds of information, including: more specific data quality information, the data itself, and author reputation information. Implications of this study stress the value of end-user studies in provenance research, specifically to assess the ‘real-world’ impact of provenance encoded and communicated to end-users in systems.

[1]  J. Gosby MEDIA REVIEWS: Basics of Qualitative Research - Techniques and Procedures for Developing Grounded Theory 2nd Edition by A. Strauss and J. Corbin. Sage Publications, , 2000 .

[2]  David Maier,et al.  Scientific Exploration in the Era of Ocean Observatories , 2008, Computing in Science & Engineering.

[3]  Kristi Jackson,et al.  Qualitative Data Analysis with NVivo , 2007 .

[4]  Olaf Hartig,et al.  Using Web Data Provenance for Quality Assessment , 2009, SWPM.

[5]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[6]  William A. Wallace,et al.  Trust in digital information , 2008, J. Assoc. Inf. Sci. Technol..

[7]  Chris Elsaesser Provenance-based Belief , 2010, TaPP.

[8]  Peter L. Pulsifer,et al.  Today's Data are Part of Tomorrow's Research: Archival Issues in the Sciences , 2007 .

[9]  Anselm L. Strauss,et al.  Basics of qualitative research : techniques and procedures for developing grounded theory , 1998 .

[10]  N. Hoffart Basics of Qualitative Research: Techniques and Procedures for Developing Grounded Theory , 2000 .

[11]  Karen Schuchardt,et al.  Application of Named Graphs Towards Custom Provenance Views , 2009, Workshop on the Theory and Practice of Provenance.

[12]  Vladimiro Sassone,et al.  A Formal Model of Provenance in Distributed Systems , 2009, Workshop on the Theory and Practice of Provenance.

[13]  Carole A. Goble,et al.  Data Lineage Model for Taverna Workflows with Lightweight Annotation Requirements , 2008, IPAW.

[14]  Yolanda Gil,et al.  Towards content trust of web resources , 2006, WWW '06.

[15]  Chris Turner,et al.  User feedback: Testing the leaders demonstrator application , 2004 .