Data Reuse and Users’ Trust Judgments: Toward Trusted Data Curation

Ayoung Yoon: Data Reuse and Users’ Trust Judgments: Toward Trusted Data Curation (Under the direction of Dr. Helen R. Tibbo) Data reuse refers to the secondary use of data—not for its original purpose but for studying new problems. Although reusing data might not yet be the norm in every discipline, the benefits of reusing shared data have been asserted by a number of researchers, and data reuse has been a major concern in many disciplines. Assessing data for its trustworthiness becomes important in data reuse with the growth in data creation because of the lack of standards for ensuring data quality and potential harm from using poor-quality data. This dissertation aims to explore many facets of data reusers’ trust in data generated by other researchers, focusing on user-defined trust attributes and the judgment process with influential factors that determine these attributes. Because trust is a complex concept that is explored in multiple disciplines, this study developed a theoretical framework from an extensive literature review in the areas of sociology, social psychology, information, and information systems. This study takes an interpretive qualitative approach by using in-depth semi-structured interviews as the primary research method. The study population comprises reusers of quantitative social science data from public health and social work—the primary disciplines with data reuse cultures. By employing purposive sampling, a total of 38 participants were recruited.

[1]  Harry van den Berg,et al.  Reanalyzing Qualitative Interviews from Different Angles: The Risk of Decontextualization and Other Problems of Sharing Qualitative Data , 2005 .

[2]  Mark John Costello Motivating Online Publication of Data , 2009 .

[3]  John K. Butler Toward Understanding and Measuring Conditions of Trust: Evolution of a Conditions of Trust Inventory , 1991 .

[4]  Geoff Walsham,et al.  Interpreting Information Systems in Organizations , 1993 .

[5]  L. Stewart User Acceptance of Electronic Journals: Interviews with Chemists at Cornell University , 1996 .

[6]  Michael Witt,et al.  Data sharing, small science and institutional repositories , 2010, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[7]  Devan Ray Donaldson,et al.  User conceptions of trustworthiness for digital archival documents , 2015, J. Assoc. Inf. Sci. Technol..

[8]  Inge Angevaare Taking Care of Digital Collections and Data: ‘Curation’ and Organisational Choices for Research Libraries , 2009 .

[9]  Matthew S. Mayernik,et al.  Whose data do you trust? Integrity issues in the preservation of scientific data , 2008 .

[10]  R. Elliott,et al.  Evolving guidelines for publication of qualitative research studies in psychology and related fields. , 1999, The British journal of clinical psychology.

[11]  Thomas Chesney,et al.  An empirical examination of Wikipedia's credibility , 2006, First Monday.

[12]  K. Bailey Methods of Social Research , 1978 .

[13]  Carol Tenopir,et al.  Finding and using journal-article components: Impacts of disaggregation on teaching and research practice , 2008, J. Assoc. Inf. Sci. Technol..

[14]  Sarah Callaghan,et al.  Citation and Peer Review of Data: Moving Towards Formal Data Publication , 2011, Int. J. Digit. Curation.

[15]  Kathleen Marie Fear,et al.  Measuring and Anticipating the Impact of Data Reuse. , 2013 .

[16]  Wanda J. Orlikowski,et al.  Studying Information Technology in Organizations: Research Approaches and Assumptions , 1991, Inf. Syst. Res..

[17]  P. Ring,et al.  Structuring cooperative relationships between organizations , 1992 .

[18]  Andrew Cox,et al.  What are communities of practice? A comparative review of four seminal works , 2005, J. Inf. Sci..

[19]  Etienne Wenger,et al.  Situated Learning: Legitimate Peripheral Participation , 1991 .

[20]  Ann Peterson Bishop,et al.  Document Structure and Digital Libraries: How Researchers Mobilize Information in Journal Articles , 1999, Inf. Process. Manag..

[21]  K. Giffin The contribution of studies of source credibility to a theory of interpersonal trust in the communication process. , 1967, Psychological bulletin.

[22]  Bernard Barber,et al.  The Logic and Limits of Trust , 1983 .

[23]  Geoff Walsham,et al.  Interpretive case studies in IS research: nature and method , 1995 .

[24]  B. Adelson When Novices Surpass Experts: The Difficulty of a Task May Increase With Expertise , 1984 .

[25]  Greg Guest,et al.  Public health research methods , 2015 .

[26]  D. Giaretta,et al.  The Digital Curation Centre: a vision for digital curation , 2005, 2005 IEEE International Symposium on Mass Storage Systems and Technology.

[27]  C Gleit,et al.  Secondary data analysis: a valuable resource. , 1989, Nursing research.

[28]  Melissa H. Cragin,et al.  Introduction: Institutional Repositories: Current State and Future , 2008, Libr. Trends.

[29]  Lois W. Sayrs Interviews : an introduction to qualitative research interviewing , 1996 .

[30]  Louise Corti,et al.  Progress and Problems of Preserving and Providing Access to Qualitative Data for Social Research—The International Picture of an Emerging Culture , 2000 .

[31]  Jonathan A. Smith Reflecting on the development of interpretative phenomenological analysis and its contribution to qualitative research in psychology , 2004 .

[32]  C. Willig Introducing Qualitative Research in Psychology , 2001 .

[33]  How could contemporary social theory contribute to socialized epistemology? , 2001 .

[35]  Marc Berg,et al.  The contextual nature of medical information , 1999, Int. J. Medical Informatics.

[36]  P. Sztompka Trust: A Sociological Theory , 2000 .

[37]  Alexander Ball,et al.  Challenges and Issues Relating to the Use of Representation Information for the Digital Curation of Crystallography and Engineering Data , 2008, Int. J. Digit. Curation.

[38]  Yasmeen Shorish Data Curation Is for Everyone! The Case for Master's and Baccalaureate Institutional Engagement with Data Curation , 2012 .

[39]  Miriam J. Metzger,et al.  Perceptions of Internet Information Credibility , 2000 .

[40]  Richard Arena,et al.  Trust, Codification and Epistemic Communities: Implementing an Expert System in the French Steel Industry , 2006 .

[41]  J. Welman,et al.  Research Methodology for the Business and Administrative Sciences , 2002 .

[42]  Joan E. Sieber Social Scientists' Concerns About Sharing Data , 1991 .

[43]  Adolfo G. Prieto,et al.  From conceptual to perceptual reality: trust in digital repositories , 2009 .

[44]  K. Blomqvist The many faces of trust , 1997 .

[45]  Blair H. Sheppard,et al.  The Grammars of Trust: A Model and General Implications , 1998 .

[46]  Susan Wiedenbeck,et al.  On-line trust: concepts, evolving themes, a model , 2003, Int. J. Hum. Comput. Stud..

[47]  Martina Stockhause,et al.  Quality assessment concept of the World Data Center for Climate and its application to CMIP5 data , 2012 .

[48]  Ruth E. Duerr,et al.  Data Citation and Peer Review , 2010 .

[49]  Christine L. Borgman,et al.  Research Data: Who Will Share What, with Whom, When, and Why? , 2010 .

[50]  Dagmar Lorenz-Meyer Possibilities of Enacting and Researching Epistemic Communities , 2010 .

[51]  Gary P. Radford Positivism, Foucault, and the Fantasia of the Library: Conceptions of Knowledge and the Modern Library Experience , 1992, The Library Quarterly.

[52]  Ann Zimmerman,et al.  Beyond the Data Deluge: A Research Agenda for Large-Scale Data Sharing and Reuse , 2011, Int. J. Digit. Curation.

[53]  Robert L. Cromwell,et al.  Evaluating Internet resources: Identity, affiliation, and cognitive authority in a networked world , 2001, J. Assoc. Inf. Sci. Technol..

[54]  Barbara P. Buttenfield,et al.  Digital Libraries and Collaborative Knowledge Construction , 2003 .

[55]  Susan L. Morrow Quality and trustworthiness in qualitative research in counseling psychology. , 2005 .

[56]  Mary Larsgaard,et al.  The National Geospatial Digital Archives—Collection Development: Lessons Learned , 2009, Libr. Trends.

[57]  Jeremy P. Birnholtz,et al.  Data at work: supporting sharing in science and engineering , 2003, GROUP.

[58]  Etienne Wenger,et al.  Communities of Practice: Learning, Meaning, and Identity , 1998 .

[59]  C. Strasser,et al.  Researcher Perspectives on Publication and Peer Review of Data , 2014, PloS one.

[60]  Lynn Yarmey,et al.  Data Stewardship: Environmental Data Curation and a Web-of-Repositories , 2009, Int. J. Digit. Curation.

[61]  Eric G. Campbell,et al.  Sharing in Science , 2002, American Scientist.

[62]  Matthew S. Mayernik,et al.  Peer Review of Datasets: When, Why, and How , 2015 .

[63]  Michele Williams In Whom we Trust: Group Membership as an Affective Context for Trust Development , 2001 .

[64]  Colin Camerer,et al.  Not So Different After All: A Cross-Discipline View Of Trust , 1998 .

[65]  B. J. Fogg,et al.  Credibility and computing technology , 1999, CACM.

[66]  Chinho Lin,et al.  The Mediate Effect of Trust on Organizational Online Knowledge Sharing: an Empirical Study , 2010, Int. J. Inf. Technol. Decis. Mak..

[67]  Judith Baker,et al.  TRUST AND RATIONALITY , 1987 .

[68]  J. H. Davis,et al.  An Integrative Model Of Organizational Trust , 1995 .

[69]  J. Popay,et al.  Rationale and Standards for the Systematic Review of Qualitative Literature in Health Services Research , 1998, Qualitative health research.

[70]  S. Boslaugh Secondary data sources for public health , 2007 .

[71]  Ayoung Yoon End users’ trust in data repositories: definition and influences on trust development , 2014 .

[72]  Sara Lichtenwalter,et al.  SECONDARY ANALYSIS IN SOCIAL WORK RESEARCH EDUCATION: PAST, PRESENT, AND FUTURE PROMISE , 2006 .

[73]  Heather L. Coates Ensuring research integrity The role of data management in current crises , 2014 .

[74]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[75]  M. Leininger Qualitative research methods in Nursing , 1985 .

[76]  James A. Narus,et al.  A Model of Distributor Firm and Manufacturer Firm Working Partnerships , 1990 .

[77]  R. Lewicki,et al.  Developing and Maintaining Trust in Work Relationships , 1996 .

[78]  Ruth E. Duerr,et al.  Challenges in Long-Term Data Stewardship , 2004, MSST.

[79]  M. Gelfand,et al.  How Do I Trust Thee? Dynamic Trust Profiles and Their Individual and Social Contextual Determinants , 2011 .

[80]  Geoff Walsham,et al.  The Emergence of Interpretivism in IS Research , 1995, Inf. Syst. Res..

[81]  Daniel J. McAllister Affect- and Cognition-Based Trust as Foundations for Interpersonal Cooperation in Organizations , 1995 .

[82]  Nicholas J. Belkin,et al.  Understanding Judgment of Information Quality and Cognitive Authority in the WWW , 1998 .

[83]  M. Sako,et al.  Prices, Quality and Trust: Inter-Firm Relations in Britain and Japan. , 1994 .

[84]  Gail Steinhart,et al.  Digital Research Data Curation: Overview of Issues, Current Activities, and Opportunities for the Cornell University Library , 2008 .

[85]  H S Wilson,et al.  Validity threats in scheduled semistructured research interviews. , 1992, Nursing research.

[86]  Deepak Malhotra,et al.  Foundations of Organizational Trust: What Matters to Different Stakeholders? , 2010, Organ. Sci..

[87]  Ixchel M. Faniel,et al.  Reusing Scientific Data: How Earthquake Engineering Researchers Assess the Reusability of Colleagues’ Data , 2010, Computer Supported Cooperative Work (CSCW).

[88]  J. Rotter A new scale for the measurement of interpersonal trust. , 1967, Journal of personality.

[89]  L. Gitelman "Raw Data" Is an Oxymoron , 2013 .

[90]  Charles J. Kacmar,et al.  Factors of Information Credibility for an Internet Advice Site , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[91]  David L. Altheide,et al.  Criteria for assessing interpretive validity in qualitative research. , 1994 .

[92]  Lars Hertzberg,et al.  On the attitude of trust , 1988 .

[93]  Nancy A. Van House,et al.  Cooperative knowledge work and practices of trust: sharing environmental planning data sets , 1998, CSCW '98.

[94]  N. Luhmann Trust and Power , 1979 .

[95]  Greg Janée Preserving Geospatial Data: The National Geospatial Digital Archive’s Approach , 2009 .

[96]  Margaret Hedstrom,et al.  The application of archival concepts to a data-intensive environment: working with scientists to understand data management and preservation needs , 2011 .

[97]  Melissa H. Cragin,et al.  Scientific Data Collections and Distributed Collective Practice , 2006, Computer Supported Cooperative Work (CSCW).

[98]  Stephen Marsh,et al.  The role of trust in information science and technology , 2005, Annu. Rev. Inf. Sci. Technol..

[99]  Daniel Benediktsson,et al.  Hermeneutics: dimensions toward LIS thinking , 1989 .

[100]  Helena Karasti,et al.  Digital Data Practices and the Long Term Ecological Research Program Growing Global , 2008, Int. J. Digit. Curation.

[101]  Rino Falcone,et al.  Trust in information sources as a source for trust: a fuzzy approach , 2003, AAMAS '03.

[102]  Sophia Lafferty-Hess Sharing Research Data in the Social Sciences: The Role of Journal Policies , 2014 .

[103]  Sarah Callaghan Data without Peer: Examples of Data Peer Review in the Earth Sciences , 2015, D Lib Mag..

[104]  Martin Pilgram,et al.  Consultative Committee For Space Data Systems , 2009 .

[105]  Vincent S Smith,et al.  Data publication: towards a database of everything , 2009, BMC Research Notes.

[106]  Micah Altman,et al.  A Proposed Standard for the Scholarly Citation of Quantitative Data , 2008 .

[107]  Nele Boelaert,et al.  ATLAS offline data quality monitoring , 2010 .

[108]  Devan Ray Donaldson,et al.  Provenance, End-User Trust and Reuse: An Empirical Investigation , 2011, TaPP.

[109]  J. Brown,et al.  Knowledge and Organization: A Social-Practice Perspective , 2001 .

[110]  Carina Lansing,et al.  Capturing and supporting contexts for scientific data sharing via the biological sciences collaboratory , 2004, CSCW.

[111]  N. L. Chervany,et al.  Initial Trust Formation in New Organizational Relationships , 1998 .

[112]  Noel Enyedy,et al.  Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries , 2007, International Journal on Digital Libraries.

[113]  Ann S. Zimmerman,et al.  DATA SHARING AND SECONDARY USE OF SCIENTIFIC DATA: EXPERIENCES OF ECOLOGISTS , 2003 .

[114]  Gareth R. Jones,et al.  The experience and evolution of trust: Implications for cooperation and teamwork , 1998 .

[115]  Noel Enyedy,et al.  Building Digital Libraries for Scientific Data: An Exploratory Study of Data Practices in Habitat Ecology , 2006, ECDL.

[116]  J. Morse,et al.  Verification Strategies for Establishing Reliability and Validity in Qualitative Research , 2002 .

[117]  Matthew S. Mayernik,et al.  Moving Archival Practices Upstream: An Exploration of the Life Cycle of Ecological Sensing Data in Collaborative Field Research , 2008, Int. J. Digit. Curation.

[118]  J. Willis Foundations of Qualitative Research: Interpretive and Critical Approaches , 2007 .

[119]  Jonathan A. Smith Beyond the divide between cognition and discourse: using interpretative phenomenological analysis in health psychology , 1996 .

[120]  Olli Lagenspetz,et al.  Legitimacy and Trust , 1992 .

[121]  Elizabeth Yakel,et al.  Trust in Digital Repositories , 2013, Int. J. Digit. Curation.

[122]  Christine L. Borgman,et al.  The conundrum of sharing research data , 2012, J. Assoc. Inf. Sci. Technol..

[123]  P. Haas Introduction: epistemic communities and international policy coordination , 1992, International Organization.

[124]  J. Rotter Generalized expectancies for interpersonal trust. , 1971 .

[125]  Neil Beagrie,et al.  Digital Curation for Science, Digital Libraries, and Individuals , 2008, Int. J. Digit. Curation.

[126]  J. Lewis,et al.  Trust as a Social Reality , 1985 .

[127]  William A. Wallace,et al.  Trust in digital information , 2008, J. Assoc. Inf. Sci. Technol..

[128]  Ian Alexander,et al.  An introduction to qualitative research , 2000, Eur. J. Inf. Syst..

[129]  S. Molyneux-Hodgson,et al.  Introduction: The Dynamics of Epistemic Communities , 2010 .

[130]  E P HOLLANDER,et al.  Conformity, status, and idiosyncrasy credit. , 1958, Psychological review.

[131]  Sue Newell,et al.  Back to the future: from knowledge management to the management of information and data , 2003, Inf. Syst. E Bus. Manag..

[132]  Ann Zimmerman,et al.  Not by metadata alone: the use of diverse forms of knowledge to locate data for reuse , 2007, International Journal on Digital Libraries.

[133]  Jonathan A. Smith,et al.  Interpretative phenomenological analysis. , 2008, Qualitative research in psychology: Expanding perspectives in methodology and design (2nd ed.)..

[134]  Margaret Hedstrom,et al.  Incentives for Data Producers to Create "Archive-Ready" Data: Implications for Archives and Records Management , 2008 .

[135]  Sarah Higgins,et al.  The dcc curation lifecycle model , 2008, JCDL '08.

[136]  A. Baier Trust and Antitrust , 1986, Ethics.

[137]  Andrew C. Simpson,et al.  Collaboration and Trust in Healthcare Innovation: The eDiaMoND Case Study , 2005, Computer Supported Cooperative Work (CSCW).

[138]  Donald Hislop,et al.  The Paradox of Communities of Practice: Knowledge Sharing Between Communities , 2004 .

[139]  N. House Digital libraries and practices of trust: Networked biodiversity information , 2002 .

[140]  Morten Hertzum,et al.  Trust in information sources: seeking information from people, documents, and virtual agents , 2002, Interact. Comput..

[141]  William A. Wallace,et al.  Trust in electronic environments , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[142]  Allan J. Magrath,et al.  A Strategic Paradigm for Predicting Manufacturer‐Reseller Conflict , 1989 .

[143]  Jonathan A. Smith Hermeneutics, human sciences and health: linking theory and practice , 2007 .

[144]  Deepak Malhotra,et al.  NORMAL ACTS OF IRRATIONAL TRUST: MOTIVATED ATTRIBUTIONS AND THE TRUST DEVELOPMENT PROCESS , 2004 .

[145]  Kalpana Shankar Order from chaos: The poetics and pragmatics of scientific recordkeeping , 2007 .

[146]  A. Sullivan,et al.  Economic and Social Research Council. , 2017 .

[147]  D. Good,et al.  Individuals, Interpersonal Relations, and Trust , 2000 .

[148]  Alexander S. Szalay,et al.  Online scientific data curation, publication, and archiving , 2002, SPIE Astronomical Telescopes + Instrumentation.

[149]  Jinfang Niu,et al.  Overcoming inadequate documentation , 2009, ASIST.

[150]  E. Erikson,et al.  Childhood and Society , 1951 .

[151]  S. Hilgartner,et al.  Data withholding in academic genetics: evidence from a national survey. , 2002, JAMA.

[152]  Jan Maarten Schraagen,et al.  Factual accuracy and trust in information: The role of expertise , 2011, J. Assoc. Inf. Sci. Technol..

[153]  Dharma Akmon,et al.  The Role of Conceptions of Value in Data Practices: A Multi-Case Study of Three Small Teams of Ecological Scientists , 2014 .

[154]  Edward C. Tomlinson,et al.  The Role Of Causal Attribution Dimensions In Trust Repair , 2009 .

[155]  M. Appelbaum,et al.  Some issues of conducting secondary analyses , 1991 .

[156]  Ian V. Cornelius Meaning and Method in Information Studies , 1996 .

[157]  J. G. Holmes,et al.  Trust in close relationships. , 1985 .

[158]  Barton A. Weitz,et al.  Determinants of Continuity in Conventional Industrial Channel Dyads , 1989 .

[159]  J. Budd An Epistemological Foundation for Library and Information Science , 1995, The Library Quarterly.

[160]  Anne E. Trefethen,et al.  The Data Deluge: An e-Science Perspective , 2003 .

[161]  S. Fienberg,et al.  Sharing research data , 1985 .

[162]  Glenn Dingwall Trusting Archivists: The Role of Archival Ethics Codes in Establishing Public Faith , 2007 .

[163]  Xubin Zeng,et al.  Environmental data management at NOAA: Archiving, stewardship, and access , 2007 .

[164]  Christine L. Borgman,et al.  The Digital Future is Now: A Call to Action for the Humanities , 2009, Digit. Humanit. Q..

[165]  Aneil Mishra ORGANIZATIONAL RESPONSES TO CRISIS: THE CENTRALITY OF TRUST , 1996 .

[166]  D. Kleppner Ensuring the integrity, accessibility, and stewardship of research data in the digital age , 2010 .

[167]  Nithya Ramanathan,et al.  Know Thy Sensor: Trust, Data Quality, and Data Integrity in Scientific Digital Libraries , 2007, ECDL.

[168]  S. Hunt,et al.  The Commitment-Trust Theory of Relationship Marketing , 1994 .

[169]  Roderick M. Kramer,et al.  Swift trust and temporary groups. , 1996 .

[170]  Ben Anderson,et al.  What Are Data? The Many Kinds of Data and Their Implications for Data Re-Use , 2007, J. Comput. Mediat. Commun..

[171]  M. Lynne Markus,et al.  Toward A Theory of Knowledge Reuse : Types of Knowledge Reuse Situations and Factors in Reuse Success , 2022 .

[172]  P. Bryan Heidorn,et al.  The Emerging Role of Libraries in Data Curation and E-science , 2011 .

[173]  E. Buchborn [Trust and distrust]. , 1983, MMW, Munchener medizinische Wochenschrift.

[174]  G. Rolfe Validity, trustworthiness and rigour: quality and the idea of qualitative research. , 2006, Journal of advanced nursing.

[175]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.

[176]  Joacim Hansson,et al.  Hermeneutics as a bridge between the modern and the postmodern in library and information science , 2005, J. Documentation.

[177]  Key Perspectives Ltd Data dimensions: disciplinary differences in research data sharing, reuse and long term viability , 2010 .

[178]  Colin Sharp Qualitative Research and Evaluation Methods (3rd ed.) , 2003 .

[179]  N. L. Chervany,et al.  Initial trust formation in new organizational relationships, Academy of Management Review, , . , 1998 .

[180]  Myron P. Gutmann,et al.  The selection, appraisal, and retention of social science data , 2004, Data Sci. J..

[181]  E. Wenger,et al.  cultivating communities of practice , 2002 .

[182]  Dale E. Zand Trust and Managerial Problem Solving , 1972 .

[183]  Joseph P. Cannon,et al.  An Examination of the Nature of Trust in Buyer–Seller Relationships: , 1997 .

[184]  Megan Tschannen-Moran,et al.  Collaboration and the need fortrust , 2001 .

[185]  S. Sitkin,et al.  Explaining the Limited Effectiveness of Legalistic “Remedies” for Trust/Distrust , 1993 .

[186]  Ann Zimmerman,et al.  New Knowledge from Old Data , 2008 .

[187]  Birgit Renzl Trust in management and knowledge sharing: The mediating effects of fear and knowledge documentation , 2008 .

[188]  Phyllis S. Glaeser Scientific and technical data in a new era : proceedings of the Eleventh International CODATA Conference, Karlsruhe, Federal Republic of Germany, 26-29 September 1988 , 1990 .

[189]  Jinfang Niu,et al.  Documentation evaluation model for social science data , 2008, ASIST.

[190]  A. Giddens The consequences of modernity , 1990 .

[191]  Anna Keller Gold Libraries, process, and data , 2013, ASIST.

[192]  C. I. Hovland,et al.  Social Judgment: Assimilation and Contrast Effects in Communication and Attitude Change , 1981 .

[193]  M. Manen Researching Lived Experience: Human Science for an Action Sensitive Pedagogy , 1990 .