Evaluation of Clinical Trial Data Sharing Policy in Leading Medical Journals

Background. The benefits of responsible sharing of individual-participant data (IPD) from clinical studies are well recognized, but stakeholders often disagree on how to balance those benefits against privacy risks, costs, and incentives for clinical trialists and sponsors. The International Committee of Medical Journal Editors (ICMJE) requires a data sharing statement (DSS) from submissions reporting clinical trials, effective July 1, 2018. We set out to evaluate the implementation of this policy in three leading medical journals: JAMA, Lancet, and the New England Journal of Medicine (NEJM).

Methods. A MEDLINE/PubMed search of clinical trials published in the three journals between July 1, 2018 and April 4, 2020 identified 487 eligible trials (JAMA n = 112, Lancet n = 147, NEJM n = 228). Two reviewers evaluated each of the 487 articles independently. Captured outcomes were declared data availability, data type, access, conditions and reasons for data (un)availability, and funding sources.

Findings. Of the 487 articles, 334 (68.6%; 95% confidence interval (CI), 64.1%-72.5%) declared data sharing. Non-industry NIH-funded trials had the highest rate of declared data sharing (88.9%; 95% CI, 80.0%-97.8%) and industry-funded trials the lowest (61.3%; 95% CI, 54.3%-68.3%). However, only two IPD datasets were actually deidentified and publicly available as of April 10, 2020. The remaining datasets were declared to be accessible on request to the authors (42.8%, 143/334), through a repository (26.6%, 89/334), or through a company (23.4%, 78/334). Among the 89 articles declaring that IPD would be stored in a repository, only 17 (19.1%) had deposited data; the missing deposits were attributed mostly to embargo and regulatory approval. An embargo was set in 47.3% (158/334) of data-sharing articles, and in half of these the period exceeded 1 year or was unspecified.

Interpretation. Most trials published in JAMA, Lancet, and NEJM after the implementation of the ICMJE policy declared their intent to make data available. However, a wide gap exists between declared and actual data sharing. To improve transparency and data reuse, journals should promote the use of unique pointers to dataset location and standardized choices for embargo periods and access requirements. All data, code, and materials used in this analysis are available on OSF at https://osf.io/s5vbg/.
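The proportions reported in the Findings can be rechecked directly from the counts given above. The following is a minimal sketch, not the authors' analysis code: the function name `proportion_with_wald_ci` is hypothetical, and the normal-approximation (Wald) interval is an assumption, since the abstract does not state how the confidence intervals were computed. The interval bounds it produces therefore need not match the reported ones exactly.

```python
import math


def proportion_with_wald_ci(successes, total, z=1.96):
    """Return (proportion, lower, upper) using a normal-approximation interval.

    The Wald interval is an assumption made for illustration; the abstract
    does not say which interval method was used.
    """
    p = successes / total
    half_width = z * math.sqrt(p * (1 - p) / total)
    return p, max(0.0, p - half_width), min(1.0, p + half_width)


# Counts taken from the abstract.
trials_per_journal = {"JAMA": 112, "Lancet": 147, "NEJM": 228}
total_trials = sum(trials_per_journal.values())  # 487 eligible trials
declared_sharing = 334

# Access routes declared among the 334 data-sharing articles.
access_routes = {"request to authors": 143, "repository": 89, "company": 78}

p, lower, upper = proportion_with_wald_ci(declared_sharing, total_trials)
print(f"Declared sharing: {100 * p:.1f}% (Wald 95% CI {100 * lower:.1f}%-{100 * upper:.1f}%)")

for route, count in access_routes.items():
    print(f"{route}: {100 * count / declared_sharing:.1f}% ({count}/{declared_sharing})")

# Repository deposits and embargoes, also from the abstract.
print(f"Deposited in repository: {100 * 17 / 89:.1f}% (17/89)")
print(f"Embargo set: {100 * 158 / declared_sharing:.1f}% (158/{declared_sharing})")
```

The point estimates depend only on the counts, so they can be compared with the abstract directly; any difference in the interval bounds reflects the choice of interval method, not the underlying data.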

[1]  Michael L. Waskom,et al.  mwaskom/seaborn: v0.10.1 (April 2020) , 2020 .

[2]  A. Butte,et al.  Time for NIH to lead on data sharing , 2020, Science.

[3]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[4]  Barbara McGillivray,et al.  The citation advantage of linking publications to research data , 2019, PloS one.

[5]  J. Chang-Claude,et al.  Sharing data safely while preserving privacy , 2019, The Lancet.

[6]  Barbara E. Bierer,et al.  Credit data generators for data reuse , 2019, Nature.

[7]  C. Ohmann,et al.  Evaluation of repositories for sharing individual-participant data from clinical studies , 2019, Trials.

[8]  Fyodor Lukyanov Everything on Display , 2019 .

[9]  Harlan M Krumholz,et al.  Overview and experience of the YODA Project with clinical trial data sharing after 5 years , 2018, Scientific Data.

[10]  S. Goodman,et al.  Clinical Trial Participants’ Views of the Risks and Benefits of Data Sharing , 2018, The New England journal of medicine.

[11]  Christopher W. Belter,et al.  Data sharing in PLOS ONE: An analysis of Data Availability Statements , 2018, PloS one.

[12]  David Moher,et al.  Assessing scientists for hiring, promotion, and tenure , 2018, PLoS biology.

[13]  David Moher,et al.  Data sharing and reanalysis of randomized controlled trials in leading biomedical journals with a full data sharing policy: survey of studies published in The BMJ and PLOS Medicine , 2018, British Medical Journal.

[14]  Trialists’ Intent to Share Individual Participant Data as Disclosed at ClinicalTrials.gov , 2018, JAMA.

[15]  Dipak Kalra,et al.  Sharing and reuse of individual participant data from clinical trials: principles and recommendations , 2017, BMJ Open.

[16]  R. Kiley,et al.  Data Sharing from Clinical Trials — A Research Funder's Perspective , 2017, The New England journal of medicine.

[17]  A. Marson,et al.  Resource implications of preparing individual participant data from a clinical trial to share with external researchers , 2017, Trials.

[18]  Fiona Godlee,et al.  Data Sharing Statements for Clinical Trials: A Requirement of the International Committee of Medical Journal Editors. , 2017, JAMA.

[19]  H. Bauchner,et al.  Data Sharing Statements for Clinical Trials - A Requirement of the International Committee of Medical Journal Editors. , 2017, New England Journal of Medicine.

[20]  Fiona Godlee,et al.  Data Sharing Statements for Clinical Trials: A Requirement of the International Committee of Medical Journal Editors , 2017, Journal of Korean medical science.

[21]  H. Bauchner,et al.  Data sharing statements for clinical trials: a requirement of the International Committee of Medical Journal Editors , 2017, The Lancet.

[22]  Lisa Rosenbaum,et al.  Bridging the Data-Sharing Divide - Seeing the Devil in the Details, Not the Other Camp. , 2017, The New England journal of medicine.

[23]  George A. Mensah,et al.  Use of the National Heart, Lung, and Blood Institute Data Repository , 2017, The New England journal of medicine.

[24]  Mercè Crosas,et al.  Data Authorship as an Incentive to Data Sharing. , 2017, The New England journal of medicine.

[25]  Mercè Crosas,et al.  Data Authorship as an Incentive to Data Sharing. , 2017, The New England journal of medicine.

[26]  Bartha M Knoppers,et al.  Data Sharing - Is the Juice Worth the Squeeze? , 2016, The New England journal of medicine.

[27]  Anisa Rowhani-Farid,et al.  Has open data arrived at the British Medical Journal (BMJ)? An observational study , 2016, BMJ Open.

[28]  Frank Rockhold,et al.  Data Sharing at a Crossroads. , 2016, The New England journal of medicine.

[29]  G. Guyatt,et al.  Toward Fairness in Data Sharing. , 2016, The New England journal of medicine.

[30]  I. Boutron,et al.  Sharing of Data From Industry-Funded Registered Clinical Trials. , 2016, JAMA.

[31]  H. Bauchner,et al.  Sharing Clinical Trial Data: A Proposal from the International Committee of Medical Journal Editors , 2016, PLoS medicine.

[32]  Michael J. Pencina,et al.  Use of Open Access Platforms for Clinical Trial Data. , 2016, JAMA.

[33]  Howard Bauchner,et al.  Data Sharing: An Ethical and Scientific Imperative. , 2016, JAMA.

[34]  John Fletcher,et al.  Sharing Clinical Trial Data: A Proposal from the International Committee of Medical Journal Editors , 2016, The National medical journal of India.

[35]  et al.,et al.  Jupyter Notebooks - a publishing format for reproducible computational workflows , 2016, ELPUB.

[36]  Stephen E. Fienberg,et al.  Self-correction in science at work , 2015, Science.

[37]  Brian A. Nosek,et al.  Promoting an open research culture , 2015, Science.

[38]  B. Lo Sharing clinical trial data: maximizing benefits, minimizing risk. , 2015, JAMA.

[39]  Frank W. Rockhold,et al.  Bumps and bridges on the road to responsible sharing of clinical trial data , 2014, Clinical trials.

[40]  Richard Van Noorden Data-sharing: Everything on display , 2013 .

[41]  Iain Hrynaszkiewicz,et al.  Sharing of clinical trial data among trialists: a cross sectional survey , 2012, BMJ : British Medical Journal.

[42]  Tanneguy Redarce,et al.  Automatic Lip-Contour Extraction and Mouth-Structure Segmentation in Images , 2011, Computing in Science & Engineering.

[43]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[44]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[45]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.