Open innovation: Towards sharing of data, models and workflows

Abstract Sharing of resources across organisations to support open innovation is an old idea, but which is being taken up by the scientific community at increasing speed, concerning public sharing in particular. The ability to address new questions or provide more precise answers to old questions through merged information is among the attractive features of sharing. Increased efficiency through reuse, and increased reliability of scientific findings through enhanced transparency, are expected outcomes from sharing. In the field of pharmacometrics, efforts to publicly share data, models and workflow have recently started. Sharing of individual‐level longitudinal data for modelling requires solving legal, ethical and proprietary issues similar to many other fields, but there are also pharmacometric‐specific aspects regarding data formats, exchange standards, and database properties. Several organisations (CDISC, C‐Path, IMI, ISoP) are working to solve these issues and propose standards. There are also a number of initiatives aimed at collecting disease‐specific databases – Alzheimer's Disease (ADNI, CAMD), malaria (WWARN), oncology (PDS), Parkinson's Disease (PPMI), tuberculosis (CPTR, TB‐PACTS, ReSeqTB) – suitable for drug‐disease modelling. Organized sharing of pharmacometric executable model code and associated information has in the past been sparse, but a model repository (DDMoRe Model Repository) intended for the purpose has recently been launched. In addition several other services can facilitate model sharing more generally. Pharmacometric workflows have matured over the last decades and initiatives to more fully capture those applied to analyses are ongoing. In order to maximize both the impact of pharmacometrics and the knowledge extracted from clinical data, the scientific community needs to take ownership of and create opportunities for open innovation. Graphical abstract No caption available.

[1]  Heather A. Piwowar,et al.  Sharing Detailed Research Data Is Associated with Increased Citation Rate , 2007, PloS one.

[2]  Farid Neema,et al.  Data sharing , 1998 .

[3]  Tobias Faust Anonymization Of Electronic Medical Records To Support Clinical Analysis , 2016 .

[4]  F. Prinz,et al.  Believe it or not: how much can we rely on published data on potential drug targets? , 2011, Nature Reviews Drug Discovery.

[5]  J. Grefenstette,et al.  A systematic review of barriers to data sharing in public health , 2014, BMC Public Health.

[6]  JA Rassen,et al.  Transparency and Reproducibility of Observational Cohort Studies Using Large Healthcare Databases. , 2016, Clinical pharmacology and therapeutics.

[7]  Robert W. Corty,et al.  The project data sphere initiative: accelerating cancer research by sharing data. , 2015, The oncologist.

[8]  JJ Wilkins,et al.  Thoughtflow: Standards and Tools for Provenance Capture and Workflow Definition to Support Model‐Informed Drug Discovery and Development , 2017, CPT: pharmacometrics & systems pharmacology.

[9]  B. Malin,et al.  Anonymization of electronic medical records for validating genome-wide association studies , 2010, Proceedings of the National Academy of Sciences.

[10]  Martha Brumfield,et al.  The Critical Path Institute: transforming competitors into collaborators , 2014, Nature Reviews Drug Discovery.

[11]  Yihui Xie,et al.  knitr: A Comprehensive Tool for Reproducible Research in R , 2018, Implementing Reproducible Research.

[12]  Elizabeth Gilbert,et al.  Reproducibility Project: Results (Part of symposium called "The Reproducibility Project: Estimating the Reproducibility of Psychological Science") , 2014 .

[13]  Michael W. Weiner,et al.  2014 Update of the Alzheimer's Disease Neuroimaging Initiative: A review of papers published since its inception , 2015, Alzheimer's & Dementia.

[14]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[15]  Brian A. Nosek,et al.  Power failure: why small sample size undermines the reliability of neuroscience , 2013, Nature Reviews Neuroscience.

[16]  A. Singleton,et al.  The Parkinson Progression Marker Initiative (PPMI) , 2011, Progress in Neurobiology.

[17]  Krzysztof J. Gorgolewski,et al.  A Practical Guide for Improving Transparency and Reproducibility in Neuroimaging Research , 2016, bioRxiv.

[18]  Marcia McNutt,et al.  Data sharing , 2016, Science.

[19]  Charles W. Dement,et al.  Engineering a Pharmacometrics Enterprise , 2006 .

[20]  Kristian Thorlund,et al.  Reanalyses of randomized clinical trial data. , 2014, JAMA.

[21]  C. Begley,et al.  Drug development: Raise standards for preclinical cancer research , 2012, Nature.

[22]  S. Marshall,et al.  Good Practices in Model‐Informed Drug Discovery and Development: Practice, Application, and Documentation , 2016, CPT: pharmacometrics & systems pharmacology.

[23]  Heather A. Piwowar,et al.  Data reuse and the open data citation advantage , 2013, PeerJ.

[24]  Bartha M Knoppers,et al.  Data Sharing - Is the Juice Worth the Squeeze? , 2016, The New England journal of medicine.

[25]  Leonid Churilov,et al.  Risk of Bias in Reports of In Vivo Research: A Focus for Improvement , 2015, PLoS biology.

[26]  Justin J. Wilkins,et al.  NONMEMory: A run management tool for NONMEM , 2005, Comput. Methods Programs Biomed..

[27]  Arthur W. Toga,et al.  Precompetitive Data Sharing as a Catalyst to Address Unmet Needs in Parkinson’s Disease , 2015, Journal of Parkinson's disease.

[28]  Carole A. Goble,et al.  The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud , 2013, Nucleic Acids Res..

[29]  Steven Kern,et al.  Data Sharing and Inductive Learning--Toward Healthy Birth, Growth, and Development. , 2016, The New England journal of medicine.

[30]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[31]  Jeremy A. Rassen,et al.  Transparency and reproducibility of published observational cohort studies , 2015 .

[32]  Ira Shoulson,et al.  Approaches and costs for sharing clinical research data. , 2014, JAMA.

[33]  Jos H. Beijnen,et al.  Piraña and PCluster: A modeling environment and cluster infrastructure for NONMEM , 2011, Comput. Methods Programs Biomed..