Open Data Commons for Preclinical Traumatic Brain Injury Research: Empowering Data Sharing and Big Data Analytics

Traumatic brain injury (TBI) is a major unsolved public health problem worldwide with considerable preclinical research dedicated to recapitulating clinical TBI, deciphering the underlying pathophysiology, and developing therapeutics. However, the heterogeneity of clinical TBI and correspondingly in preclinical studies have made translation from bench to bedside difficult. Here, we present the potential of data sharing, data aggregation, and multivariate analytics to integrate heterogeneity and empower researchers. We introduce the Open Data Commons for Traumatic Brain Injury (ODC-TBI.org) as a user-centered web platform and cloudbased repository focused on preclinical TBI research that enables data citation with persistent identifiers, promotes data element harmonization, and follows FAIR data sharing principles. Importantly, the ODC-TBI implements data sharing at the level of individual subjects, thus enabling data reuse for granular big data analytics and data-hungry machine learning approaches. We provide use cases applying descriptive analytics and unsupervised machine learning on pooled ODC-TBI data. Descriptive statistics included subject-level data for 11 published papers (N = 1250 subjects) representing six distinct TBI models across mice and rats (implementing controlled cortical impact, closed head injury, fluid percussion injury, and CHIMERA TBI modalities). We performed principal component analysis (PCA) on cohorts of animals combined through the ODC-TBI to identify persistent inflammatory patterns across different experimental designs. Our workflow ultimately improved the sensitivity of our analyses in uncovering patterns of pro- vs anti-inflammation and oxidative stress without the multiple testing problems of univariate analyses. As the practice of open data becomes increasingly required by the scientific community, ODC-TBI provides a foundation that creates new scientific opportunities for researchers and their work, facilitates multi-dataset and multidimensional analytics, and drives collaboration across molecular and computational biologists to bridge preclinical research to the clinic.

[1]  Adam R Ferguson,et al.  Preclinical Common Data Elements for Traumatic Brain Injury Research: Progress and Use Cases. , 2020, Journal of Neurotrauma.

[2]  Carlos A Almeida,et al.  Data Dissemination: Shortening the Long Tail of Traumatic Brain Injury Dark Data. , 2020, Journal of neurotrauma.

[3]  Adam R Ferguson,et al.  Reproducible analysis of disease space via principal components using the novel R package syndRomics , 2020, eLife.

[4]  Adam R Ferguson,et al.  Statistical guidelines for handing missing data in traumatic brain injury clinical research. , 2020, Journal of neurotrauma.

[5]  Hester F. Lingsma,et al.  Common Data Elements: Critical Assessment of Harmonization between Current Multi-Center Traumatic Brain Injury Studies , 2020, Journal of neurotrauma.

[6]  A. Lee,et al.  Traumatic Brain Injuries: Pathophysiology and Potential Therapeutic Targets , 2019, Front. Cell. Neurosci..

[7]  Kohske Takahashi,et al.  Welcome to the Tidyverse , 2019, J. Open Source Softw..

[8]  Hester F. Lingsma,et al.  Case-mix, care pathways, and outcomes in patients with traumatic brain injury in CENTER-TBI: a European prospective, multicentre, longitudinal, cohort study , 2019, The Lancet Neurology.

[9]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[10]  Amit Agrawal,et al.  Estimating the global incidence of traumatic brain injury. , 2019, Journal of neurosurgery.

[11]  J Russell Huie,et al.  Testing a Multivariate Proteomic Panel for Traumatic Brain Injury Biomarker Discovery: A TRACK-TBI Pilot Study. , 2019, Journal of neurotrauma.

[12]  J Russell Huie,et al.  Neurotrauma as a big-data problem , 2018, Current opinion in neurology.

[13]  C. Lemere,et al.  Traumatic Brain Injury in Aged Mice Induces Chronic Microglia Activation, Synapse Loss, and Complement-Dependent Memory Deficits , 2018, International journal of molecular sciences.

[14]  V. Sohal,et al.  Repeated Mild Head Injury Leads to Wide-Ranging Deficits in Higher-Order Cognitive Functions Associated with the Prefrontal Cortex. , 2018, Journal of neurotrauma.

[15]  Nicholas J Tierney,et al.  Expanding Tidy Data Principles to Facilitate Missing Data Exploration, Visualization and Assessment of Imputations , 2018, J. Stat. Softw..

[16]  S. Rosi,et al.  Persistent Infiltration and Impaired Response of Peripherally-Derived Monocytes after Traumatic Brain Injury in the Aged Brain , 2018, International journal of molecular sciences.

[17]  Iain Hrynaszkiewicz,et al.  Whitepaper: Practical challenges for researchers in data sharing , 2018 .

[18]  Iain Hrynaszkiewicz,et al.  Practical challenges for researchers in data sharing , 2018 .

[19]  Kara H. Woo,et al.  Data Organization in Spreadsheets , 2018 .

[20]  C. Najac,et al.  In vivo metabolic imaging of Traumatic Brain Injury , 2017, Scientific Reports.

[21]  Alison Callahan,et al.  Developing a data sharing community for spinal cord injury research , 2017, Experimental Neurology.

[22]  Denes V Agoston,et al.  Big Data in traumatic brain injury; promise and challenges , 2017, Concussion.

[23]  P. Walter,et al.  Inhibition of the integrated stress response reverses cognitive deficits after traumatic brain injury , 2017, Proceedings of the National Academy of Sciences.

[24]  B. Stoica,et al.  Microglial/Macrophage Polarization Dynamics following Traumatic Brain Injury. , 2016, Journal of neurotrauma.

[25]  Adam R Ferguson,et al.  A novel antagonist of p75NTR reduces peripheral expansion and CNS trafficking of pro-inflammatory monocytes and spares function after traumatic brain injury , 2016, Journal of Neuroinflammation.

[26]  S. Rosi,et al.  Age exacerbates the CCR2/5-mediated neuroinflammatory response to traumatic brain injury , 2016, Journal of Neuroinflammation.

[27]  Adam R Ferguson,et al.  A novel inhibitor of p75-neurotrophin receptor improves functional outcomes in two models of traumatic brain injury , 2016, Brain : a journal of neurology.

[28]  Jorge Cadima,et al.  Principal component analysis: a review and recent developments , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[29]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[30]  S. Rosi,et al.  Frontal Lobe Contusion in Mice Chronically Impairs Prefrontal-Dependent Behavior , 2016, PloS one.

[31]  Eric G. Campbell,et al.  The Changing Nature of Scientific Sharing and Withholding in Academic Life Sciences Research: Trends From National Surveys in 2000 and 2013 , 2016, Academic medicine : journal of the Association of American Medical Colleges.

[32]  S. Rosi,et al.  Call Off the Dog(ma): M1/M2 Polarization Is Concurrent following Traumatic Brain Injury , 2016, PloS one.

[33]  D. A. Bergstrom,et al.  Pre-Clinical Traumatic Brain Injury Common Data Elements: Toward a Common Language Across Laboratories. , 2015, Journal of neurotrauma.

[34]  Adam R Ferguson,et al.  Topological data analysis for discovery in preclinical spinal cord injury and traumatic brain injury , 2015, Nature Communications.

[35]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[36]  Stephane Champely,et al.  Basic Functions for Power Analysis , 2015 .

[37]  H. Thompson,et al.  Common Data Elements and Federal Interagency Traumatic Brain Injury Research Informatics System for TBI Research , 2015, Annual Review of Nursing Research.

[38]  L. Latour,et al.  Multivariate Analysis of Traumatic Brain Injury: Development of an Assessment Score , 2015, Front. Neurol..

[39]  P. Kochanek,et al.  Emerging Therapies in Traumatic Brain Injury , 2015, Seminars in Neurology.

[40]  Adam R Ferguson,et al.  CCR2 Antagonism Alters Brain Macrophage Polarization and Ameliorates Cognitive Dysfunction Induced by Traumatic Brain Injury , 2015, The Journal of Neuroscience.

[41]  Adam R Ferguson,et al.  Big data from small data: data-sharing in the 'long tail' of neuroscience , 2014, Nature Neuroscience.

[42]  Shruti V. Kabadi,et al.  PARP-1 inhibition attenuates neuronal loss, microglia activation and neurological deficits after traumatic brain injury. , 2014, Journal of neurotrauma.

[43]  Hester F. Lingsma,et al.  Transforming research and clinical knowledge in traumatic brain injury pilot: multicenter implementation of the common data elements for traumatic brain injury. , 2013, Journal of neurotrauma.

[44]  Stephen D. Larson,et al.  NeuroLex.org: an online framework for neuroscience knowledge , 2013, Front. Neuroinform..

[45]  C. Y. Peng,et al.  Principled missing data methods for researchers , 2013, SpringerPlus.

[46]  Adam R Ferguson,et al.  Derivation of Multivariate Syndromic Outcome Metrics for Consistent Testing across Multiple Models of Cervical Spinal Cord Injury in Rats , 2013, PloS one.

[47]  Michael Chopp,et al.  Animal models of traumatic brain injury , 2013, Nature Reviews Neuroscience.

[48]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[49]  C. Tenopir,et al.  Data Sharing by Scientists: Practices and Perceptions , 2011, PloS one.

[50]  B. Masel,et al.  Traumatic brain injury: a disease process, not an event. , 2010, Journal of neurotrauma.

[51]  E. Erdfelder,et al.  Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses , 2009, Behavior research methods.

[52]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[53]  T. Miller,et al.  Prevalence of Long‐Term Disability From Traumatic Brain Injury in the Civilian Population of the United States, 2005 , 2008, The Journal of head trauma rehabilitation.

[54]  J. Graham,et al.  How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory , 2007, Prevention Science.

[55]  Heather A. Piwowar,et al.  Sharing Detailed Research Data Is Associated with Increased Citation Rate , 2007, PloS one.

[56]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[57]  Christina Gloeckner,et al.  Modern Applied Statistics With S , 2003 .

[58]  R. Gonzalez Applied Multivariate Statistics for the Social Sciences , 2003 .

[59]  T. Frieden Traumatic Brain Injury In the United States: Epidemiology and Rehabilitation , 2015 .

[60]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[61]  G. Manley,et al.  Clinical trials in traumatic brain injury: Past experience and current developments , 2011, Neurotherapeutics.

[62]  M. Chopp,et al.  Emerging treatments for traumatic brain injury. , 2009, Expert opinion on emerging drugs.

[63]  Neeraj,et al.  Digital Commons@Becker Digital Commons@Becker Classification of traumatic brain injury for targeted therapies Classification of traumatic brain injury for targeted therapies Classification of Traumatic Brain Injury for Targeted Therapies , 2008 .

[64]  R. Little A Test of Missing Completely at Random for Multivariate Data with Missing Values , 1988 .

[65]  C. Spearman The proof and measurement of association between two things. By C. Spearman, 1904. , 1987, The American journal of psychology.

[66]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[67]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .