Towards structured sharing of raw and derived neuroimaging data across existing resources

Data sharing efforts increasingly contribute to the acceleration of scientific discovery. Neuroimaging data is accumulating in distributed domain-specific databases and there is currently no integrated access mechanism nor an accepted format for the critically important meta-data that is necessary for making use of the combined, available neuroimaging data. In this manuscript, we present work from the Derived Data Working Group, an open-access group sponsored by the Biomedical Informatics Research Network (BIRN) and the International Neuroimaging Coordinating Facility (INCF) focused on practical tools for distributed access to neuroimaging data. The working group develops models and tools facilitating the structured interchange of neuroimaging meta-data and is making progress towards a unified set of tools for such data and meta-data exchange. We report on the key components required for integrated access to raw and derived neuroimaging data as well as associated meta-data and provenance across neuroimaging resources. The components include (1) a structured terminology that provides semantic context to data, (2) a formal data model for neuroimaging with robust tracking of data provenance, (3) a web service-based application programming interface (API) that provides a consistent mechanism to access and query the data model, and (4) a provenance library that can be used for the extraction of provenance data by image analysts and imaging software developers. We believe that the framework and set of tools outlined in this manuscript have great potential for solving many of the issues the neuroimaging community faces when sharing raw and derived neuroimaging data across the various existing database systems for the purpose of accelerating scientific discovery.

[1]  K Wagner,et al.  The Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC) , 2008, NeuroImage.

[2]  M. Milham Open Neuroscience Solutions for the Connectome-wide Association Era , 2012, Neuron.

[3]  Johan Montagnat,et al.  NeuroLOG: A framework for the sharing and reuse of distributed tools and data in neuroimaging , 2011 .

[4]  David B. Keator,et al.  Federated Web-accessible Clinical Data Management within an Extensible NeuroImaging Database , 2010, Neuroinformatics.

[5]  Eduard H. Hovy,et al.  Knowledge engineering tools for reasoning with scientific observations and interpretations: a neural connectivity use case , 2011, BMC Bioinformatics.

[6]  Tanya M. Teslovich,et al.  Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index , 2010 .

[7]  C. Begley,et al.  Drug development: Raise standards for preclinical cancer research , 2012, Nature.

[8]  Craig A. Knoblock,et al.  Quality-driven geospatial data integration , 2007, GIS.

[9]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[10]  Patrick Valduriez,et al.  A Methodology for Query Reformulation in CIS Using Semantic Knowledge , 1996, Int. J. Cooperative Inf. Syst..

[11]  Arthur W. Toga,et al.  Provenance in neuroimaging , 2008, NeuroImage.

[12]  Simon C. Potter,et al.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis , 2011, Nature.

[13]  Alon Y. Halevy,et al.  The Nimble XML data integration system , 2001, Proceedings 17th International Conference on Data Engineering.

[14]  Martone Maryann NeuroLex.org: Integrating a community-driven neuroscience ontology with the web of linked data , 2011 .

[15]  Hans-Michael Müller,et al.  Federated Access to Heterogeneous Information Resources in the Neuroscience Information Framework (NIF) , 2008, Neuroinformatics.

[16]  Michael S. Gazzaniga,et al.  Databasing fMRI studies — towards a 'discovery science' of brain function , 2002, Nature Reviews Neuroscience.

[17]  Kenneth D. Harris,et al.  Data Sharing for Computational Neuroscience , 2008, Neuroinformatics.

[18]  Satrajit S. Ghosh,et al.  Data sharing in neuroimaging research , 2012, Front. Neuroinform..

[19]  Henrik,et al.  Association analyses of 249,796 individuals reveal eighteen new loci associated with body mass index , 2012 .

[20]  T. Insel,et al.  Limits to growth: why neuroscience needs large-scale science , 2004, Nature Neuroscience.

[21]  Margaret D. King,et al.  The NKI-Rockland Sample: A Model for Accelerating the Pace of Discovery Science in Psychiatry , 2012, Front. Neurosci..

[22]  David B. Keator,et al.  Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid , 2009, Front. Neuroinform..

[23]  K. Selçuk Candan,et al.  Query caching and optimization in distributed mediator systems , 1996, SIGMOD '96.

[24]  Anders D. Børglum,et al.  Genome-wide association study identifies five new schizophrenia loci , 2011, Nature Genetics.

[25]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[26]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[27]  Marisa O. Hollinshead,et al.  Identification of common variants associated with human hippocampal and intracranial volumes , 2012, Nature Genetics.

[28]  Ron Mengelers,et al.  The Effects of FreeSurfer Version, Workstation Type, and Macintosh Operating System Version on Anatomical Volume and Cortical Thickness Measurements , 2012, PloS one.

[29]  Jessica A. Turner,et al.  Neuroscience Data Integration through Mediation: An (F)BIRN Case Study , 2010, Front. Neuroinform..

[30]  Grethe Jeff,et al.  The Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC) , 2010 .

[31]  Paul T. Groth,et al.  The anatomy of a nanopublication , 2010, Inf. Serv. Use.

[32]  Daniel S. Marcus,et al.  The extensible neuroimaging archive toolkit , 2007, Neuroinformatics.

[33]  C. Langlotz RadLex: a new method for indexing online educational materials. , 2006, Radiographics : a review publication of the Radiological Society of North America, Inc.

[34]  Yogesh L. Simmhan,et al.  Special Issue: The First Provenance Challenge , 2008, Concurr. Comput. Pract. Exp..

[35]  Owen Carmichael,et al.  Update on the Magnetic Resonance Imaging core of the Alzheimer's Disease Neuroimaging Initiative , 2010, Alzheimer's & Dementia.

[36]  J B Woodward,et al.  The Functional Magnetic Resonance Imaging Data Center (fMRIDC): the challenges and rewards of large-scale databasing of neuroimaging studies. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[37]  David B. Keator,et al.  XCEDE: An Extensible Schema for Biomedical Data , 2011, Neuroinformatics.

[38]  Alon Y. Halevy,et al.  Data integration and genomic medicine , 2007, J. Biomed. Informatics.

[39]  David B. Keator,et al.  Infrastructure for sharing standardized clinical brain scans across hospitals , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[40]  Craig A. Knoblock,et al.  Retrieving and Integrating Data from Multiple Information Sources , 1993, Int. J. Cooperative Inf. Syst..

[41]  Timothy R. Olsen,et al.  The Extensible Neuroimaging Archive Toolkit: an informatics platform for managing, exploring, and sharing neuroimaging data. , 2007, Neuroinformatics.

[42]  Michele T. Diaz,et al.  Function biomedical informatics research network recommendations for prospective multicenter functional MRI studies , 2012, Journal of magnetic resonance imaging : JMRI.

[43]  C. Chute,et al.  Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium , 2011, Science Translational Medicine.

[44]  Ian Foster,et al.  Special Issue: The First Provenance Challenge , 2008 .

[45]  Christian Windischberger,et al.  Toward discovery science of human brain function , 2010, Proceedings of the National Academy of Sciences.

[46]  Bharat B. Biswal,et al.  Making data sharing work: The FCP/INDI experience , 2013, NeuroImage.

[47]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[48]  Jessica A. Turner,et al.  Impact of scanner hardware and imaging protocol on image quality and compartment volume precision in the ADNI cohort , 2010, NeuroImage.

[49]  Mark E. Schmidt,et al.  The Alzheimer's Disease Neuroimaging Initiative: A review of papers published since its inception , 2012, Alzheimer's & Dementia.

[50]  Hector Garcia-Molina,et al.  Template-based wrappers in the TSIMMIS system , 1997, SIGMOD '97.