A Semantic Cross-Species Derived Data Management Application

Managing dynamic information in large multi-site, multi-species, and multi-discipline consortia is a challenging task for data management applications. Often in academic research studies the goals for informatics teams are to build applications that provide extract-transform-load (ETL) functionality to archive and catalog source data that has been collected by the research teams. In consortia that cross species and methodological or scientific domains, building interfaces that supply data in a usable fashion and make intuitive sense to scientists from dramatically different backgrounds increases the complexity for developers. Further, reusing source data from outside one's scientific domain is fraught with ambiguities in understanding the data types, analysis methodologies, and how to combine the data with those from other research teams. We report on the design, implementation, and performance of a semantic data management application to support the NIMH funded Conte Center at the University of California, Irvine. The Center is testing a theory of the consequences of "fragmented" (unpredictable, high entropy) early-life experiences on adolescent cognitive and emotional outcomes in both humans and rodents. It employs cross-species neuroimaging, epigenomic, molecular, and neuroanatomical approaches in humans and rodents to assess the potential consequences of fragmented unpredictable experience on brain structure and circuitry. To address this multi-technology, multi-species approach, the system uses semantic web techniques based on the Neuroimaging Data Model (NIDM) to facilitate data ETL functionality. We find this approach enables a low-cost, easy to maintain, and semantically meaningful information management system, enabling the diverse research teams to access and use the data.

[1]  Michele T. Diaz,et al.  Function biomedical informatics research network recommendations for prospective multicenter functional MRI studies , 2012, Journal of magnetic resonance imaging : JMRI.

[2]  Yogesh L. Simmhan,et al.  Special Issue: The First Provenance Challenge , 2008, Concurr. Comput. Pract. Exp..

[3]  Ian Foster,et al.  Special Issue: The First Provenance Challenge , 2008 .

[4]  David B. Keator,et al.  A general XML schema and SPM toolbox for storage of neuro-imaging results and anatomical labels , 2007, Neuroinformatics.

[5]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[6]  Paul H. E. Tiesinga,et al.  The Scalable Brain Atlas: Instant Web-Based Access to Public Brain Atlases and Related Content , 2013, Neuroinformatics.

[7]  Timothy R. Olsen,et al.  The Extensible Neuroimaging Archive Toolkit: an informatics platform for managing, exploring, and sharing neuroimaging data. , 2007, Neuroinformatics.

[8]  Robert Quinn,et al.  Comparing rat's to human's age: how old is my rat in people years? , 2005, Nutrition.

[9]  David C. Glahn,et al.  Neuroinformatics Database (NiDB) – A Modular, Portable Database for the Storage, Analysis, and Sharing of Neuroimaging Data , 2013, Neuroinformatics.

[10]  Andre Obenaus,et al.  Fragmentation and unpredictability of early-life experience in mental disorders. , 2012, The American journal of psychiatry.

[11]  Curt A. Sandman,et al.  Stressed-out, or in (utero)? , 2002, Trends in Neurosciences.

[12]  David B. Keator,et al.  Federated Web-accessible Clinical Data Management within an Extensible NeuroImaging Database , 2010, Neuroinformatics.

[13]  Daniel S. Marcus,et al.  The extensible neuroimaging archive toolkit , 2007, Neuroinformatics.

[14]  David B. Keator,et al.  XCEDE: An Extensible Schema for Biomedical Data , 2011, Neuroinformatics.

[15]  David B. Keator,et al.  Towards structured sharing of raw and derived neuroimaging data across existing resources , 2012, NeuroImage.