DeID – a data sharing tool for neuroimaging studies

Funding institutions and researchers increasingly expect that data will be shared to increase scientific integrity and provide other scientists with the opportunity to use the data with novel methods that may advance understanding in a particular field of study. In practice, sharing human subject data can be complicated because data must be de-identified prior to sharing. Moreover, integrating varied data types collected in a study can be challenging and time consuming. For example, sharing data from structural imaging studies of a complex disorder requires the integration of imaging, demographic and/or behavioral data in a way that no subject identifiers are included in the de-identified dataset and with new subject labels or identification values that cannot be tracked back to the original ones. We have developed a Java program that users can use to remove identifying information in neuroimaging datasets, while still maintaining the association among different data types from the same subject for further studies. This software provides a series of user interaction wizards to allow users to select data variables to be de-identified, implements functions for auditing and validation of de-identified data, and enables the user to share the de-identified data in a single compressed package through various communication protocols, such as FTPS and SFTP. DeID runs with Windows, Linux, and Mac operating systems and its open architecture allows it to be easily adapted to support a broader array of data types, with the goal of facilitating data sharing. DeID can be obtained at http://www.nitrc.org/projects/deid.

[1]  Captain Y. B. Nusfield Public Health , 1906, Canadian Medical Association journal.

[2]  Alan C. Evans,et al.  Three-Dimensional MRI Atlas of the Human Cerebellum in Proportional Stereotaxic Space , 1999, NeuroImage.

[3]  Anders M. Dale,et al.  A hybrid approach to the Skull Stripping problem in MRI , 2001, NeuroImage.

[4]  W. Drevets Neuroimaging and neuropathological studies of depression: implications for the cognitive-emotional features of mood disorders , 2001, Current Opinion in Neurobiology.

[5]  Stephen M Smith,et al.  Fast robust automated brain extraction , 2002, Human brain mapping.

[6]  Philip S. Yu,et al.  Bottom-up generalization: a data mining solution to privacy protection , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[7]  Scott T. Grafton,et al.  Sharing neuroimaging studies of human cognition , 2004, Nature Neuroscience.

[8]  Roberto J. Bayardo,et al.  Data privacy through optimal k-anonymization , 2005, 21st International Conference on Data Engineering (ICDE'05).

[9]  K. El Emam,et al.  Evaluating Common De-Identification Heuristics for Personal Health Information , 2006, Journal of medical Internet research.

[10]  Philip S. Yu,et al.  Anonymizing Classification Data for Privacy Preservation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[11]  Gregory G. Brown,et al.  A technique for the deidentification of structural brain MR images , 2007, Human brain mapping.

[12]  F. Irani,et al.  Functional Near Infrared Spectroscopy (fNIRS): An Emerging Neuroimaging Technology with Important Applications for the Study of Brain Disorders , 2007, The Clinical neuropsychologist.

[13]  Kevin A. Archie,et al.  The Open-Source Neuroimaging Research Enterprise , 2007, Journal of Digital Imaging.

[14]  Chris Rorden,et al.  Improving Lesion-Symptom Mapping , 2007, Journal of Cognitive Neuroscience.

[15]  Charles A. Nelson Incidental Findings in Magnetic Resonance Imaging (MRI) Brain Research , 2008, The Journal of law, medicine & ethics : a journal of the American Society of Law, Medicine & Ethics.

[16]  A. Levy,et al.  Personal privacy and public health: potential impacts of privacy legislation on health research in Canada. , 2008, Canadian journal of public health = Revue canadienne de sante publique.

[17]  Wei‐Ju Lee,et al.  Incidental findings on brain MRI. , 2008, The New England journal of medicine.

[18]  M. Harris,et al.  Personal Privacy and Public Health , 2008 .

[19]  Kenneth D. Harris,et al.  Data Sharing for Computational Neuroscience , 2008, Neuroinformatics.

[20]  Jean-Pierre Corriveau,et al.  A globally optimal k-anonymity method for the de-identification of health data. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[21]  Charles Hildebolt,et al.  Facial Recognition From Volume-Rendered Magnetic Resonance Imaging Data , 2009, IEEE Transactions on Information Technology in Biomedicine.

[22]  R. Borra,et al.  Incidental findings in brain MRI research: what do we owe our subjects? , 2011, Journal of the American College of Radiology : JACR.

[23]  Paul M. Thompson,et al.  Robust Brain Extraction Across Datasets and Comparison With Publicly Available Methods , 2011, IEEE Transactions on Medical Imaging.

[24]  C. Tenopir,et al.  Data Sharing by Scientists: Practices and Perceptions , 2011, PloS one.

[25]  M. Milham Open Neuroscience Solutions for the Connectome-wide Association Era , 2012, Neuron.

[26]  Russell A. Poldrack,et al.  The future of fMRI in cognitive neuroscience , 2012, NeuroImage.

[27]  Satrajit S. Ghosh,et al.  Data sharing in neuroimaging research , 2012, Front. Neuroinform..

[28]  Oluwasanmi Koyejo,et al.  Toward open sharing of task-based fMRI data: the OpenfMRI project , 2013, Front. Neuroinform..

[29]  Jean-Baptiste Poline,et al.  A simple tool for neuroimaging data sharing , 2014, Front. Neuroinform..