Clinical Study Data Exchange technologies, based on XML, have improved the data capture phase of clinical data and enabled larger and more diverse longitudinal clinical research studies. There is now a growing interest in this community for solutions based on Semantic Web standards. Healthcare and life sciences metadata resources such as medication classifications are now shared via linked data platforms. The increasing pressure to make clinical trial data more open is another strong incentive for the adoption of linked open data technologies. This paper describes the application of semantic statistics vocabularies to deliver clinical data as linked data in a form that is easy to consume by statisticians and easy to enrich with links to complementary data sources. We combine the strengths of the RDF Data Cube and DDI-RDF vocabularies to propose a Linked Clinical Data Cube (LCDC), a set of modular data cubes that helps us manage the multi-disciplinary nature of the source data. We validate our approach on the Australian, Imaging, Biomarker and Lifestyle study of Ageing (AIBL). This dataset, comprising more than 1600 variables clustered in 25 different sub-domains, has been fully converted into RDF with one general data cube and one specialised data cube for each sub-domain. This implementation demonstrates the effectiveness of the association of the RDF Data Cube and DDI-RDF vocabularies for the publication of large and diverse clinical datasets as linked data. We also show that the structure of the LCDC overcomes the monolithic nature of clinical data exchange standards and expedites the navigation and querying of the data from multiple views.
[1]
Alvaro Graves,et al.
Creation of visualizations based on linked data
,
2013,
WIMS '13.
[2]
Mirina Grosz,et al.
World Wide Web Consortium
,
2010
.
[3]
Hugo Leroux,et al.
On Selecting a Clinical Trial Management System for Large Scale, Multi-Centre, Multi-Modal Clinical Research Study
,
2011,
HIC.
[4]
C. Rowe,et al.
The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer's disease
,
2009,
International Psychogeriatrics.
[5]
Armin Haller,et al.
A Linked Sensor Data Cube for a 100 Year Homogenised Daily Temperature Dataset
,
2012,
SSN.
[6]
Laurent Lefort,et al.
Using CDISC ODM and the RDF Data Cube for the Semantic Enrichment of Longitudinal Clinical Trial Data
,
2012,
SWAT4LS.
[7]
Michael Lawley,et al.
Using Australian Medicines Terminology (AMT) and SNOMED CT-AU to better support clinical research
,
2012,
HIC.
[8]
Joachim Wackerow,et al.
DDI-RDF Discovery Vocabulary: A Metadata Vocabulary for Documenting Research and Survey Data
,
2013,
LDOW.
[9]
Joachim Wackerow,et al.
Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences
,
2012,
Dublin Core Conference.