Big Data Integration Case Study for Radiology Data Sources

Today's digitized world urgently needs Big Data integration and analysis. Healthcare records are responsible for generating petabytes of data in a single day. Such data is heterogeneous in nature, captured in different files and formats, and varies from hospital to hospital. By integrating data from different sources and extracting meaningful information for the medical community, we can improve the overall quality of patient care. Our research targets the problem of integration for health records. To start, we already developed the Integrated Radiology Image search (IRIS) engine, which could represent a data integration framework for the healthcare domain. IRIS provided support for multiple public data sources and incorporated medical ontologies which would help radiologists and improve search interpretation by considering the meaning of the search query terms. In this paper, we describe a case study of data integration for radiology data sources. While the need for data integration is self-evident, we learned that rather than being a single step, data integration is an iterative process that requires continuous integration of metadata and additional supporting data sources. Our results show that an each step of data integration further improved IRIS engine results.

[1]  Jacob D. Furst,et al.  An Integrated Database and Smart Search Tool for Medical Knowledge Extraction from Radiology Teaching Files , 2017, MIH@KDD.

[2]  M. dos-Santos,et al.  Interactive Radiology teaching file system: The development of a MIRC-compliant and user-centered e-learning resource , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Ossama Tawfik,et al.  Integrating pathology and radiology disciplines: an emerging opportunity? , 2012, BMC Medicine.

[4]  Paul Clough,et al.  ImageCLEF: Experimental Evaluation in Visual Information Retrieval , 2010 .

[5]  Roland Talanow Radiology Teacher: a free, Internet-based radiology teaching file server. , 2009, Journal of the American College of Radiology : JACR.

[6]  Haejun Lee,et al.  Building and Querying RDF/OWL Database of Semantically Annotated Nuclear Medicine Images , 2017, Journal of Digital Imaging.

[7]  Jean Yeh,et al.  Big Data and the Future of Radiology Informatics. , 2016, Academic radiology.

[8]  Gaurav Pandey,et al.  Breast Imaging in the Era of Big Data: Structured Reporting and Data Mining. , 2016, AJR. American journal of roentgenology.

[9]  Garry Gold,et al.  Use of computer databases to reduce radiograph reading errors. , 2007, Journal of the American College of Radiology : JACR.

[10]  Jacob D. Furst,et al.  Augmenting Medical Decision Making With Text-Based Search of Teaching File Repositories and Medical Ontologies: Text-Based Search of Radiology Teaching Files , 2018, Int. J. Knowl. Discov. Bioinform..