ETL Framework for Real-Time Business Intelligence over Medical Imaging Repositories

Over the last decades, the volume of medical imaging studies and associated metadata has increased rapidly. Although these studies are mostly used to support medical diagnosis and treatment, many recent initiatives advocate using them not only in clinical research scenarios but also to improve the business practices of medical institutions. However, the continuous production of medical imaging studies, coupled with the tremendous amount of associated data, makes the real-time analysis of medical imaging repositories difficult with conventional tools and methodologies. These archives contain not only the image data itself but also a wide range of valuable metadata describing all the stakeholders involved in the examination. Exploring this data can increase the efficiency and quality of medical practice. In major centers, this represents a big data scenario in which Business Intelligence (BI) and Data Analytics (DA) solutions are rare and, where they exist, are implemented through data warehousing approaches. This article proposes an Extract, Transform, Load (ETL) framework for medical imaging repositories that feeds a purpose-built BI application in real time. The solution was designed to provide the environment needed to conduct research on top of live institutional repositories without requiring the creation of a data warehouse. It features an extensible dashboard with customizable charts and reports, and an intuitive web-based interface that supports novel data mining techniques, namely a variety of data cleansing tools, filters, and clustering functions. As a result, the user is not required to master the programming languages commonly used by data analysts and scientists, such as Python and R.
