Relational Databases versus Search Engines: A Performance Comparison for Storing and Querying DICOM Metadata

The Digital Imaging and Communications in Medicine (DICOM) standard adopts files as individual, self-contained repositories for the storage of a mixed of alphanumerical and binary content regarding radiological images. Usually, groups of DICOM files are hierarchically organized in studies and series, physically disposed into filesystem directory trees. Despite its simplicity in storing content, ordinary filesystems do not provide index capabilities allowing searches by content – restricting access by directory names and file names. To surpass such limitation, Picture Archiving and Communication Systems (PACSs) often adopt Relational Database Management Systems (RDBMSs) as metadata repositories, benefiting from its general-purposed index structures. An alternative approach, not quite explored, considers the adoption of search engines as metadata catalogs, aiming to minimize the search time by exploring the engine’s index optimizations. In order to evaluate the performance on managing DICOM metadata, this work compares relational database instances to a search engine in terms of storage space, storage time, and query time. Results show that, in the best case, the search engine is slightly slower in storing content; however, it requires 69% less disk space for the same dataset. For queries, in turn, the search engine performs up to 8.3 times faster in retrieving groups of tags.

[1]  Oleg S. Pianykh,et al.  Digital Imaging and Communications in Medicine : A Practical Introduction and Survival Guide , 2008 .

[2]  Jeffrey L Sunshine,et al.  Picture Archiving and Communication System (PACS) implementation, integration & benefits in an integrated health system. , 2012, Academic radiology.

[3]  Simón J. Rascovsky,et al.  Use of CouchDB for Document-based Storage of DICOM Objects1 , 2012 .

[4]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[5]  H. K. Huang,et al.  Short history of PACS. Part I: USA. , 2011, European journal of radiology.

[6]  D. Peck Digital Imaging and Communications in Medicine (DICOM): A Practical Introduction and Survival Guide , 2009, Journal of Nuclear Medicine.

[7]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[8]  S. Esakkirajan,et al.  Fundamentals of relational database management systems , 2007 .

[9]  Heinz U Lemke,et al.  Short history of PACS (Part II: Europe). , 2011, European journal of radiology.

[10]  Aldo von Wangenheim,et al.  DCMDSM: a DICOM decomposed storage model , 2014, J. Am. Medical Informatics Assoc..

[11]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[12]  Steven C. Horii,et al.  Review: Understanding and Using DICOM, the Data Interchange Standard for Biomedical Imaging , 1997, J. Am. Medical Informatics Assoc..

[13]  Rajendra Singh,et al.  Standardization in digital pathology: Supplement 145 of the DICOM standards , 2011, Journal of pathology informatics.

[14]  José Luís Oliveira,et al.  Medical imaging archiving: A comparison between several NoSQL solutions , 2014, IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI).

[15]  Davide Caramella,et al.  The future of PACS in healthcare enterprises. , 2011, European journal of radiology.

[16]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[17]  F. E. A Relational Model of Data Large Shared Data Banks , 2000 .

[18]  P. Mildenberger,et al.  Introduction to the DICOM standard , 2002, European Radiology.