Efficiently Indexing Multiple Repositories of Medical Image Databases

Performing content-based image retrieval over large repositories of medical images demands efficient computational techniques. The use of such techniques is intended to speed up the work of physicians, who often have to deal with information from multiple data repositories. When dealing with multiple data repositories, the common computational approach is to search each repository separately and merge the multiple results into one final response, which slows down the whole process. This can be improved if we build a mechanism able to search several repositories as if they were a single one, i.e. a mechanism to search the whole domain of medical images. Aiming at this goal, we propose the Domain Index, a new category of index structures aimed at efficiently searching domains of data, regardless of the repository to which they belong. To evaluate our proposal, we carried out experiments over multiple mammography repositories involving k Nearest Neighbor (kNN) and Range queries. The results show that images from any repository are seamlessly retrieved, even sustaining gains in performance of up to 36% in kNN queries and up to 7% in Range queries. The experimental evaluation shows that the Domain Index allows fast retrieval from multiple data repositories for medical systems, allowing a better performance in similarity queries over them.

[1]  Agma J. M. Traina,et al.  A new method to efficiently reduce histogram dimensionality , 2008, SPIE Medical Imaging.

[2]  Agma J. M. Traina,et al.  Feature Extraction and Selection for Decision Making , 2010 .

[3]  David Dagan Feng,et al.  A content-based image retrieval framework for multi-modality lung images , 2010, 2010 IEEE 23rd International Symposium on Computer-Based Medical Systems (CBMS).

[4]  Agma J. M. Traina,et al.  MedFMI-SiR: A Powerful DBMS Solution for Large-Scale Medical Image Retrieval , 2011, ITBAM.

[5]  Caetano Traina,et al.  One index to dominate them all: domain indexes for improving queries across multiple tables , 2017, SAC.

[6]  Thomas Martin Deserno,et al.  Content-Based Image Retrieval in Medicine: Retrospective Assessment, State of the Art, and Future Directions , 2009, Int. J. Heal. Inf. Syst. Informatics.

[7]  Hermann Ney,et al.  Features for image retrieval: an experimental comparison , 2008, Information Retrieval.

[8]  T M Lehmann,et al.  Content-based Image Retrieval in Medical Applications , 2004, Methods of Information in Medicine.

[9]  Richard H. Moore,et al.  THE DIGITAL DATABASE FOR SCREENING MAMMOGRAPHY , 2007 .

[10]  Ian W. Ricketts,et al.  The Mammographic Image Analysis Society digital mammogram database , 1994 .

[11]  Agma J. M. Traina,et al.  MedInject: A General-Purpose Information Retrieval Framework Applied in a Medical Context , 2014, 2014 IEEE 27th International Symposium on Computer-Based Medical Systems.

[12]  L. Rodney Long,et al.  SPIRS: A Web-based image retrieval system for large biomedical databases , 2009, Int. J. Medical Informatics.

[13]  Christos Faloutsos,et al.  Fast Indexing and Visualization of Metric Data Sets using Slim-Trees , 2002, IEEE Trans. Knowl. Data Eng..

[14]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[15]  L. Rodney Long,et al.  Spine X-ray image retrieval using partial vertebral boundaries , 2011, 2011 24th International Symposium on Computer-Based Medical Systems (CBMS).