Diamond: multi-dimensional indexing technique for medical images retrieval using vertical fragmentation approach

Over the last decade, a huge number of medical visual data are widely used for diagnose, treatment, and follow-up. Retrieving needed medical image(s) from a huge number of images is one of the most widely used features in medical information systems, especially in medical image search engines. Indexing as part of search engines (or information retrieval systems), increases the speed (efficiency) of search and the information retrieval process. In this paper, a multidimensional indexing technique for medical images is presented that can improve effectiveness and efficiency of medical image search engines. The structure of the proposed multi-dimensional indexing technique and its main operations (i.e., creation, insertion, deletion and search) is designed and evaluated. In order to create this multidimensional index, the "vertical fragmentation" approach (which is usually applied for distributed database design) is used to determine the each of dimensions; roughly speaking, dimensions are different aspects of the medical images for a/some information need (e.g., image type and format, type of disorder, etc.). Accordingly, data structure of the proposed multidimensional indexing technique (which is named "Diamond") is formed by using the vertical fragmentation of medical image attributes (to differentiate the dimensions), and then using agglomerative hierarchical clustering to build up the hierarchy in each dimension. The proposed multi-dimensional indexing technique is implemented using the open-source search engine Lucene and compared with the built-in indexing technique available in the Lucene search engine, and also with the Terrier Platform (available for the benchmarking of information retrieval systems). In this evaluation, efficiency and effectiveness measures of the proposed multidimensional indexing technique (Diamond) are evaluated experimentally, beside the analysis of the designed data structure and its operations. For the experimental evaluation data set, images from Tabriz Behbood Hospital and a subset of TCIA images were used. Experimental evaluation results show that Diamond, the proposed multidimensional indexing technique improves both efficiency and effectiveness for a medical image search engine.

[1]  C. Krishna Mohan,et al.  Content based medical image retrieval using dictionary learning , 2015, Neurocomputing.

[2]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[3]  Jinho Lee,et al.  MIRE: a multidimensional information retrieval engine for structured data and text , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[4]  Andrei Voronkov,et al.  On the Evaluation of Indexing Techniques for Theorem Proving , 2001, IJCAR.

[5]  Christian Böhm,et al.  Multidimensional Index Structures in Relational Databases , 2000, Journal of Intelligent Information Systems.

[6]  Saeide Habibi Asl,et al.  Medical image retrieval approaches, methods and systems: A systematic review , 2016 .

[7]  Frank S. C. Tseng,et al.  D-Tree: A Multi-Dimensional Indexing Structure for Constructing Document Warehouses , 2006, J. Inf. Sci. Eng..

[8]  JUSTIN ZOBEL,et al.  Inverted files for text search engines , 2006, CSUR.

[9]  Emanuele Della Valle,et al.  Web Information Retrieval , 2013, Data-Centric Systems and Applications.

[10]  Degui Xiao,et al.  Medical Image Retrieval: A Multimodal Approach , 2014, Cancer informatics.

[11]  Randy L. Gollub,et al.  High Throughput Tools to Access Images from Clinical Archives for Research , 2014, Journal of Digital Imaging.

[12]  Aini Hussain,et al.  Image retrieval system for medical applications , 2015, 2015 IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE).

[13]  Thomas Martin Deserno,et al.  Evaluation axes for medical image retrieval systems: the imageCLEF experience , 2005, ACM Multimedia.

[14]  Henning Müller,et al.  Using Medline Queries to Generate Image Retrieval Tasks for Benchmarking , 2008, MIE.

[15]  Sri Guru,et al.  Comparison Between K-Mean and Hierarchical Algorithm Using Query Redirection , 2013 .

[16]  Habiba Drias,et al.  Towards a Multidimensional Information Retrieval , 2015, WorldCIST.

[17]  Marjan Kuchaki Rafsanjani,et al.  A Survey Of Hierarchical Clustering Algorithms , 2012 .

[18]  Sunil Kumar,et al.  A new approach for effective retrieval and indexing of medical images , 2019, Biomed. Signal Process. Control..

[19]  Gareth J. F. Jones,et al.  An analysis of evaluation campaigns in ad-hoc medical information retrieval: CLEF eHealth 2013 and 2014 , 2018, Information Retrieval Journal.

[20]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[21]  Teh Ying Wah,et al.  A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data , 2015, PloS one.

[22]  Arambam Neelima,et al.  An overview of approaches for content-based medical image retrieval , 2017, International Journal of Multimedia Information Retrieval.

[23]  Hansaem Park,et al.  Agglomerative Hierarchical Clustering for Information Retrieval Using Latent Semantic Index , 2015, 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity).

[24]  Haejun Lee,et al.  Medical Image Retrieval: Past and Present , 2012, Healthcare informatics research.

[25]  Patrick Valduriez,et al.  Principles of Distributed Database Systems, Third Edition , 2011 .

[26]  Jyoti Prakash,et al.  Precision and Relative Recall of Search Engines: A Comparative Study of Google and Yahoo , 2009 .

[27]  Fatemeh Abdi,et al.  Answering ad-hoc continuous aggregate queries over data streams using Dynamic Prefix Aggregate Tree , 2016, Intell. Data Anal..

[28]  David Dagan Feng,et al.  Content-Based Medical Image Retrieval: A Survey of Applications to Multidimensional and Multimodality Data , 2013, Journal of Digital Imaging.

[29]  Suhap Şahin,et al.  Comparison of Hierarchical and Non-Hierarchical Clustering Algorithms , 2017 .

[30]  Radha Appan,et al.  Exploring Health Information Exchange (HIE) Through Collaboration Framework: Normative Guidelines for IT Leadership of Healthcare Organizations , 2017, Inf. Syst. Manag..

[31]  Marjan Laal,et al.  Innovation Process in Medical Imaging , 2013 .

[32]  Rudolf Hanka,et al.  A review of intelligent content-based indexing and browsing of medical images , 1999 .