Unsupervised scaling of multi-descriptor similarity functions for medical image datasets

Content-based search has proven to be a proper complement to textual queries over medical image databases. In many applications, employing multiple image descriptors and combining the respective distance functions using adequate scale factors improves the retrieval accuracy. However, the existing weighting methods are either exhaustive or supervised. In this paper, we present the Fractal-scaled Product Metric, an unsupervised method to determine a scale factor among features in multi-descriptor image similarity assessment based on the Fractal Theory. The composite distance function obtained is not limited to dimensional image descriptors and enables using scalable indexing structures. Experiments have shown that the proposed method determines near-optimal scale factors for the descriptors involved, and always improves the precision of the results, outperforming the individual descriptors up to 31% on the average precision.

[1]  Christos Faloutsos,et al.  Distance Exponent: A New Concept for Selectivity Estimation in Metric Trees , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[2]  Weiguo Fan,et al.  Image Retrieval with Relevance Feedback based on Genetic Programming , 2008, SBBD.

[3]  Mario A. Nascimento,et al.  MiCRoM: A Metric Distance to Compare Segmented Images , 2002, VISUAL.

[4]  Thomas Martin Deserno,et al.  Ontology of Gaps in Content-Based Image Retrieval , 2009, Journal of Digital Imaging.

[5]  Clement T. Yu,et al.  Techniques and Systems for Image and Video Retrieval , 1999, IEEE Trans. Knowl. Data Eng..

[6]  Agma J. M. Traina,et al.  Assessing the best integration between distance-function and image-feature to answer similarity queries , 2008, SAC '08.

[7]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[8]  Christos Faloutsos,et al.  Beyond uniformity and independence: analysis of R-trees using the concept of fractal dimension , 1994, PODS.

[9]  Daniel A. Keim,et al.  Automatic selection and combination of descriptors for effective 3D similarity search , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[10]  Weiguo Fan,et al.  Relevance feedback based on genetic programming for image retrieval , 2011, Pattern Recognit. Lett..

[11]  Hanan Samet,et al.  Index-driven similarity search in metric spaces (Survey Article) , 2003, TODS.

[12]  Antoine Geissbühler,et al.  A Review of Content{Based Image Retrieval Systems in Medical Applications { Clinical Bene(cid:12)ts and Future Directions , 2022 .

[13]  Agma J. M. Traina,et al.  Efficient Content-Based Image Retrieval through Metric Histograms , 2003, World Wide Web.

[14]  Christos Faloutsos,et al.  Fast feature selection using fractal dimension , 2010, J. Inf. Data Manag..

[15]  Agma J. M. Traina,et al.  Time-Aware Similarity Search: A Metric-Temporal Representation for Complex Data , 2009, SSTD.

[16]  Fabio A. González,et al.  Design of a Medical Image Database with Content-Based Retrieval Capabilities , 2007, PSIVT.

[17]  Stefan M. Rüger,et al.  Combining Features for Content-Based Sketch Retrieval - A Comparative Evaluation of Retrieval Performance , 2002, ECIR.