Improving efficiency and effectiveness of the image distortion model

The image distortion model (IDM) is a deformation model that is used for computing the (dis-)similarity between images. Therefore it evaluates displacements of individual pixels between two images within a so-called warp range and also takes into account the surrounding pixels (local context). It can be used with a kNN classifier and has shown good retrieval quality in handwritten character recognition as well as in past runs of the medical automatic annotation task of ImageCLEF workshops. However, one of its limitations is computational complexity and the resulting long query response times, that may limit its use for a wider range of applications and for modifications to further improve retrieval quality. In particular an enlarged local context and warp range are candidates for such improvements, but would even further increase computational complexity. In our approach, we therefore apply several optimizations that reduce the retrieval time without degrading the result quality. First, we use an early termination strategy for the individual distance computations which contribute a speedup of a factor of 4.3-4.9. Second, we make efficient use of multithreading. With these extensions, we are able to perform the IDM in less than 1.5s per query on an 8-way server and 16s on a standard Pentium 4 PC without any degradation of retrieval quality compared to the non-optimized version. We extend the possible displacements to an area of 7x7 pixels, using a local context of either 5x5 or 7x7 pixels. The results of the extended IDM have been submitted to the medical automatic annotation task of ImageCLEF 2007 and were ranked in the upper third. More importantly, the used techniques for reducing the execution time are not limited strictly to IDM but are also applicable to other expensive distance measures.

[1]  Heiko Schuldt,et al.  ISIS and OSIRIS: A Process-Based Digital Library Application on Top of a Distributed Process Support Middleware , 2007, DELOS.

[2]  Thomas Martin Deserno,et al.  Automatic medical image annotation in ImageCLEF 2007: Overview, results, and discussion , 2008, Pattern Recognit. Lett..

[3]  Thomas Martin Deserno,et al.  Combining Global features for Content-based Retrieval of Medical Images , 2005, CLEF.

[4]  Martin Ester,et al.  Knowledge Discovery in Databases , 2000 .

[5]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[6]  Hermann Ney,et al.  Local context in non-linear deformation models for handwritten character recognition , 2004, ICPR 2004.

[7]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[8]  Thomas Martin Deserno,et al.  Content-Based Retrieval of Medical Images by Combining Global Features , 2005, CLEF.

[9]  Patrice Y. Simard Efficient Computation of Complex Distance Metrics Using Hierarchical Filtering , 1993, NIPS.

[10]  Thomas Martin Deserno,et al.  Baseline Results for the CLEF 2007 Medical Automatic Annotation Task , 2007, CLEF.

[11]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[12]  Thomas Martin Deserno,et al.  Content-based Queries on the Casimage Database with the IRMA framework: A Field Report , 2004, CLEF.

[13]  Hermann Ney,et al.  Shared-memory parallelization for content-based image retrieval , 2006 .

[14]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[15]  Eugene Kim,et al.  Overview of the ImageCLEFmed 2006 Medical Retrieval and Annotation Tasks , 2006, CLEF.

[16]  Hermann Ney,et al.  FIRE in ImageCLEF 2005: Combining Content-based Image Retrieval with Textual Information Retrieval , 2005, CLEF.

[17]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[18]  Hermann Ney,et al.  Deformation Models for Image Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Hermann Ney,et al.  Statistical framework for model-based image retrieval in medical applications , 2003, J. Electronic Imaging.