A multi-step strategy for approximate similarity search in image databases

Many strategies for similarity search in image databases assume a metric and quadratic form-based similarity model where an optimal lower bounding distance function exists for filtering. These strategies are mainly two-step, with the inital "filter" step based on a spatial or metric access method followed by a "refine" step employing expensive computation. Recent research on robust matching methods for computer vision has discovered that similarity models behind human visual judgment are inherently non-metric. When applying such models to similarity search in image databases, one has to address the problem of nonmetric distance functions that might to have an optimal lower bound for filtering. Here, we propose a novel three-step "prune-filter-refine" strategy for approximate similarity search on these models. First, the "prune" step adopts a spatial access method to roughly eliminate improbable matches via an adjustable distance threshold. Second, the "filter" step uses a quasi lower-bounding distance derived from the non-metric distance function of the similarity model. Third, the "refine" stage compares the query with the remaining candidates by a robust matching method for final ranking. Experimental results confirmed that the proposed strategy achieves more filtering than a two-step approach with close to no false drops in the final result.

[1]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[2]  Herbert Janssen,et al.  Scale-invariant image recognition based on higher-order autocorrelation features , 1996, Pattern Recognit..

[3]  Keisuke Kameyama,et al.  Approximate Query Processing for a Content-Based Image Retrieval Method , 2003, DEXA.

[4]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Westone,et al.  Home Page , 2004, 2022 2nd International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA).

[6]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[7]  Hans-Peter Kriegel,et al.  A Multistep Approach for Shape Similarity Search in Image Databases , 1998, IEEE Trans. Knowl. Data Eng..

[8]  Christos Faloutsos,et al.  Fast Nearest Neighbor Search in Medical Image Databases , 1996, VLDB.

[9]  Keisuke Kameyama,et al.  On a relaxation-labeling algorithm for real-time contour-based image similarity retrieval , 2003, Image Vis. Comput..

[10]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[11]  Z. Meral Özsoyoglu,et al.  Distance-based indexing for high-dimensional metric spaces , 1997, SIGMOD '97.

[12]  Hideyo Nagashima,et al.  A Classification for Trademark Images Using the Auto-correlation Function Graph Figure , 2003 .

[13]  Marco Patella,et al.  Searching in metric spaces with user-defined and approximate distances , 2002, TODS.

[14]  Takio Kurita,et al.  A face recognition method using higher order local autocorrelation and multivariate analysis , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[15]  Jon Louis Bentley,et al.  Data Structures for Range Searching , 1979, CSUR.

[16]  Douglas Comer,et al.  Ubiquitous B-Tree , 1979, CSUR.

[17]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[18]  John A. McLaughlin,et al.  Nth-Order Autocorrelations in Pattern Recognition , 1968, Inf. Control..

[19]  Daphna Weinshall,et al.  Classification with Nonmetric Distances: Image Retrieval and Class Representation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Connolly,et al.  Database Systems , 2004 .