The Earth Mover's Distance as a Metric for Image Retrieval

We investigate the properties of a metric between two distributions, the Earth Mover's Distance (EMD), for content-based image retrieval. The EMD is based on the minimal cost that must be paid to transform one distribution into the other, in a precise sense, and was first proposed for certain vision problems by Peleg, Werman, and Rom. For image retrieval, we combine this idea with a representation scheme for distributions that is based on vector quantization. This combination leads to an image comparison framework that often accounts for perceptual similarity better than other previously proposed methods. The EMD is based on a solution to the transportation problem from linear optimization, for which efficient algorithms are available, and also allows naturally for partial matching. It is more robust than histogram matching techniques, in that it can operate on variable-length representations of the distributions that avoid quantization and other binning problems typical of histograms. When used to compare distributions with the same overall mass, the EMD is a true metric. In this paper we focus on applications to color and texture, and we compare the retrieval performance of the EMD with that of other distances.

[1]  Periodicity. , 1862, Medical critic and psychological journal.

[2]  F. L. Hitchcock The Distribution of a Product from Several Sources to Numerous Localities , 1941 .

[3]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[4]  W D Wright,et al.  Color Science, Concepts and Methods. Quantitative Data and Formulas , 1967 .

[5]  Gunther Wyszecki,et al.  Color Science: Concepts and Methods, Quantitative Data and Formulae, 2nd Edition , 2000 .

[6]  Edward J. Russell,et al.  Letters to the Editor - Extension of Dantzig's Algorithm to Finding an Initial Near-Optimal Basis for the Transportation Problem , 1969, Oper. Res..

[7]  V. Klee,et al.  HOW GOOD IS THE SIMPLEX ALGORITHM , 1970 .

[8]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[9]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[10]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[11]  A. Tversky Features of Similarity , 1977 .

[12]  G. Wyszecki,et al.  Color Science Concepts and Methods , 1982 .

[13]  Helen C. Shen,et al.  Generalized texture representation and metric , 1983, Comput. Vis. Graph. Image Process..

[14]  Narendra Karmarkar,et al.  A new polynomial-time algorithm for linear programming , 1984, Comb..

[15]  Azriel Rosenfeld,et al.  A distance metric for multidimensional histograms , 1985, Comput. Vis. Graph. Image Process..

[16]  S. Rachev The Monge–Kantorovich Mass Transference Problem and Its Stochastic Applications , 1985 .

[17]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[18]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[19]  Nasser M. Nasrabadi,et al.  Image coding using vector quantization: a review , 1988, IEEE Trans. Commun..

[20]  Michael Werman,et al.  A Unified Approach to the Change of Resolution: Space and Gray-Level , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  K. Zikan The theory and applications of algebraic metric spaces , 1990 .

[22]  Wilson S. Geisler,et al.  Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Anil K. Jain,et al.  A multi-channel filtering approach to texture segmentation , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Thomas M. Cover,et al.  Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .

[25]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[26]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[27]  Josef Bigün,et al.  N-folded Symmetries by Complex Moments in Gabor Space and their Application to Unsupervised Texture Segmentation , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[29]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Charles A. Poynton,et al.  A technical introduction to digital video , 1996 .

[31]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Fang Liu,et al.  Periodicity, Directionality, and Randomness: Wold Features for Image Modeling and Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Tolga Bozkaya,et al.  Distance-based indexing for high-dimensional metric spaces , 1997, SIGMOD '97.

[34]  Joachim M. Buhmann,et al.  Non-parametric similarity measures for unsupervised texture segmentation and image retrieval , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[35]  Bruce A. Draper,et al.  FOCUS: Searching for multi-colored objects in a diverse image database , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  Kenneth L. Clarkson,et al.  Nearest Neighbor Queries in Metric Spaces , 1997, STOC '97.

[37]  Z. Meral Özsoyoglu,et al.  Distance-based indexing for high-dimensional metric spaces , 1997, SIGMOD '97.

[38]  Carlo Tomasi,et al.  Texture metrics , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[39]  Jitendra Malik,et al.  Color- and texture-based image segmentation using EM and its application to content-based image retrieval , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[40]  Carlo Tomasi,et al.  Perceptual metrics for image database navigation , 1999 .

[41]  Carlo Tomasi,et al.  The Earth Mover’s Distance , 2001 .

[42]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.