Toward Improved Ranking Metrics

In many computer vision algorithms, a metric or similarity measure is used to determine the distance between two features. The Euclidean or SSD (sum of the squared differences) metric is prevalent and justified from a maximum likelihood perspective when the additive noise distribution is Gaussian. Based on real noise distributions measured from international test sets, we have found that the Gaussian noise distribution assumption is often invalid. This implies that other metrics, which have distributions closer to the real noise distribution, should be used. In this paper, we consider three different applications: content-based retrieval in image databases, stereo matching, and motion tracking. In each of them, we experiment with different modeling functions for the noise distribution and compute the accuracy of the methods using the corresponding distance measures. In our experiments, we compared the SSD metric, the SAD (sum of the absolute differences) metric, the Cauchy metric, and the Kullback relative information. For several algorithms from the research literature which used the SSD or SAD, we showed that greater accuracy could be obtained by using the Cauchy metric instead.

[1]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Chung-Sheng Li,et al.  Image matching by means of intensity and texture matching in the Fourier domain , 1996, Electronic Imaging.

[3]  Ingemar J. Cox,et al.  A Maximum Likelihood Stereo Algorithm , 1996, Comput. Vis. Image Underst..

[4]  J. Deleeuw,et al.  Introduction to Akaike (1973) Information Theory and an Extension of the Maximum Likelihood Principle , 1992 .

[5]  Shree K. Nayar,et al.  Ordinal Measures for Image Correspondence , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  W. Eric L. Grimson,et al.  Computational Experiments with a Feature Based Stereo Algorithm , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[8]  Michael J. Black Robust incremental optical flow , 1992 .

[9]  Patrick M. Kelly,et al.  CANDID: comparison algorithm for navigating digital image databases , 1994, Seventh International Working Conference on Scientific and Statistical Database Management.

[10]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[11]  Harpreet Sawhney,et al.  Efficient color histogram indexing , 1994, Proceedings of 1st International Conference on Image Processing.

[12]  Arnold W. M. Smeulders,et al.  Color-based object recognition , 1997, Pattern Recognit..

[13]  共立出版株式会社 コンピュータ・サイエンス : ACM computing surveys , 1978 .

[14]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[15]  Michael S. Lew,et al.  Efficient content-based image retrieval in digital picture collections using projections: (near)-copy location , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[16]  Ingemar J. Cox,et al.  An Analysis of Camera Noise , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Martin A. Fischler,et al.  Computational Stereo , 1982, CSUR.

[18]  Michael S. Lew,et al.  Quality Measures for Interactive Image Retrieval with a Performance Evaluation of Two 3x3 Texel-based Methods , 1997, ICIAP.

[19]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[20]  Don R. Hush,et al.  Query by image example: The CANDID approach , 1995 .

[21]  Nicu Sebe,et al.  Which ranking metric is optimal? With applications in image retrieval and stereo matching , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[22]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[23]  T. Poggio,et al.  A computational theory of human stereo vision , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[24]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[25]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[26]  D Marr,et al.  A computational theory of human stereo vision. , 1979, Proceedings of the Royal Society of London. Series B, Biological sciences.

[27]  Emanuele Trucco,et al.  Efficient stereo with multiple windowing , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  H. Maitre,et al.  Using surface model to correct and fit disparity data in stereo vision , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[29]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[30]  Thomas S. Huang,et al.  Learning and Feature Selection in Stereo Matching , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Patrick M. Kelly,et al.  Efficiency issues related to probability density function comparison , 1996, Electronic Imaging.