Most color indexing techniques proposed in the literature are similar: images are represented by color histograms, and a metric on the color histogram space is used to determine the similarity of images. In this paper we determine the limits of these color indexing techniques. We propose two functions to measure the discrimination power of indexing techniques: the capacity (how many distinguishable histograms can be stored) and the maximal match number (the maximal number of retrieved images). We derive bounds for these functions. These bounds have two practical aspects. First, they help a user to decide whether color histograms effectively index database images from a given domain. Second, they facilitate the choice of a good threshold for the distance below which histograms are considered similar. Our arguments are based on an analysis of the metrical properties of the histogram space and results from coding theory. The results show that over a large range of reasonable parameters the capacity is very large. Thus, the set of parameters for which color indexing works well can be described as the set of parameters for which the maximal match number is below an application-dependent maximum.
[1]
John P. Oakley,et al.
Storage and Retrieval for Image and Video Databases
,
1993
.
[2]
Markus A. Stricker,et al.
Color and Geometry as Cues for Indexing
,
1992
.
[3]
Brian V. Funt,et al.
Color Constant Color Indexing
,
1995,
IEEE Trans. Pattern Anal. Mach. Intell..
[4]
N. J. A. Sloane,et al.
Bounds for binary codes of length less than 25
,
1978,
IEEE Trans. Inf. Theory.
[5]
Drew McDermott,et al.
Image signatures for place recognition and map construction
,
1992,
Other Conferences.
[6]
Michael J. Swain,et al.
Interactive indexing into image databases
,
1993,
Electronic Imaging.
[7]
D. A. Robertson,et al.
THE UNIVERSITY OF CHICAGO.
,
1907,
Science.
[8]
N. J. A. Sloane,et al.
Lower bounds for constant weight codes
,
1980,
IEEE Trans. Inf. Theory.