Database query optimates the selectivety of a query to find the most efficient access plan. Multi-dimensional selectivity estimation technique is required for a query with multiple attributes because the attributes are not independent each other. Histogram is practically used in most commercial database products because it approximates data distributions with small overhead and small error rates. However, histogram is inadequate for a query with multiple attributes because it incurs high storage overhead and high error rates. In this paper, we propose a novel method for multi-dimentional selectivity estimation. Compressed information from a large number of small-sized histogram buckets is maintained using the discrete cosine transform. This enables low error rates and low storage overheads even in high dimensions. Extensive experimental results show adventages of the proposed approach.
[1]
Michael V. Mannino,et al.
Statistical profile estimation in database systems
,
1988,
CSUR.
[2]
Naphtali Rishe,et al.
An instant and accurate size estimation method for joins and selections in a retrieval-intensive environment
,
1993,
SIGMOD '93.
[3]
Aidong Zhang,et al.
Efficient resource selection in distributed visual information systems
,
1997,
MULTIMEDIA '97.
[4]
Nick Roussopoulos,et al.
Adaptive selectivity estimation using query feedback
,
1994,
SIGMOD '94.
[5]
Peter J. Haas,et al.
Improved histograms for selectivity estimation of range predicates
,
1996,
SIGMOD '96.
[6]
Jeffrey Scott Vitter,et al.
Data cube approximation and histograms via wavelets
,
1998,
CIKM '98.