Complexity of estimating multi-way join result sizes for area skewed spatial data

In real-world environments, spatial data is highly skewed. Spatial data exhibits two kinds of skew: placement skew and area skew. This paper presents methods for estimating the result size of multi-way joins over area-skewed spatial data and analyzes the complexity of that estimation. In particular, it describes the number and kinds of statistics that the optimizer must maintain in order to calculate the multi-way join result size.
