The domain of existentially uncertain spatial data refers to objects that are modelled using an existential probability accompanying spatial data values. An interesting and challenging query type over existentially uncertain data is the search of the nearest neighbor (NN), since the probability of a potential dataset object to be the NN of the query object depends on the locations and probabilities of other points in the same dataset. In this paper, following a statistical approach, we estimate the average number of the NNsrequired to answer probabilistic thresholding NN(PTNN) queries as function of the threshold t, allowing us to utilize existing approaches and propose a cost model for such queries. Based on the same statistical approach, we propose an efficient algorithm for PTNN queries over arbitrarily structured existentially uncertain spatial data. Our experimental study demonstrates the accuracy and efficiency of the proposed techniques.
[1]
Yufei Tao,et al.
An efficient cost model for optimization of nearest neighbor search in low and medium dimensional spaces
,
2004,
IEEE Transactions on Knowledge and Data Engineering.
[2]
Yufei Tao,et al.
Probabilistic Spatial Queries on Existentially Uncertain Data
,
2005,
SSTD.
[3]
Sridhar Ramaswamy,et al.
Selectivity estimation in spatial databases
,
1999,
SIGMOD '99.
[4]
Yannis Manolopoulos,et al.
R-Trees: Theory and Applications
,
2005,
Advanced Information and Knowledge Processing.
[5]
Nikos Pelekis,et al.
Cost Models and Efficient Query Processing over Existentially Uncertain Spatial Data
,
2008
.
[6]
Hanan Samet,et al.
Distance browsing in spatial databases
,
1999,
TODS.