In many modern applications, no exact values are available to describe the data objects. Instead, the feature values are uncertain, and this uncertainty is modeled by probability distributions rather than by exact feature values. A typical application of such an uncertainty model is the tracking of moving objects, where the exact position of each object can be determined only at discrete points in time. Queries often involve the positions of objects between two such time stamps or after the last known time stamp. The positions are then inherently uncertain unless the pattern of movement is very simple (e.g., linear). One of the most important probability density functions for these applications is the Gaussian, or normal, distribution, which is defined by a mean value and a standard deviation. In this paper, we examine a new type of query on uncertain data objects, called the probability ranking query (PRQ). A PRQ retrieves the k objects that have the highest probability of being located inside a given query area. To speed up probabilistic queries on large sets of uncertain data objects described by Gaussians, we introduce a novel index structure called the Gauss-tree. Furthermore, we provide an algorithm that employs the Gauss-tree to answer PRQs. In our experimental evaluation, we demonstrate that the Gauss-tree achieves a considerable efficiency advantage for PRQs compared to other applicable methods.
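To make the PRQ definition concrete, the following is a minimal sketch of a brute-force baseline, not the Gauss-tree algorithm itself. It assumes that each object is a Gaussian with independent dimensions (one mean and one standard deviation per dimension) and that the query area is an axis-aligned box; the names `prob_in_box` and `probability_ranking_query` are hypothetical and chosen only for illustration.

```python
import heapq
import math
from typing import List, Sequence, Tuple

# Assumed object model: (means, standard deviations), one entry per dimension.
Gaussian = Tuple[Sequence[float], Sequence[float]]


def normal_cdf(x: float, mu: float, sigma: float) -> float:
    """Cumulative distribution function of a univariate normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def prob_in_box(obj: Gaussian, lower: Sequence[float], upper: Sequence[float]) -> float:
    """Probability that the object lies inside the box [lower, upper].

    With independent dimensions (an assumption of this sketch), the joint
    probability is the product of the per-dimension interval probabilities.
    """
    means, sigmas = obj
    p = 1.0
    for mu, sigma, lo, hi in zip(means, sigmas, lower, upper):
        p *= normal_cdf(hi, mu, sigma) - normal_cdf(lo, mu, sigma)
    return p


def probability_ranking_query(
    objects: Sequence[Gaussian],
    lower: Sequence[float],
    upper: Sequence[float],
    k: int,
) -> List[Tuple[float, int]]:
    """Return the k (probability, object index) pairs with the highest probability.

    This is a linear scan over all objects; an index structure would aim to
    avoid evaluating most of them.
    """
    scored = ((prob_in_box(o, lower, upper), i) for i, o in enumerate(objects))
    return heapq.nlargest(k, scored)


if __name__ == "__main__":
    objects = [
        ((0.0, 0.0), (1.0, 1.0)),  # centered in the query box
        ((5.0, 5.0), (1.0, 1.0)),  # far away from the query box
        ((0.5, 0.5), (2.0, 2.0)),  # nearby, but with a wide spread
    ]
    for prob, idx in probability_ranking_query(objects, (-1.0, -1.0), (1.0, 1.0), k=2):
        print(f"object {idx}: P(inside) = {prob:.4f}")
```

The scan evaluates every object once, so its cost grows linearly with the size of the data set; this is the kind of baseline against which an index structure for Gaussians would be compared.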