Probabilistic Data Modeling and Querying for Location-Based Data Warehouses

Motivated by the increasing need to handle complex, dynamic, uncertain multidimensional data in location-based warehouses, this paper proposes a novel probabilistic data model that can address the complexities of such data. The model provides a foundation for handling complex hierarchical and uncertain data, e.g., data from the location-based services domain such as transportation infrastructures and the attached static and dynamic content such as speed limits and vehicle positions. The paper also presents algebraic operators that support querying of such data. Use of pre-aggregation for implementation of the operators is also discussed. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services.

[1]  Markus Schneider,et al.  A foundation for representing and querying moving objects , 2000, TODS.

[2]  Jimeng Sun,et al.  Querying about the past, the present, and the future in spatio-temporal databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[3]  Sunil Prabhakar,et al.  Evaluating probabilistic queries over imprecise data , 2003, SIGMOD '03.

[4]  B. R. Moole A probabilistic multidimensional data model and algebra for OLAP in decision support systems , 2003, IEEE SoutheastCon, 2003. Proceedings..

[5]  Jeffrey Considine,et al.  Spatio-temporal aggregation using sketches , 2004, Proceedings. 20th International Conference on Data Engineering.

[6]  Goce Trajcevski,et al.  Probabilistic range queries in moving objects databases with uncertainty , 2003, MobiDe '03.

[7]  Torben Bach Pedersen,et al.  Integrated Data Management for Mobile Services in the Real World , 2003, VLDB.

[8]  Christian S. Jensen,et al.  Computational data modeling for network-constrained moving objects , 2003, GIS '03.

[9]  Anthony C. Klug Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions , 1982, JACM.

[10]  Ouri Wolfson,et al.  The Geometry of Uncertainty in Moving Objects Databases , 2002, EDBT.

[11]  Reed Jacobson,et al.  Microsoft SQL Server(TM) 2005 Analysis Services Step by Step , 2006 .

[12]  Walter L. Smith Probability and Statistics , 1959, Nature.

[13]  Nectaria Tryfona,et al.  Pre-aggregation in Spatial Data Warehouses , 2001, SSTD.

[14]  Curtis E. Dyreson,et al.  Information Retrieval from an Incomplete Data Cube , 1996, VLDB.

[15]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[16]  Yufei Tao,et al.  Query Processing in Spatial Network Databases , 2003, VLDB.

[17]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[18]  Erol Gelenbe,et al.  A probability model of uncertainty in data bases , 1986, 1986 IEEE Second International Conference on Data Engineering.

[19]  Christian S. Jensen,et al.  Indexing the positions of continuously moving objects , 2000, SIGMOD '00.

[20]  Hector Garcia-Molina,et al.  The Management of Probabilistic Data , 1992, IEEE Trans. Knowl. Data Eng..

[21]  Dimitrios Gunopulos,et al.  Efficient aggregation over objects with extent , 2002, PODS '02.

[22]  K S Opiela A GENERIC DATA MODEL FOR LINEAR REFERENCING SYSTEMS , 1997 .

[23]  PedersenTorben Bach,et al.  Multidimensional data modeling for location-based services , 2004, VLDB 2004.

[24]  Dan Suciu,et al.  Efficient query evaluation on probabilistic databases , 2004, The VLDB Journal.

[25]  Christopher Murray Oracle spatial user guide and reference , 2002 .

[26]  Torben Bach Pedersen,et al.  Capturing complex multidimensional data in location-based data warehouses , 2004, GIS '04.

[27]  Dimitrios Gunopulos,et al.  Temporal and spatio-temporal aggregations over data streams using multiple time granularities , 2003, Inf. Syst..

[28]  Sunil Prabhakar,et al.  Querying imprecise data in moving object environments , 2003, IEEE Transactions on Knowledge and Data Engineering.

[29]  Torben Bach Pedersen,et al.  Multidimensional data modeling for location-based services , 2002, GIS '02.