Scalable Algorithms for Maximizing Spatiotemporal Range Sum and Range Sum Change in Spatiotemporal Datasets

In this paper, we introduce the three-dimensional Maximum Range-Sum (3D MaxRS) problem and the Maximum Spatiotemporal Range-Sum Change (MaxStRSC) problem. The 3D MaxRS problem tries to find the 3D range where the sum of weights across all objects inside is maximized, and the MaxStRSC problem tries to find the spatiotemporal range where the sum of weights across all objects inside is maximally increased. The goal of this paper is to provide efficient methods for data analysts to find interesting spatiotemporal regions in a large historical spatiotemporal dataset by addressing two problems. We provide a mathematical explanation for each problem and propose several algorithms for them. Existing methods tried to find the optimal region over two-dimensional datasets or to monitor a burst region over two-dimensional data streams. The majority of them cannot directly solve our problems. Although some existing methods can be used or modified to solve the 3D MaxRS problems, they have limited scalability. In addition, none of them can be used to solve the MaxStRS-RC problem (a type of MaxStRSC problem). Finally, we study the performance of the proposed algorithms experimentally. The experimental results show that the proposed algorithms are scalable and much more efficient than existing methods.

[1]  Takahiro Hara,et al.  A General Framework for MaxRS and MaxCRS Monitoring in Spatial Data Streams , 2017, ACM Trans. Spatial Algorithms Syst..

[2]  Yufei Tao,et al.  Maximizing Range Sum in External Memory , 2014, TODS.

[3]  Raymond Chi-Wing Wong,et al.  Rotating MaxRS queries , 2015, Inf. Sci..

[4]  Goce Trajcevski,et al.  Class-based Conditional MaxRS Query in Spatial Data Streams , 2017, SSDBM.

[5]  Sharad Mehrotra,et al.  Progressive approximate aggregate queries with a multi-resolution tree structure , 2001, SIGMOD '01.

[6]  Reynold Xin,et al.  Apache Spark , 2016 .

[7]  Yufei Tao,et al.  A Scalable Algorithm for Maximizing Range Sum in Spatial Databases , 2012, Proc. VLDB Endow..

[8]  Mario A. López,et al.  STR: a simple and efficient algorithm for R-tree packing , 1997, Proceedings 13th International Conference on Data Engineering.

[9]  Chin-Wan Chung,et al.  Indexing range sum queries in spatio-temporal databases , 2007, Inf. Softw. Technol..

[10]  Rudolf Bayer,et al.  Organization and maintenance of large ordered indexes , 1972, Acta Informatica.

[11]  Nick Koudas,et al.  Identifying, attributing and describing spatial bursts , 2010, Proc. VLDB Endow..

[12]  Hans-Joachim Lenz,et al.  The R/sub a/*-tree: an improved R*-tree with materialized data for supporting range queries on OLAP-data , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[13]  Subhas C. Nandy,et al.  A unified algorithm for finding maximum and minimum object enclosing rectangles and cuboids , 1995 .

[14]  Yufei Tao,et al.  Historical spatio-temporal aggregation , 2005, TOIS.

[15]  Douglas Comer,et al.  Ubiquitous B-Tree , 1979, CSUR.

[16]  Yufei Tao,et al.  Approximate MaxRS in Spatial Databases , 2013, Proc. VLDB Endow..