Parallelizing Shortest Average-Distance Query Processing

The shortest average-distance query (or SAvgDQ) is a novel type of location-based queries, which can be used to provide useful object information by taking into account the spatial closeness of objects to the query object and the neighboring relationship between objects. Due to a large amount of SAvgDQ that need to be evaluated concurrently, the centralized processing system would suffer a heavy query load, leading eventually to poor performance. As a result, in this paper we focus on distributed processing of multiple SAvgDQ using MapReduce platform. We first design a grid structure to manage information of objects, and then develop an algorithm, namely the MapReduce-based SAvgDQ algorithm (or MRSAvgDQ algorithm), to efficiently process SAvgDQ in a distributed manner.

[1]  Yuan-Ko Huang,et al.  Shortest Average-Distance Query on Heterogeneous Neighboring Objects , 2015, IDEAS.

[2]  Cyrus Shahabi,et al.  The spatial skyline queries , 2006, VLDB.

[3]  Walid G. Aref,et al.  SINA: scalable incremental processing of continuous queries in spatio-temporal databases , 2004, SIGMOD '04.

[4]  Reynold Cheng,et al.  Efficient Evaluation of Imprecise Location-Dependent Queries , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[5]  Christian S. Jensen,et al.  Nearest neighbor and reverse nearest neighbor queries for moving objects , 2002, Proceedings International Database Engineering and Applications Symposium.

[6]  A. Prasad Sistla,et al.  Modeling and querying moving objects , 1997, Proceedings 13th International Conference on Data Engineering.

[7]  Yufei Tao,et al.  Time-parameterized queries in spatio-temporal databases , 2002, SIGMOD '02.

[8]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.