Inherent-Cost Aware Collective Spatial Keyword Queries

With the proliferation of spatial-textual data such as location-based services and geo-tagged websites, spatial keyword queries become popular in the literature. One example of these queries is the collective spatial keyword query (CoSKQ) which is to find a set of objects in the database such that it covers a given set of query keywords collectively and has the smallest cost. Some existing cost functions were proposed in the literature, which capture different aspects of the distances among the objects in the set and the query. However, we observe that in some applications, each object has an inherent cost (e.g., workers have monetary costs) which are not captured by any of the existing cost functions. Motivated by this, in this paper, we propose a new cost function called the maximum dot size cost which captures both the distances among objects in a set and a query as existing cost functions do and the inherent costs of the objects. We prove that the CoSKQ problem with the new cost function is NP-hard and develop two algorithms for the problem. One is an exact algorithm which is based on a novel search strategy and employs a few pruning techniques and the other is an approximate algorithm which provides a \(\ln |q{.}\psi |\) approximation factor, where \(|q{.}\psi |\) denotes the number of query keywords. We conducted extensive experiments based on both real datasets and synthetic datasets, which verified our theoretical results and efficiency of our algorithms.

[1]  Cheng Long,et al.  Collective spatial keyword queries: a distance owner-driven approach , 2013, SIGMOD '13.

[2]  Xin Li,et al.  Best Keyword Cover Search , 2015, IEEE Transactions on Knowledge and Data Engineering.

[3]  Anthony K. H. Tung,et al.  Keyword Search in Spatial Databases: Towards Searching by Document , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[4]  Gang Chen,et al.  Efficient Collective Spatial Keyword Query Processing on Road Networks , 2016, IEEE Transactions on Intelligent Transportation Systems.

[5]  Ge Yu,et al.  Clue-based Spatio-textual Query , 2017, Proc. VLDB Endow..

[6]  Naphtali Rishe,et al.  Efficient and Scalable Method for Processing Top-k Spatial Boolean Queries , 2010, SSDBM.

[7]  Naphtali Rishe,et al.  Keyword Search on Spatial Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[8]  Christian S. Jensen,et al.  Finding top-k relevant groups of spatial web objects , 2015, The VLDB Journal.

[9]  Anthony K. H. Tung,et al.  Locating mapped resources in Web 2.0 , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[10]  Shengchao Qin,et al.  Optimal Route Search with the Coverage of Users' Preferences , 2015, IJCAI.

[11]  Xiaokui Xiao,et al.  Keyword-aware Optimal Route Search , 2012, Proc. VLDB Endow..

[12]  Beng Chin Ooi,et al.  Collective spatial keyword querying , 2011, SIGMOD '11.

[13]  B. CI4AZELLE New Upper Bounds for Neighbor Searching , 2005 .

[14]  Beng Chin Ooi,et al.  Efficient Processing of Spatial Group Keyword Queries , 2015, TODS.

[15]  Xiang Cheng,et al.  Group-based collective keyword querying in road networks , 2017, Inf. Process. Lett..

[16]  Christian S. Jensen,et al.  Efficient continuously moving top-k spatial keyword query processing , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[17]  Ken C. K. Lee,et al.  IR-Tree: An Efficient Index for Geographic Document Search , 2011, IEEE Trans. Knowl. Data Eng..

[18]  Jian Pei,et al.  Finding the minimum spatial keyword cover , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[19]  Jun Hu,et al.  SEAL: Spatio-Textual Similarity Search , 2012, Proc. VLDB Endow..

[20]  Christian S. Jensen,et al.  Retrieving Regions of Interest for User Exploration , 2014, Proc. VLDB Endow..

[21]  Anthony K. H. Tung,et al.  Scalable top-k spatial keyword search , 2013, EDBT '13.

[22]  Christian S. Jensen,et al.  A framework for efficient spatial web object retrieval , 2012, The VLDB Journal.

[23]  Tao Guo,et al.  Efficient Algorithms for Answering the m-Closest Keywords Query , 2015, SIGMOD Conference.

[24]  Beng Chin Ooi,et al.  Efficient Spatial Keyword Search in Trajectory Databases , 2012, ArXiv.

[25]  João B. Rocha-Junior,et al.  Top-k spatial keyword queries on road networks , 2012, EDBT '12.

[26]  João B. Rocha-Junior,et al.  Efficient Processing of Top-k Spatial Keyword Queries , 2011, SSTD.

[27]  Christian S. Jensen,et al.  Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects , 2009, Proc. VLDB Endow..

[28]  Panos Kalnis,et al.  User oriented trajectory search for trip recommendation , 2012, EDBT '12.

[29]  Christian S. Jensen,et al.  Joint Top-K Spatial Keyword Query Processing , 2012, IEEE Transactions on Knowledge and Data Engineering.