Retrieving Regions of Interest for User Exploration

We consider an application scenario where points of interest (PoIs) each have a web presence and where a web user wants to identify a region that contains relevant PoIs that are relevant to a set of keywords, e.g., in preparation for deciding where to go to conveniently explore the PoIs. Motivated by this, we propose the length-constrained maximum-sum region (LCMSR) query that returns a spatial-network region that is located within a general region of interest, that does not exceed a given size constraint, and that best matches query keywords. Such a query maximizes the total weight of the PoIs in it w.r.t. the query keywords. We show that it is NP-hard to answer this query. We develop an approximation algorithm with a (5 + e) approximation ratio utilizing a technique that scales node weights into integers. We also propose a more efficient heuristic algorithm and a greedy algorithm. Empirical studies on real data offer detailed insight into the accuracy of the proposed algorithms and show that the proposed algorithms are capable of computing results efficiently and effectively.

[1]  Anthony K. H. Tung,et al.  Keyword Search in Spatial Databases: Towards Searching by Document , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[2]  Ge Yu,et al.  Subject-oriented top-k hot region queries in spatial dataset , 2011, CIKM '11.

[3]  JUSTIN ZOBEL,et al.  Inverted files for text search engines , 2006, CSUR.

[4]  Yufei Tao,et al.  A Scalable Algorithm for Maximizing Range Sum in Spatial Databases , 2012, Proc. VLDB Endow..

[5]  Christian S. Jensen,et al.  Spatial Keyword Querying , 2012, ER.

[6]  João B. Rocha-Junior,et al.  Top-k spatial keyword queries on road networks , 2012, EDBT '12.

[7]  Xing Xie,et al.  A large-scale study on map search logs , 2010, TWEB.

[8]  L. Blume,et al.  The New Palgrave Dictionary of Economics, 2nd edition , 2008 .

[9]  Cheng Long,et al.  Collective spatial keyword queries: a distance owner-driven approach , 2013, SIGMOD '13.

[10]  Naphtali Rishe,et al.  Keyword Search on Spatial Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[11]  David P. Williamson,et al.  A general approximation technique for constrained forest problems , 1992, SODA '92.

[12]  Chen Li,et al.  Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems , 2007, 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007).

[13]  Christian S. Jensen,et al.  Retrieving top-k prestige-based relevant spatial web objects , 2010, Proc. VLDB Endow..

[14]  R. Ravi,et al.  Spanning trees short or small , 1994, SODA '94.

[15]  K. Arrow,et al.  The New Palgrave Dictionary of Economics , 2020 .

[16]  Anthony K. H. Tung,et al.  Locating mapped resources in Web 2.0 , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[17]  Yufei Tao,et al.  Approximate MaxRS in Spatial Databases , 2013, Proc. VLDB Endow..

[18]  Naveen Garg,et al.  A 3-approximation for the minimum tree spanning k vertices , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[19]  Beng Chin Ooi,et al.  Collective spatial keyword querying , 2011, SIGMOD '11.

[20]  Xing Xie,et al.  Hybrid index structures for location-based web search , 2005, CIKM '05.

[21]  Christian S. Jensen,et al.  Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects , 2009, Proc. VLDB Endow..