Reverse nearest neighbor heat maps: A tool for influence exploration

We study the problem of constructing a reverse nearest neighbor (RNN) heat map by finding the RNN set of every point in a two-dimensional space. Based on the RNN set of a point, we obtain a quantitative influence (i.e., heat) for the point. The heat map provides a global view on the influence distribution in the space, and hence supports exploratory analyses in many applications such as marketing and resource management. To construct such a heat map, we first reduce it to a problem called Region Coloring (RC), which divides the space into disjoint regions within which all the points have the same RNN set. We then propose a novel algorithm named CREST that efficiently solves the RC problem by labeling each region with the heat value of its containing points. In CREST, we propose innovative techniques to avoid processing expensive RNN queries and greatly reduce the number of region labeling operations. We perform detailed analyses on the complexity of CREST and lower bounds of the RC problem, and prove that CREST is asymptotically optimal in the worst case. Extensive experiments with both real and synthetic data sets demonstrate that CREST outperforms alternative algorithms by several orders of magnitude.

[1]  Wei Wu,et al.  MaxFirst for MaxBRkNN , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[2]  Xiaoyong Du,et al.  Location selection for utility maximization with capacity constraints , 2012, CIKM '12.

[3]  Philip S. Yu,et al.  Maximizing bichromatic reverse nearest neighbor for Lp-norm in two- and three-dimensional spaces , 2011, The VLDB Journal.

[4]  F. Frances Yao,et al.  Computational Geometry , 1991, Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity.

[5]  Rui Zhang,et al.  The HV-tree , 2010, Proc. VLDB Endow..

[6]  Yang Du,et al.  On Computing Top-t Most Influential Spatial Sites , 2005, VLDB.

[7]  Lei Chen,et al.  Utility-Aware Social Event-Participant Planning , 2015, SIGMOD Conference.

[8]  Dan Lin,et al.  The Min-dist Location Selection Query , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[9]  Vijay K. Vaishnavi,et al.  Computing Point Enclosures , 1982, IEEE Transactions on Computers.

[10]  King-Ip Lin,et al.  An index structure for efficient reverse nearest neighbor queries , 2001, Proceedings 17th International Conference on Data Engineering.

[11]  Derick Wood,et al.  An Optimal Worst Case Algorithm for Reporting Intersections of Rectangles , 1980, IEEE Transactions on Computers.

[12]  Thomas Ottmann,et al.  Algorithms for Reporting and Counting Geometric Intersections , 1979, IEEE Transactions on Computers.

[13]  Jin Huang,et al.  Top-k most influential locations selection , 2011, CIKM '11.

[14]  Jiaheng Lu,et al.  Reverse spatial and textual k nearest neighbor search , 2011, SIGMOD '11.

[15]  Yu Zheng,et al.  Real-Time City-Scale Taxi Ridesharing , 2015, IEEE Transactions on Knowledge and Data Engineering.

[16]  Stefan Langerman,et al.  Facility location problems in the plane based on reverse nearest neighbor queries , 2010, Eur. J. Oper. Res..

[17]  Robert E. Tarjan,et al.  Planar point location using persistent search trees , 1986, CACM.

[18]  Xiaoyong Du,et al.  Top-k Most Incremental Location Selection with Capacity Constraint , 2012, WAIM.

[19]  Xiaoyong Du,et al.  MapReduce based location selection algorithm for utility maximization with capacity constraints , 2013, Computing.

[20]  Lei Chen,et al.  Conflict-aware event-participant arrangement , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[21]  Jignesh M. Patel,et al.  Efficient Evaluation of All-Nearest-Neighbor Queries , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[22]  Mark S. Daskin,et al.  Capacitated facility location/network design problems , 2001, Eur. J. Oper. Res..

[23]  Jan Vahrenhold,et al.  Reverse Nearest Neighbor Queries , 2002, Encyclopedia of GIS.

[24]  Michael Ben-Or,et al.  Lower bounds for algebraic computation trees , 1983, STOC.

[25]  Cheng Long,et al.  Efficient algorithms for optimal location queries in road networks , 2014, SIGMOD Conference.

[26]  Chih-Ya Shen,et al.  On socio-spatial group query for location-based social networks , 2012, KDD.

[27]  Mohamed F. Mokbel,et al.  Location-based and preference-aware recommendation using sparse geo-social networking data , 2012, SIGSPATIAL/GIS.

[28]  Rui Zhang,et al.  K-Nearest Neighbor Temporal Aggregate Queries , 2015, EDBT.

[29]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[30]  S. Muthukrishnan,et al.  Influence sets based on reverse nearest neighbor queries , 2000, SIGMOD '00.

[31]  Lars Kulik,et al.  A Motion-Aware Approach to Continuous Retrieval of 3D Objects , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[32]  Yunjun Gao,et al.  On efficient k-optimal-location-selection query processing in metric spaces , 2015, Inf. Sci..