Geodemographic Influence Maximization

Given a set of locations in a city, on which ones should we place ads on so as to reach as many people as possible within a limited budget? Past research has addressed this question under the assumption that dense trajectory data are available to determine the reach of each ad. However, the data that are available in most industrial settings do not consist of dense, long-range trajectories; instead, they consist of statistics on people's short-range point-to-point movements. In this paper, we address the natural problem that arises such data: given a distribution of population and point-to-point movement statistics over a network, find a set of locations within a budget that achieves maximum expected reach. We call this problem geodemographic influence maximization (GIM). We show that the problem is NP-hard, but its objective function is monotone and submodular, thus admits a greedy algorithm with a 1 over 2 (1-1 over e) approximation ratio. Still, this algorithm is inapplicable on large-scale data for high-frequency digital signage ads. We develop an efficient deterministic algorithm, Lazy-Sower, exploiting a novel, tight double-bounding scheme of marginal influence gain as well as the locality proprieties of the problem; a learning-based variant, NN-Sower, utilizes randomization and deep learning to further improve efficiency, with a slight loss of quality. Our exhaustive experimental study on two real-world urban datasets demonstrates the efficacy and efficiency of our solutions compared to baselines.

[1]  Eleftherios Alamanos,et al.  New insights into the impact of digital signage as a retail atmospheric tool , 2012 .

[2]  Longbo Huang,et al.  User identification in cyber-physical space: a case study on mobile query logs and trajectories , 2016, SIGSPATIAL/GIS.

[3]  Lex van Meurs,et al.  Split-Second Recognition: What Makes Outdoor Advertising Work? , 2009, Journal of Advertising Research.

[4]  Ju Fan,et al.  Maximizing Multifaceted Network Influence , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[5]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[6]  Samir Khuller,et al.  The Budgeted Maximum Coverage Problem , 1999, Inf. Process. Lett..

[7]  Shuai Xu,et al.  Location-Based Influence Maximization in Social Networks , 2015, CIKM.

[8]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[9]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[11]  Y. de Montjoye,et al.  Unique in the shopping mall: On the reidentifiability of credit card metadata , 2015, Science.

[12]  Wei Wu,et al.  Influence Maximization in Trajectory Databases , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[13]  Longbo Huang,et al.  A Unified Framework for User Identification across Online and Offline Data , 2020 .

[14]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[15]  Kian-Lee Tan,et al.  Efficient location-aware influence maximization , 2014, SIGMOD Conference.

[16]  Laurence A. Wolsey,et al.  Best Algorithms for Approximating the Maximum of a Submodular Set Function , 1978, Math. Oper. Res..

[17]  Paul R. Cohen,et al.  Maximizing Influence Propagation in Networks with Community Structure , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Panagiotis Karras,et al.  Content Recommendation for Viral Social Influence , 2017, SIGIR.

[19]  Thibaut Horel Notes on Greedy Algorithms for Submodular Maximization Thibaut , 2015 .

[20]  Zhifeng Bao,et al.  Optimizing Impression Counts for Outdoor Advertising , 2019, KDD.

[21]  Panagiotis Karras,et al.  Content-Based Network Influence Probabilities: Extraction and Application , 2019, 2019 International Conference on Data Mining Workshops (ICDMW).

[22]  Andreas Krause,et al.  Lazier Than Lazy Greedy , 2014, AAAI.

[23]  Panagiotis Karras,et al.  An Experimental Study on Network Immunization , 2019, EDBT.

[24]  Michel Minoux,et al.  Accelerated greedy algorithms for maximizing submodular set functions , 1978 .

[25]  J. Shane Culpepper,et al.  Finding Temporal Influential Users Over Evolving Social Networks , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[26]  Wei Wu,et al.  A “semi-lazy” approach to probabilistic path prediction in dynamic environments , 2013, KDD.

[27]  Yuchen Li,et al.  On the Robustness of Cascade Diffusion under Node Attacks , 2020, WWW.

[28]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.

[29]  Ping Zhang,et al.  Trajectory-driven Influential Billboard Placement , 2018, KDD.

[30]  Wei Wu,et al.  R2-D2: a System to Support Probabilistic Path Prediction in Dynamic Environments via "Semi-Lazy" Learning , 2013, Proc. VLDB Endow..

[31]  Jingbo Zhou,et al.  Early Warning of Human Crowds Based on Query Data from Baidu Map: Analysis Based on Shanghai Stampede , 2016, ArXiv.