Approximate k-Cover in Hypergraphs: Efficient Algorithms, and Applications

Given a weighted hypergraph $\mathcal{H}(V, \mathcal{E} \subseteq 2^V, w)$, the approximate $k$-cover problem seeks a size-$k$ subset of $V$ with maximum weighted coverage while \emph{sampling only a few hyperedges} in $\mathcal{E}$. The problem arises in several network analysis applications, including viral marketing, centrality maximization, and landmark selection. Despite many efforts, even the best approaches require $O(k n \log n)$ space and thus cannot scale to today's massive networks without sacrificing formal guarantees. In this paper, we propose BCA, a family of algorithms for approximate $k$-cover that find $(1-\frac{1}{e}-\epsilon)$-approximate solutions within \emph{$O(\epsilon^{-2} n \log n)$ space}. This is a factor-$k$ reduction in space compared to state-of-the-art approaches with the same guarantee. We further make BCA more efficient and robust on real-world instances by introducing a novel adaptive sampling scheme, termed DTA.
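To make the objective concrete, the following is a minimal sketch of the classical greedy rule for weighted maximum coverage, applied to a list of hyperedges that is assumed to have been sampled already. It is not BCA or DTA: the sampling step, the stopping condition, and the space bound described above are not modeled, and the function name `greedy_k_cover` and the `(nodes, weight)` input format are illustrative assumptions rather than anything defined in the paper.

```python
from collections import defaultdict
from typing import Hashable, Iterable, List, Sequence, Set, Tuple


def greedy_k_cover(
    hyperedges: Sequence[Tuple[Iterable[Hashable], float]], k: int
) -> Tuple[List[Hashable], float]:
    """Greedily pick up to k nodes maximizing the total weight of covered hyperedges.

    A hyperedge counts as covered once at least one of its nodes is chosen.
    """
    # Inverted index (hypothetical layout): node -> indices of hyperedges containing it.
    incident = defaultdict(set)
    weights: List[float] = []
    for idx, (nodes, weight) in enumerate(hyperedges):
        weights.append(weight)
        for node in nodes:
            incident[node].add(idx)

    covered: Set[int] = set()
    chosen: List[Hashable] = []
    total = 0.0
    for _ in range(k):
        best_node, best_gain = None, 0.0
        for node, edge_ids in incident.items():
            # Marginal gain: weight of this node's hyperedges not yet covered.
            gain = sum(weights[i] for i in edge_ids - covered)
            if gain > best_gain:
                best_node, best_gain = node, gain
        if best_node is None:
            break  # no remaining node adds coverage
        chosen.append(best_node)
        covered |= incident[best_node]
        total += best_gain
    return chosen, total


# Toy instance: five weighted hyperedges over nodes a, b, c, d.
sample = [(["a", "b"], 1.0), (["b", "c"], 2.0), (["c"], 1.5), (["d"], 0.5), (["b"], 0.25)]
print(greedy_k_cover(sample, k=2))  # (['c', 'b'], 4.75)
```

On the toy instance, node c is picked first (gain 3.5), after which b has the largest remaining marginal gain (1.25). This sketch rescans every node in each round for clarity; it makes no attempt to match the efficiency of the algorithms proposed here.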
