论文信息 - Approximate k-Cover in Hypergraphs: Efficient Algorithms, and Applications

Approximate k-Cover in Hypergraphs: Efficient Algorithms, and Applications

Given a weighted hypergraph $\mathcal{H}(V, \mathcal{E} \subseteq 2^V, w)$, the approximate $k$-cover problem seeks for a size-$k$ subset of $V$ that has the maximum weighted coverage by \emph{sampling only a few hyperedges} in $\mathcal{E}$. The problem has emerged from several network analysis applications including viral marketing, centrality maximization, and landmark selection. Despite many efforts, even the best approaches require $O(k n \log n)$ space complexities, thus, cannot scale to, nowadays, humongous networks without sacrificing formal guarantees. In this paper, we propose BCA, a family of algorithms for approximate $k$-cover that can find $(1-\frac{1}{e} -\epsilon)$-approximation solutions within an \emph{$O(\epsilon^{-2}n \log n)$ space}. That is a factor $k$ reduction on space comparing to the state-of-the-art approaches with the same guarantee. We further make BCA more efficient and robust on real-world instances by introducing a novel adaptive sampling scheme, termed DTA.

My T. Thai | Hung Nguyen | Thang N. Dinh | Tam Vu | Phuc Thai

[1] Eli Upfal,et al. Scalable Betweenness Centrality Maximization via Sampling , 2016, KDD.

[2] Hosung Park,et al. What is Twitter, a social network or a news media? , 2010, WWW '10.

[3] Vahab S. Mirrokni,et al. Optimal Distributed Submodular Optimization via Sketching , 2018, KDD.

[4] Laks V. S. Lakshmanan,et al. Revisiting the Stop-and-Stare Algorithms for Influence Maximization , 2017, Proc. VLDB Endow..

[5] Richard M. Karp,et al. Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[6] My T. Thai,et al. Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks , 2016, SIGMOD Conference.

[7] Xiequan Fan,et al. Hoeffding’s inequality for supermartingales , 2011, 1109.4359.

[8] Thang N. Dinh,et al. Cost-aware Targeted Viral Marketing in billion-scale networks , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[9] Yifei Yuan,et al. Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[10] Matthew Richardson,et al. Mining the network value of customers , 2001, KDD '01.

[11] Le Song,et al. Scalable Influence Estimation in Continuous-Time Diffusion Networks , 2013, NIPS.

[12] Aristides Gionis,et al. Fast shortest path distance estimation in large networks , 2009, CIKM.

[13] Andrew McGregor,et al. Better Streaming Algorithms for the Maximum Coverage Problem , 2018, Theory of Computing Systems.

[14] Xiaokui Xiao,et al. Influence maximization: near-optimal time complexity meets practical efficiency , 2014, SIGMOD Conference.

[15] Edith Cohen,et al. Sketch-based Influence Maximization and Computation: Scaling up with Guarantees , 2014, CIKM.

[16] Kyomin Jung,et al. IRIE: Scalable and Robust Influence Maximization in Social Networks , 2011, 2012 IEEE 12th International Conference on Data Mining.

[17] Hung T. Nguyen,et al. Outward Influence and Cascade Size Estimation in Billion-scale Networks , 2017, Proc. ACM Meas. Anal. Comput. Syst..

[18] Xiaokui Xiao,et al. Influence Maximization in Near-Linear Time: A Martingale Approach , 2015, SIGMOD Conference.

[19] Andreas Krause,et al. Cost-effective outbreak detection in networks , 2007, KDD '07.

[20] Junsong Yuan,et al. Influence Maximization Meets Efficiency and Effectiveness: A Hop-Based Approach , 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[21] Junsong Yuan,et al. Online Processing Algorithms for Influence Maximization , 2018, SIGMOD Conference.

[22] Richard M. Karp,et al. An Optimal Algorithm for Monte Carlo Estimation , 2000, SIAM J. Comput..

[23] Takuya Akiba,et al. Dynamic Influence Analysis in Evolving Networks , 2016, Proc. VLDB Endow..

[24] Christian Borgs,et al. Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[25] G. Nemhauser,et al. Maximizing Submodular Set Functions: Formulations and Analysis of Algorithms* , 1981 .

[26] Sainyam Galhotra,et al. Debunking the Myths of Influence Maximization: An In-Depth Benchmarking Study , 2017, SIGMOD Conference.

[27] Nimrod Megiddo,et al. Linear Programming in Linear Time When the Dimension Is Fixed , 1984, JACM.

[28] Éva Tardos,et al. Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..