Searching Personalized k-wing in Large and Dynamic Bipartite Graphs

There are extensive studies focusing on the application scenario that all the bipartite cohesive subgraphs need to be discovered in a bipartite graph. However, we observe that, for some applications, one is interested in finding bipartite cohesive subgraphs containing a specific vertex. In this paper, we study a new query dependent bipartite cohesive subgraph search problem based on k-wing model, named as the personalized k-wing search problem. We introduce a k-wing equivalence relationship to summarize the edges of a bipartite graph G into groups. Therefore, all the edges of G are segregated into different groups, i.e. k-wing equivalence class, forming an efficient and wing number conserving index called EquiWing. Further, we propose a more compact version of EquiWing, EquiWing-Comp, which is achieved by integrating our proposed k-butterfly loose approach and discovered hierarchy properties. These indices are used to expedite the personalized k-wing search with a non-repetitive access to G, which leads to linear algorithms for searching the personalized k-wing. Moreover, we conduct a thorough study on the maintenance of the proposed indices for evolving bipartite graphs. We discover novel properties that help us localize the scope of the maintenance at a low cost. By exploiting the discoveries, we propose novel algorithms for maintaining the two indices, which substantially reduces the cost of maintenance. We perform extensive experimental studies in real, large-scale graphs to validate the efficiency and effectiveness of EquiWing and EquiWing-Comp compared to the baseline.

[1]  Ravi Kumar,et al.  Discovering Large Dense Subgraphs in Massive Graphs , 2005, VLDB.

[2]  Philip S. Yu,et al.  Enterprise Employee Training via Project Team Formation , 2017, WSDM.

[3]  Kai Wang,et al.  Vertex Priority Based Butterfly Counting for Large-scale Bipartite Networks , 2018, Proc. VLDB Endow..

[4]  Marco Conti,et al.  Analysis of Co-authorship Ego Networks , 2016, NetSci-X.

[5]  Lu Qin,et al.  Efficient Bitruss Decomposition for Large-scale Bipartite Graphs , 2020, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[6]  Loet Leydesdorff,et al.  Betweenness centrality as a driver of preferential attachment in the evolution of research collaboration networks , 2011, J. Informetrics.

[7]  Zhaonian Zou,et al.  Bitruss Decomposition of Bipartite Graphs , 2016, DASFAA.

[8]  Daniel C. Fain,et al.  Sponsored search: A brief history , 2006 .

[9]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.

[10]  Xuemin Lin,et al.  Effective and efficient community search over large heterogeneous information networks , 2020, Proc. VLDB Endow..

[11]  Jeffrey Xu Yu,et al.  Querying k-truss community in large and dynamic graphs , 2014, SIGMOD Conference.

[12]  Patricia E. Tweet Brokerage and Closure: An Introduction to Social Capital , 2006 .

[13]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[14]  Weifa Liang,et al.  Efficiently computing k-edge connected components via graph decomposition , 2013, SIGMOD '13.

[15]  Xuemin Lin,et al.  Effective and Efficient Truss Computation over Large Heterogeneous Information Networks , 2020, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[16]  Weifa Liang,et al.  Finding maximal k-edge-connected subgraphs from a large graph , 2012, EDBT '12.

[17]  Peixiang Zhao,et al.  Truss-based Community Search: a Truss-equivalence Based Indexing Approach , 2017, Proc. VLDB Endow..

[18]  Yoshihiro Yamanishi,et al.  Chemogenomic approaches to infer drug-target interaction networks. , 2013, Methods in molecular biology.

[19]  Chengfei Liu,et al.  Pivot-based Maximal Biclique Enumeration , 2020, IJCAI.

[20]  Lixia Zhang,et al.  Observing the evolution of internet as topology , 2007, SIGCOMM '07.

[21]  Xuemin Lin,et al.  Efficient and Effective Community Search on Large-scale Bipartite Graphs , 2020, 2021 IEEE 37th International Conference on Data Engineering (ICDE).

[22]  Ali Pinar,et al.  Peeling Bipartite Networks for Dense Subgraph Discovery , 2016, WSDM.

[23]  Aristides Gionis,et al.  The community-search problem and how to plan a successful cocktail party , 2010, KDD.

[24]  Srikanta Tirthapura,et al.  Butterfly Counting in Bipartite Networks , 2017, KDD.

[25]  Fan Meng,et al.  Domain Knowledge-Based Link Prediction in Customer-Product Bipartite Graph for Product Recommendation , 2019, Int. J. Inf. Technol. Decis. Mak..

[26]  Lu Qin,et al.  Efficient (α\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}, β\documentclass[12pt]{minimal} \u , 2020, The VLDB Journal.

[27]  Buket Kaya,et al.  Hotel recommendation system by bipartite networks and link prediction , 2019, J. Inf. Sci..

[28]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[29]  Starling David Hunter,et al.  Structural Holes and Banner-Ad Click-Throughs , 2013 .