Efficient (α\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}, β\documentclass[12pt]{minimal} \u

The problem of computing ( $$\alpha , \beta $$ )-core in a bipartite graph for given $$\alpha $$ and $$\beta $$ is a fundamental problem in bipartite graph analysis and can be used in many applications such as online group recommendation and fraudsters detection Existing solution to computing ( $$\alpha , \beta $$ )-core needs to traverse the entire bipartite graph once and ignore the fact that real-world graphs are often dynamic. Considering the real bipartite graph can be very large and dynamically updated, and the requests to compute ( $$\alpha , \beta $$ )-core can be issued frequently in real applications, the existing solution is too expensive to compute the $$(\alpha ,\beta )$$ -core. In this paper, we present an efficient algorithm for ( $$\alpha , \beta $$ )-core computation based on a novel index such that the algorithm runs in linear time regarding the result size (thus, the algorithm is optimal since it needs at least linear time to output the result). We prove that the index only requires O(m) space where m is the number of edges in the bipartite graph. We also devise an efficient algorithm with time complexity $$O (\delta \cdot m)$$ for index construction where $$\delta $$ is bounded by $$\sqrt{m}$$ and is much smaller than $$\sqrt{m}$$ in practice. Moreover, we discuss efficient algorithms to maintain the index when the bipartite graph is dynamically updated. We show that we can decide whether a node in the index should be updated or not by visiting its neighbors. Based on this locality property, we propose an efficient index maintenance algorithm which only needs to visit a local subgraph near the inserted or removed edge. Finally, we show how to implement our index construction and maintenance algorithms in parallel. The experimental results on real and synthetic graphs (more than 1 billion edges) demonstrate that our algorithms achieve up to 5 orders of magnitude speedup for computing $$(\alpha ,\beta )$$ -core, up to 3 orders of magnitude speedup for index construction and up to 4 orders of magnitude speedup for index maintenance, respectively, compared with existing techniques.

[1]  Hans-Peter Kriegel,et al.  "Strength Lies in Differences": Diversifying Friends for Recommendations through Subspace Clustering , 2014, CIKM.

[2]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[3]  Tamara G. Kolda,et al.  Measuring and modeling bipartite graphs with community structure , 2016, J. Complex Networks.

[4]  Xuemin Lin,et al.  Towards Efficient k-TriPeak Decomposition on Large Graphs , 2019, DASFAA.

[5]  Loet Leydesdorff,et al.  Betweenness centrality as a driver of preferential attachment in the evolution of research collaboration networks , 2011, J. Informetrics.

[6]  René Peeters,et al.  The maximum edge biclique problem is NP-complete , 2003, Discret. Appl. Math..

[7]  Vladimir Batagelj,et al.  Generalized two-mode cores , 2015, Soc. Networks.

[8]  Lijun Chang,et al.  Index-Based Densest Clique Percolation Community Search in Networks , 2018, IEEE Transactions on Knowledge and Data Engineering.

[9]  Jeffrey Xu Yu,et al.  A Fast Order-Based Approach for Core Maintenance , 2016, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[10]  Lijun Chang,et al.  I/O efficient ECC graph decomposition via graph reduction , 2016, The VLDB Journal.

[11]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[12]  Tamara G. Kolda,et al.  A Scalable Generative Graph Model with Community Structure , 2013, SIAM J. Sci. Comput..

[13]  Jean-Loup Guillaume,et al.  Bipartite structure of all complex networks , 2004, Inf. Process. Lett..

[14]  Lijun Chang,et al.  Diversified top-k clique search , 2015, The VLDB Journal.

[15]  Vivekanand Gopalkrishnan,et al.  Towards efficient mining of proportional fault-tolerant frequent itemsets , 2009, KDD.

[16]  Kun-Lung Wu,et al.  Streaming Algorithms for k-core Decomposition , 2013, Proc. VLDB Endow..

[17]  Xuemin Lin,et al.  Efficient (α, β)-core Computation: an Index-based Approach , 2019, WWW.

[18]  Jagadeesh Gorla,et al.  Probabilistic group recommendation via information matching , 2013, WWW.

[19]  Alex Thomo,et al.  K-Core Decomposition of Large Networks on a Single PC , 2015, Proc. VLDB Endow..

[20]  Lixia Zhang,et al.  Observing the evolution of internet as topology , 2007, SIGCOMM.

[21]  Alessandro Vespignani,et al.  K-core Decomposition: a Tool for the Visualization of Large Scale Networks , 2005, ArXiv.

[22]  Yun Zhang,et al.  On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types , 2013, BMC Bioinformatics.

[23]  Serguei Saavedra,et al.  A simple model of bipartite cooperation for ecological and organizational networks , 2009, Nature.

[24]  Emmanuel Müller,et al.  Flexible Fault Tolerant Subspace Clustering for Data with Missing Values , 2011, 2011 IEEE 11th International Conference on Data Mining.

[25]  Reynold Cheng,et al.  Effective Community Search for Large Attributed Graphs , 2016, Proc. VLDB Endow..

[26]  Fan Zhang,et al.  Discovering Strong Communities with User Engagement and Tie Strength , 2018, DASFAA.

[27]  Xuemin Lin,et al.  Efficient Probabilistic K-Core Computation on Uncertain Graphs , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[28]  Gao Cong,et al.  COM: a generative model for group recommendation , 2014, KDD.

[29]  Dik Lun Lee,et al.  Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba , 2018, KDD.

[30]  Sivasankaran Rajamanickam,et al.  BFS and Coloring-Based Parallel Algorithms for Strongly Connected Components and Related Problems , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[31]  Hans-Peter Kriegel,et al.  Fast Group Recommendations by Applying User Clustering , 2012, ER.

[32]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[33]  Hongzhi Wang,et al.  Effective and Efficient Community Search Over Large Directed Graphs , 2019, IEEE Transactions on Knowledge and Data Engineering.

[34]  Stefan Wuchty,et al.  Peeling the yeast protein network , 2005, Proteomics.

[35]  Jonathan W. Berry,et al.  Challenges in Parallel Graph Processing , 2007, Parallel Process. Lett..

[36]  Shivakant Mishra,et al.  Enhancing group recommendation by incorporating social relationship interactions , 2010, GROUP.

[37]  Srikanta Tirthapura,et al.  Butterfly Counting in Bipartite Networks , 2017, KDD.

[38]  Reynold Cheng,et al.  Efficient Algorithms for Densest Subgraph Discovery , 2019, Proc. VLDB Endow..

[39]  Kai Wang,et al.  Efficient Computing of Radius-Bounded k-Cores , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[40]  Ken-ichi Kawarabayashi,et al.  Efficient PageRank Tracking in Evolving Networks , 2015, KDD.

[41]  Reynold Cheng,et al.  Effective and efficient attributed community search , 2017, The VLDB Journal.

[42]  Jun Wang,et al.  Unifying user-based and item-based collaborative filtering approaches by similarity fusion , 2006, SIGIR.

[43]  Dimitrios M. Thilikos,et al.  Evaluating Cooperation in Communities with the k-Core Structure , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[44]  Vladimir Batagelj,et al.  An O(m) Algorithm for Cores Decomposition of Networks , 2003, ArXiv.

[45]  Xiaodong Li,et al.  On Spatial-Aware Community Search , 2019, IEEE Transactions on Knowledge and Data Engineering.

[46]  Guy E. Blelloch,et al.  Theoretically Efficient Parallel Graph Algorithms Can Be Fast and Scalable , 2018, SPAA.

[47]  Sibo Wang,et al.  Reachability queries on large dynamic graphs: a total order approach , 2014, SIGMOD Conference.

[48]  Chen Zhang,et al.  CoreCube: Core Decomposition in Multilayer Graphs , 2019, WISE.

[49]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[50]  Ali Pinar,et al.  Peeling Bipartite Networks for Dense Subgraph Discovery , 2016, WSDM.

[51]  Guy E. Blelloch,et al.  Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.

[52]  Cong Yu,et al.  Group Recommendation: Semantics and Efficiency , 2009, Proc. VLDB Endow..

[53]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[54]  Jinyan Li,et al.  Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment , 2006, Sixth International Conference on Data Mining (ICDM'06).

[55]  Dimitrios M. Thilikos,et al.  D-cores: measuring collaboration of directed graphs based on degeneracy , 2011, Knowledge and Information Systems.

[56]  Xiaodong Li,et al.  Effective Community Search over Large Spatial Graphs , 2017, Proc. VLDB Endow..

[57]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[58]  Vladimir Batagelj,et al.  Visualisation and analysis of the internet movie database , 2007, 2007 6th International Asia-Pacific Symposium on Visualization.

[59]  Jinyan Li,et al.  Maximal Quasi-Bicliques with Balanced Noise Tolerance: Concepts and Co-clustering Applications , 2008, SDM.

[60]  Elisa Bertino,et al.  Collusion Detection in Online Rating Systems , 2013, APWeb.

[61]  Carsten F. Dormann,et al.  Indices, Graphs and Null Models: Analyzing Bipartite Ecological Networks , 2009 .

[62]  Lusheng Wang,et al.  Modeling Protein Interacting Groups by Quasi-Bicliques: Complexity, Algorithm, and Application , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[63]  Prasad Tetali,et al.  Simple Markov-chain algorithms for generating bipartite graphs and tournaments , 1997, SODA '97.

[64]  Dorit S. Hochbaum,et al.  Approximating Clique and Biclique Problems , 1998, J. Algorithms.

[65]  Hui Li,et al.  Efficient Fault-Tolerant Group Recommendation Using alpha-beta-core , 2017, CIKM.

[66]  Srinivasan Parthasarathy,et al.  Extracting Analyzing and Visualizing Triangle K-Core Motifs within Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[67]  Silvio Lattanzi,et al.  Efficient Densest Subgraph Computation in Evolving Graphs , 2015, WWW.

[68]  Andy Schürr,et al.  Incremental Graph Pattern Matching , 2006 .

[69]  Lijun Chang,et al.  Effective and Efficient Dynamic Graph Coloring , 2017, Proc. VLDB Endow..

[70]  Amedeo Napoli,et al.  Mining gene expression data with pattern structures in formal concept analysis , 2011, Inf. Sci..

[71]  Gail E. Kaiser,et al.  Distributed Authoring and Versioning , 1997 .

[72]  Tatsuya Akutsu,et al.  A mathematical model for generating bipartite graphs and its application to protein networks , 2009 .

[73]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[74]  Hendrik T. Macedo,et al.  Users' satisfaction in recommendation systems for groups: an approach based on noncooperative games , 2013, WWW.

[75]  Lijun Chang,et al.  Distributed computing connected components with linear communication cost , 2018, Distributed and Parallel Databases.

[76]  Kun-Lung Wu,et al.  Incremental k-core decomposition: algorithms and evaluation , 2016, The VLDB Journal.