Top-k Connected Overlapping Densest Subgraphs in Dual Networks

Networks are largely used for modelling and analysing data and relations among them. Recently, it has been shown that the use of a single network may not be the optimal choice, since a single network may misses some aspects. Consequently, it has been proposed to use a pair of networks to better model all the aspects, and the main approach is referred to as dual networks (DNs). DNs are two related graphs (one weighted, the other unweighted) that share the same set of vertices and two different edge sets. In DNs is often interesting to extract common subgraphs among the two networks that are maximally dense in the conceptual network and connected in the physical one. The simplest instance of this problem is finding a common densest connected subgraph (DCS), while we here focus on the detection of the Top-k Densest Connected subgraphs, i.e. a set k subgraphs having the largest density in the conceptual network which are also connected in the physical network. We formalise the problem and then we propose a heuristic to find a solution, since the problem is computationally hard. A set of experiments on synthetic and real networks is also presented to support our approach.

[1]  Andrew V. Goldberg,et al.  Finding a Maximum Density Subgraph , 1984 .

[2]  Mario Cannataro,et al.  μ-CS: An extension of the TM4 platform to manage Affymetrix binary data , 2010, BMC Bioinformatics.

[3]  Johan Håstad,et al.  Clique is hard to approximate within n/sup 1-/spl epsiv// , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[4]  Francesco Bonchi,et al.  Finding Subgraphs with Maximum Total Density and Limited Overlap , 2015, WSDM.

[5]  Charu C. Aggarwal,et al.  A Survey of Algorithms for Dense Subgraph Discovery , 2010, Managing and Mining Graph Data.

[6]  Kostas E. Psannis,et al.  Social networking data analysis tools & challenges , 2016, Future Gener. Comput. Syst..

[7]  Mario Cannataro,et al.  L-HetNetAligner: A novel algorithm for Local Alignment of Heterogeneous Biological Networks , 2020, Scientific Reports.

[8]  Jure Leskovec,et al.  Community-Affiliation Graph Model for Overlapping Network Community Detection , 2012, 2012 IEEE 12th International Conference on Data Mining.

[9]  Christian Komusiewicz,et al.  Multivariate Algorithmics for Finding Cohesive Subnetworks , 2016, Algorithms.

[10]  Riccardo Dondi,et al.  Top-k Overlapping Densest Subgraphs: Approximation and Complexity , 2019, ICTCS.

[11]  Mario Cannataro,et al.  Protein-to-protein interactions: Technologies, databases, and algorithms , 2010, CSUR.

[12]  Christian Ottmann,et al.  Protein-Protein Interactions. , 2017, Drug discovery today. Technologies.

[13]  Moses Charikar,et al.  Greedy approximation algorithms for finding dense components in a graph , 2000, APPROX.

[14]  Xiaoming Liu,et al.  Digger , 2018, ACM Trans. Knowl. Discov. Data.

[15]  Hisao Tamaki,et al.  Greedily Finding a Dense Subgraph , 2000, J. Algorithms.

[16]  Aristides Gionis,et al.  Top-k overlapping densest subgraphs , 2016, Data Mining and Knowledge Discovery.

[17]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[18]  Yasushi Kawase,et al.  The Densest Subgraph Problem with a Convex/Concave Size Function , 2017, Algorithmica.

[19]  Mohammad Mehdi Hosseinzadeh,et al.  Dense Subgraphs in Biological Networks , 2020, SOFSEM.

[20]  Pietro Hiram Guzzi,et al.  Survey of local and global biological network alignment: the need to reconcile the two sides of the same coin , 2017, Briefings Bioinform..

[21]  Mario Cannataro,et al.  HetNetAligner: A Novel Algorithm for Local Alignment of Heterogeneous Biological Networks , 2018, BCB.

[22]  Li Li,et al.  Mining Dual Networks , 2016, ACM Trans. Knowl. Discov. Data.

[23]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[24]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[25]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..

[26]  Mario Cannataro,et al.  IMPRECO: Distributed prediction of protein complexes , 2010, Future Gener. Comput. Syst..

[27]  David Zuckerman,et al.  Electronic Colloquium on Computational Complexity, Report No. 100 (2005) Linear Degree Extractors and the Inapproximability of MAX CLIQUE and CHROMATIC NUMBER , 2005 .

[28]  Richard M. Karp,et al.  Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[29]  Timothy M. Chan All-pairs shortest paths for unweighted undirected graphs in o(mn) time , 2012, TALG.