An Information-theoretic Framework for the Lossy Compression of Link Streams

Abstract Graph compression is a data analysis technique that consists in the replacement of parts of a graph by more concise structural patterns in order to reduce its description length. It notably provides interesting exploration tools for the study of real, large-scale, and complex graphs which cannot be grasped at first glance. This article proposes a framework for the compression of temporal graphs, that is for the compression of graphs that evolve with time. This framework first builds on a simple and limited scheme, exploiting structural equivalence for the lossless compression of static graphs, then generalises it to the lossy compression of link streams, a recent formalism for the study of temporal graphs. Such generalisation builds on the natural extension of (bidimensional) relational data by the addition of a third temporal dimension. Moreover, we introduce an information-theoretic measure to quantify and to control the information that is lost during compression, as well as an algebraic characterisation of the space of possible compression patterns to enhance the expressiveness of the initial compression scheme. These contributions lead to the definition of a combinatorial optimisation problem, that is the Lossy Multistream Compression Problem, for which we provide an exact algorithm.

[1]  Guy Melançon,et al.  PORGY: A Visual Graph Rewriting Environment for Complex Systems , 2012, Comput. Graph. Forum.

[2]  Jeffrey D. Scargle,et al.  An algorithm for optimal partitioning of data on an interval , 2003, IEEE Signal Processing Letters.

[3]  Ambuj K. Singh,et al.  Closure-Tree: An Index Structure for Graph Queries , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[4]  Jean-Marc Vincent,et al.  Building Optimal Macroscopic Representations of Complex Multi-agent Systems - Application to the Spatial and Temporal Analysis of International Relations Through News Aggregation , 2014, Trans. Comput. Collect. Intell..

[5]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[6]  R. Rathipriya,et al.  Triclustering: An evolution of clustering , 2016, 2016 Online International Conference on Green Engineering and Technologies (IC-GET).

[7]  S. Borgatti,et al.  Regular blockmodels of multiway, multimode matrices☆ , 1992 .

[8]  Uriel G. Rothblum,et al.  Technical Note—A Partitioning Problem with Additive Objective with an Application to Optimal Inventory Groupings for Joint Replenishment , 1982 .

[9]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[10]  Ronald M. Harstad,et al.  Computationally Manageable Combinational Auctions , 1998 .

[11]  Matthieu Latapy,et al.  Computing maximal cliques in link streams , 2015, Theor. Comput. Sci..

[12]  Awi Federgruen,et al.  Structured Partitioning Problems , 1991, Oper. Res..

[13]  Onn Shehory,et al.  Coalition structure generation with worst case guarantees , 2022 .

[14]  Nisheeth Shrivastava,et al.  Graph summarization with bounded error , 2008, SIGMOD Conference.

[15]  Brian A. Davey,et al.  An Introduction to Lattices and Order , 1989 .

[16]  H. White,et al.  STRUCTURAL EQUIVALENCE OF INDIVIDUALS IN SOCIAL NETWORKS , 1977 .

[17]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[18]  Pascal Pons,et al.  Post-processing hierarchical community structures: Quality improvements and multi-scale view , 2006, Theor. Comput. Sci..

[19]  Fang Zhou,et al.  Review of network abstraction techniques , 2009 .

[20]  A. Ferligoj,et al.  Direct and indirect methods for structural equivalence , 1992 .

[21]  Yee Whye Teh,et al.  The Mondrian Process , 2008, NIPS.

[22]  Kim Marriott,et al.  Edge Compression Techniques for Visualization of Dense Directed Graphs , 2013, IEEE Transactions on Visualization and Computer Graphics.

[23]  Meera Dhabu,et al.  Partition based Graph Compression , 2013 .

[24]  Jean-Marc Vincent,et al.  Trace Management and Analysis for Embedded Systems , 2013, 2013 IEEE 7th International Symposium on Embedded Multicore Socs.

[25]  Matthieu Latapy,et al.  Stream graphs and link streams for the modeling of interactions over time , 2017, Social Network Analysis and Mining.

[26]  E. Balas,et al.  Set Partitioning: A survey , 1976 .

[27]  Rajeev Motwani,et al.  Clique partitions, graph compression and speeding-up algorithms , 1991, STOC '91.

[28]  Evimaria Terzi,et al.  GraSS: Graph Structure Summarization , 2010, SDM.

[29]  Sebastian E. Ahnert,et al.  Generalised power graph compression reveals dominant relationship patterns in complex networks , 2014, Scientific Reports.

[30]  Jean-Marc Vincent,et al.  A Generic Algorithmic Framework to Solve Special Versions of the Set Partitioning Problem , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[31]  Fabrice Rossi,et al.  A Triclustering Approach for Time Evolving Graphs , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[32]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[33]  Inderjit S. Dhillon,et al.  Information-theoretic co-clustering , 2003, KDD '03.

[34]  Jignesh M. Patel,et al.  Discovery-driven graph summarization , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[35]  R. Hanneman Introduction to Social Network Methods , 2001 .

[36]  Paolo Serafino,et al.  Speeding up graph clustering via modular decomposition based compression , 2013, SAC '13.

[37]  Jean-Marc Vincent,et al.  A spatiotemporal data aggregation technique for performance analysis of large-scale execution traces , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).

[38]  Fang Zhou,et al.  Compression of weighted graphs , 2011, KDD.

[39]  Rodney R. Howell On Asymptotic Notation with Multiple Variables , 2008 .

[40]  Tuomas Sandholm,et al.  Algorithm for optimal winner determination in combinatorial auctions , 2002, Artif. Intell..

[41]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[42]  R. Vidal Optimal Partition of an Interval — The Discrete Version , 1993 .

[43]  Nicholas R. Jennings,et al.  Coalition Structure Generation : Dynamic Programming Meets Anytime Optimization , 2008 .

[44]  Gonzalo Navarro,et al.  Compressed Representation of Web and Social Networks via Dense Subgraphs , 2012, SPIRE.