Graph compression based on transitivity for neighborhood query

Abstract In recent years, many graph compression methods have been introduced. One successful category of them is based on local decompression designed to answer neighborhood queries. These techniques mainly rely on local similarities of vertices. Besides, their performance is usually a function of graph sparsity. The proposed approach, in this paper, is a lossy compression technique used to answer neighborhood queries with a more general precondition, called transitivity. The output of this method is a sparse graph optimized to keep original adjacent vertices, in at most 2-distance from each other and vice versa. In other words, by traversing a compressed graph by depth of 2, from any desired vertex, its original adjacency list is reconstructed, with an acceptable error. This paper models an optimization problem to solve the inverse problem of finding the best compressed graph in order to minimize the reconstruction error. Then, this NP problem is approximated by a heuristic with a low degree polynomial time-complexity near to the complexity of the forward problem. The results of applying the proposed method on toy and real datasets are compared with the state of the art that improves compression ratio and performance with an acceptable query response time.

[1]  Nieves R. Brisaboa,et al.  Distributed Query Processing on Compressed Graphs Using K2-Trees , 2013, SPIRE.

[2]  Francesco Bonchi,et al.  Graph summarization with quality guarantees , 2014, 2014 IEEE International Conference on Data Mining.

[3]  Kijung Shin,et al.  Incremental Lossless Graph Summarization , 2020, KDD.

[4]  Roberto Grossi,et al.  Zuckerli: A New Compressed Representation for Graphs , 2020, IEEE Access.

[5]  Ryan A. Rossi,et al.  GraphZIP: a clique-based sparse graph compression method , 2018, Journal of Big Data.

[6]  Xin Wang,et al.  Query preserving graph compression , 2012, SIGMOD Conference.

[7]  Richard Hill,et al.  Optimizing K2 trees: A case for validating the maturity of network of practices , 2012, Comput. Math. Appl..

[8]  Evimaria Terzi,et al.  GraSS: Graph Structure Summarization , 2010, SDM.

[9]  Alistair Moffat,et al.  Off-line dictionary-based compression , 2000 .

[10]  Debmalya Panigrahi,et al.  A General Framework for Graph Sparsification , 2019, SIAM J. Comput..

[11]  Sebastiano Vigna,et al.  The webgraph framework I: compression techniques , 2004, WWW '04.

[12]  Michael Sioutis,et al.  On the Effect of Locality in Compressing Social Networks , 2014, ECIR.

[13]  Young-Koo Lee,et al.  Faster compression methods for a weighted graph using locality sensitive hashing , 2017, Inf. Sci..

[14]  Gonzalo Navarro,et al.  Compressed representations for web and social graphs , 2013, Knowledge and Information Systems.

[15]  Jian Pei,et al.  Community Preserving Lossy Compression of Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining.

[16]  Sebastiano Vigna,et al.  The Webgraph framework II: codes for the World-Wide Web , 2004, Data Compression Conference, 2004. Proceedings. DCC 2004.

[17]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[18]  Gonzalo Navarro,et al.  k2-Trees for Compact Web Graph Representation , 2009, SPIRE.

[19]  Gonzalo Navarro,et al.  Fast and Compact Web Graph Representations , 2010, TWEB.

[20]  Sebastian Maneth,et al.  Compressing graphs by grammars , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[21]  Chen Chen,et al.  Reachability preserving compression for dynamic graph , 2020, Inf. Sci..

[22]  Srinivasan Parthasarathy,et al.  Local graph sparsification for scalable clustering , 2011, SIGMOD '11.

[23]  Sebastian Maneth,et al.  Grammar-Based Graph Compression , 2017, Inf. Syst..

[24]  Danai Koutra,et al.  Graph Summarization Methods and Applications , 2016, ACM Comput. Surv..

[25]  Susana Ladra,et al.  Practical representations for web and social graphs , 2011, CIKM '11.

[26]  Susana Ladra,et al.  Compact structure for sparse undirected graphs based on a clique graph partition , 2021, Inf. Sci..

[27]  Shang-Hua Teng,et al.  Spectral Sparsification of Graphs , 2008, SIAM J. Comput..

[28]  Shou-De Lin,et al.  Egocentric Information Abstraction for Heterogeneous Social Networks , 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.

[29]  Jian Pei,et al.  Neighbor query friendly compression of social networks , 2010, KDD.

[30]  Szymon Grabowski,et al.  Tight and simple Web graph compression for forward and reverse neighbor queries , 2014, Discret. Appl. Math..

[31]  Alberto Apostolico,et al.  Graph Compression by BFS , 2009, Algorithms.

[32]  Gonzalo Navarro,et al.  Compact representation of Web graphs with extended functionality , 2014, Inf. Syst..

[33]  Torsten Hoefler,et al.  Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations , 2018, ArXiv.

[34]  Stephen G. Kobourov,et al.  Graph Spanners: A Tutorial Review , 2020, Comput. Sci. Rev..

[35]  Nieves R. Brisaboa,et al.  Two-Dimensional Block Trees , 2018, 2018 Data Compression Conference.

[36]  Huai Liu,et al.  Metamorphic Testing , 2018, ACM Comput. Surv..

[37]  Shang-Hua Teng,et al.  Spectral sparsification of graphs: theory and algorithms , 2013, CACM.

[38]  Giovanni Toffetti Carughi,et al.  Engineering rich internet applications with a model-driven approach , 2010, TWEB.