Hypergraph querying using structural indexing and layer-related-closure verification

Graph indexing and querying mechanisms have been receiving significant attention due to their importance in analyzing the growing graph datasets in many domains. Although much work has been done in the context of simple graphs, they are not directly applicable to hypergraphs that represent more complex relationships in various applications. The key problem here is to search a given subhypergraph query in a larger hypergraph dataset. This search problem is known to be NP-hard as it is related to graph isomorphism. To solve this search problem in an efficient manner, we first create an index set by extracting the common subhypergraph structures from the given hypergraph dataset. Upon receiving a query, we use the same indexing techniques and create a query index set from the given subhypergraph. Utilizing both indices, we identify the possible locations of the query in the hypergraph dataset. We then start the subhypergraph search to verify whether the query really appears at each location by using an accelerated verification mechanism called layer-related-closure method. Through experiments on a real hypergraph dataset and random datasets, we demonstrate the efficiency and effectiveness of hypergraph indexing and our verification method.

[1]  Sherif Sakr,et al.  Graph indexing and querying: a review , 2010, Int. J. Web Inf. Syst..

[2]  Wei Jin,et al.  SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs , 2010, Proc. VLDB Endow..

[3]  Sergei Vassilvitskii,et al.  Counting triangles and the curse of the last reducer , 2011, WWW.

[4]  Noga Alon,et al.  Finding and counting given length cycles , 1997, Algorithmica.

[5]  Elena V. Konstantinova,et al.  Application of hypergraph theory in chemistry , 2001, Discret. Math..

[6]  Shashi Shekhar,et al.  Multilevel hypergraph partitioning: application in VLSI domain , 1997, DAC.

[7]  Steffen Klamt,et al.  Hypergraphs and Cellular Networks , 2009, PLoS Comput. Biol..

[8]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[9]  Dennis Shasha,et al.  GraphGrep: A fast and universal method for querying graphs , 2002, Object recognition supported by user interaction for service robots.

[10]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[11]  James Cheng,et al.  Triangle listing in massive networks , 2012, TKDD.

[12]  TaeHyun Hwang,et al.  Learning on Weighted Hypergraphs to Integrate Protein Interactions and Gene Expressions for Cancer Outcome Prediction , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[13]  Richard A. Brualdi The diagonal hypergraph of a matrix (bipartite graph) , 1979, Discret. Math..

[14]  Dorothea Wagner,et al.  Finding, Counting and Listing All Triangles in Large Graphs, an Experimental Study , 2005, WEA.

[15]  Luc De Raedt,et al.  Frequent Hypergraph Mining , 2006, ILP.

[16]  Hakan Kardes,et al.  Structural Graph Indexing for Mining Complex Networks , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems Workshops.

[17]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[18]  Abraham Kandel,et al.  Applied Pattern Recognition , 2008, Studies in Computational Intelligence.

[19]  Horst Bunke,et al.  Theoretical and Algorithmic Framework for Hypergraph Matching , 2005, ICIAP.

[20]  Philip S. Yu,et al.  GString: A Novel Approach for Efficient Search in Graph Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[21]  Jimmy J. Lin,et al.  Design patterns for efficient graph algorithms in MapReduce , 2010, MLG '10.

[22]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[23]  Andrew B. Kahng,et al.  Design and implementation of move-based heuristics for VLSI hypergraph partitioning , 2000, JEAL.

[24]  Shijie Zhang,et al.  GADDI: distance index based subgraph matching in biological networks , 2009, EDBT '09.

[25]  Charalampos E. Tsourakakis Fast Counting of Triangles in Large Real Networks without Counting: Algorithms and Laws , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[26]  John Scott What is social network analysis , 2010 .

[27]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[28]  Jignesh M. Patel,et al.  TALE: A Tool for Approximate Large Graph Matching , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[29]  J. A. Rodríguez-Velázquez,et al.  Complex Networks as Hypergraphs , 2005, physics/0505137.

[30]  Carlos Guestrin,et al.  Distributed GraphLab : A Framework for Machine Learning and Data Mining in the Cloud , 2012 .

[31]  Emad Ramadan,et al.  A hypergraph model for the yeast protein complex network , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[32]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[33]  Igor L. Markov,et al.  Hypergraph Partitioning and Clustering , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[34]  Horst Bunke,et al.  Matching of Hypergraphs - Algorithms, Applications, and Experiments , 2008, Applied Pattern Recognition.

[35]  Jun Huan,et al.  G-hash: towards fast kernel-based similarity search in large graph databases , 2009, EDBT '09.

[36]  Ambuj K. Singh,et al.  Closure-Tree: An Index Structure for Graph Queries , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[37]  Elena V. Konstantinova,et al.  Molecular Hypergraphs: The New Representation of Nonclassical Molecular Structures with Polycentric Delocalized Bonds , 1995, J. Chem. Inf. Comput. Sci..

[38]  Albert,et al.  Topology of evolving networks: local events and universality , 2000, Physical review letters.