Keyword Graph: Answering Keyword Search over Large Graphs

In this paper, we focus on IR style queries, keyword search, over large disk-resident graphs. Since most existing approaches cache the whole graph and indexing structure in memory, these approaches cannot be applied into large graphs, such as RDF graphs and social networks. In this paper, we design a novel indexing structure,(kernel) keyword graph to summarize the structure of original graph. Based on (kernel) keyword graph, we propose an efficient keyword search algorithm. Extensive experiments confirm that our method can scale up to large graphs with millions of nodes and edges. The performance of our approach outperforms state-of-the-art algorithms by at least one order of magnitude.

[1]  Luis Gravano,et al.  Efficient IR-Style Keyword Search over Relational Databases , 2003, VLDB.

[2]  Gerhard Weikum,et al.  STAR: Steiner-Tree Approximation in Relationship Graphs , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[3]  Shan Wang,et al.  Finding Top-k Min-Cost Connected Trees in Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[4]  Francis Jack Smith,et al.  Extension of Zipf’s Law to Words and Phrases , 2002, COLING.

[5]  S. Sudarshan,et al.  Keyword searching and browsing in databases using BANKS , 2002, Proceedings 18th International Conference on Data Engineering.

[6]  Philip S. Yu,et al.  BLINKS: ranked keyword searches on graphs , 2007, SIGMOD '07.

[7]  Vagelis Hristidis,et al.  DISCOVER: Keyword Search in Relational Databases , 2002, VLDB.

[8]  Hai Jin,et al.  Practical and effective IR-style keyword search over semantic web , 2009, Inf. Process. Manag..

[9]  Yeye He,et al.  Keyword++ , 2010, Proc. VLDB Endow..

[10]  S. Sudarshan,et al.  Keyword search on external memory data graphs , 2008, Proc. VLDB Endow..

[11]  Edith Cohen,et al.  Reachability and distance queries via 2-hop labels , 2002, SODA '02.

[12]  Haofen Wang,et al.  Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[13]  Aijun An,et al.  Keyword Search in Graphs: Finding r-cliques , 2011, Proc. VLDB Endow..

[14]  Lei Zou,et al.  Answering pattern match queries in large graph databases via graph embedding , 2011, The VLDB Journal.

[15]  Sonia Bergamaschi,et al.  Keyword search over relational databases: a metadata approach , 2011, SIGMOD '11.

[16]  Donald D. Chamberlin,et al.  XQuery: a query language for XML , 2003, SIGMOD '03.