A Generic Ontology Framework for Indexing Keyword Search on Massive Graphs

Due to the unstructuredness and the lack of schema information of knowledge graphs, social networks and RDF graphs, keyword search has been proposed for querying such graphs/networks. Recently, various keyword search semantics have been designed. In this paper, we propose a generic ontology-based indexing framework for keyword search, called Bisimulation of Generalized Graph Index (BiG-index), to enhance the search performance. The novelties of BiG-index reside in using an ontology graph GOnt to summarize and index a data graph G iteratively, to form a hierarchical index structure G. BiG-index is generic since it only requires keyword search algorithms to generate query answers from summary graphs having two simple properties. Regarding query evaluation, we transform a keyword search q into Q according to GOnt in runtime. The transformed query is searched on the summary graphs in G. The efficiency is due to the small sizes of the summary graphs and the early pruning of semantically irrelevant subgraphs. To illustrate BiG-index’s applicability, we show popular indexing techniques for keyword search (e.g., Blinks and r-clique) can be easily implemented on top of BiG-index. Our extensive experiments show that BiG-index reduced the runtimes of popular keyword search work Blinks by 50.5% and r-clique by 29.5%.

[1]  Nikos Mamoulis,et al.  Top-k Relevant Semantic Place Retrieval on Spatial RDF Data , 2016, SIGMOD Conference.

[2]  Surajit Chaudhuri,et al.  Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers , 2014, Proc. VLDB Endow..

[3]  Shan Wang,et al.  Finding Top-k Min-Cost Connected Trees in Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[4]  Alfred C. Weaver,et al.  A framework for evaluating database keyword search strategies , 2010, CIKM.

[5]  Xin Wang,et al.  Query preserving graph compression , 2012, SIGMOD Conference.

[6]  Dan Suciu,et al.  Index Structures for Path Expressions , 1999, ICDT.

[7]  Aijun An,et al.  Keyword Search in Graphs: Finding r-cliques , 2011, Proc. VLDB Endow..

[8]  Xiang Lian,et al.  Keyword Search over Distributed Graphs with Compressed Signature , 2017, IEEE Transactions on Knowledge and Data Engineering.

[9]  Junhu Wang,et al.  Exploiting Vertex Relationships in Speeding up Subgraph Isomorphism over Large Graphs , 2015, Proc. VLDB Endow..

[10]  Chao Tian,et al.  Incremental Graph Computations: Doable and Undoable , 2017, SIGMOD Conference.

[11]  Andrew Lim,et al.  D(k)-index: an adaptive structural summary for graph-structured data , 2003, SIGMOD '03.

[12]  Jianliang Xu,et al.  AutoG: a visual query autocompletion framework for graph databases , 2017, The VLDB Journal.

[13]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[14]  Yinghui Wu,et al.  Ontology-based subgraph querying , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[15]  Haofen Wang,et al.  Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[16]  S. Sudarshan,et al.  Keyword searching and browsing in databases using BANKS , 2002, Proceedings 18th International Conference on Data Engineering.

[17]  Reynold Cheng,et al.  Effective Community Search for Large Attributed Graphs , 2016, Proc. VLDB Endow..

[18]  Lei Zou,et al.  Semantic SPARQL Similarity Search Over RDF Knowledge Graphs , 2016, Proc. VLDB Endow..

[19]  Peter Buneman,et al.  Edinburgh Research Explorer Path Queries on Compressed XML , 2022 .

[20]  Ehud Gudes,et al.  Exploiting local similarity for indexing paths in graph-structured data , 2002, Proceedings 18th International Conference on Data Engineering.

[21]  Gerhard Weikum,et al.  PATTY: A Taxonomy of Relational Patterns with Semantic Types , 2012, EMNLP.

[22]  Catherine Faron-Zucker,et al.  Searching the semantic Web: approximate query processing based on ontologies , 2006, IEEE Intelligent Systems.

[23]  Jianliang Xu,et al.  A Generic Ontology Framework for Indexing Keyword Search on Massive Graphs (Extended Abstract) , 2021, 2021 IEEE 37th International Conference on Data Engineering (ICDE).

[24]  Jianliang Xu,et al.  Incremental Maintenance of the Minimum Bisimulation of Cyclic Graphs , 2013, IEEE Transactions on Knowledge and Data Engineering.

[25]  Wenfei Fan,et al.  Vectorizing and querying large XML repositories , 2005, 21st International Conference on Data Engineering (ICDE'05).

[26]  S. Sudarshan,et al.  Bidirectional Expansion For Keyword Search on Graph Databases , 2005, VLDB.

[27]  Philip S. Yu,et al.  BLINKS: ranked keyword searches on graphs , 2007, SIGMOD '07.

[28]  Nisheeth Shrivastava,et al.  Graph summarization with bounded error , 2008, SIGMOD Conference.

[29]  Yinghui Wu,et al.  Summarizing Answer Graphs Induced by Keyword Queries , 2013, Proc. VLDB Endow..

[30]  Jignesh M. Patel,et al.  Efficient aggregation for graph summarization , 2008, SIGMOD Conference.

[31]  Gerhard Weikum,et al.  Fine-grained Semantic Typing of Emerging Entities , 2013, ACL.

[32]  Chengqi Zhang,et al.  Scalable big graph processing in MapReduce , 2014, SIGMOD Conference.