Relevance Search over Schema-Rich Knowledge Graphs

Relevance search over a knowledge graph (KG) has gained much research attention. Given a query entity in a KG, the problem is to find its most relevant entities. However, the relevance function is hidden and dynamic. Different users for different queries may consider relevance from different angles of semantics. The ambiguity in a query is more noticeable in the presence of thousands of types of entities and relations in a schema-rich KG, which has challenged the effectiveness and scalability of existing methods. To meet the challenge, our approach called RelSUE requests a user to provide a small number of answer entities as examples, and then automatically learns the most likely relevance function from these examples. Specifically, we assume the intent of a query can be characterized by a set of meta-paths at the schema level. RelSUE searches a KG for diversified significant meta-paths that best characterize the relevance of the user-provided examples to the query entity. It reduces the large search space of a schema-rich KG using distance and degree-based heuristics, and performs reasoning to deduplicate meta-paths that represent equivalent query-specific semantics. Finally, a linear model is learned to predict meta-path based relevance. Extensive experiments demonstrate that RelSUE outperforms several state-of-the-art methods.

[1]  Zhendong Mao,et al.  Knowledge Graph Embedding: A Survey of Approaches and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[2]  Ramez Elmasri,et al.  Querying Knowledge Graphs by Example Entity Tuples , 2015, IEEE Trans. Knowl. Data Eng..

[3]  Reynold Cheng,et al.  Discovering Meta-Paths in Large Heterogeneous Information Networks , 2015, WWW.

[4]  Marcin Sydow,et al.  QBEES: query-by-example entity search in semantic knowledge graphs based on maximal aspects, diversity-awareness and relaxation , 2017, Journal of Intelligent Information Systems.

[5]  Xiaoli Li,et al.  A Heterogeneous Information Network Method for Entity Set Expansion in Knowledge Graph , 2018, PAKDD.

[6]  Kevin Chen-Chuan Chang,et al.  Semantic proximity search on graphs with metagraph-based learning , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[7]  Charu C. Aggarwal,et al.  Co-author Relationship Prediction in Heterogeneous Bibliographic Networks , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[8]  Qingzhong Li,et al.  Integrating meta-path selection with user-preference for top-k relevant search in heterogeneous information networks , 2014, Proceedings of the 2014 IEEE 18th International Conference on Computer Supported Cooperative Work in Design (CSCWD).

[9]  Yizhou Sun,et al.  RelSim: Relation Similarity Search in Schema-Rich Heterogeneous Information Networks , 2016, SDM.

[10]  Takuya Akiba,et al.  Fast exact shortest-path distance queries on large networks by pruned landmark labeling , 2013, SIGMOD '13.

[11]  Po-Wei Chan,et al.  PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks , 2017, KDD.

[12]  Haixun Wang,et al.  Semantic queries by example , 2013, EDBT '13.

[13]  Ke Wang,et al.  Entity Set Expansion via Knowledge Graphs , 2017, SIGIR.

[14]  Kevin Chen-Chuan Chang,et al.  Distance-Aware DAG Embedding for Proximity Search on Heterogeneous Graphs , 2018, AAAI.

[15]  Xiang Li,et al.  Meta Structure: Computing Relevance in Large Heterogeneous Information Networks , 2016, KDD.

[16]  Philip S. Yu,et al.  Relevance search in heterogeneous networks , 2012, EDBT '12.

[17]  Ni Lao,et al.  Relational retrieval using a combination of path-constrained random walks , 2010, Machine Learning.

[18]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[19]  Vagelis Hristidis,et al.  ObjectRank: Authority-Based Keyword Search in Databases , 2004, VLDB.

[20]  Kevin Chen-Chuan Chang,et al.  Semantic Proximity Search on Heterogeneous Graph by Proximity Embedding , 2017, AAAI.

[21]  Bin Wu,et al.  Entity Set Expansion with Meta Path in Knowledge Graph , 2017, PAKDD.

[22]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[23]  Yizhou Sun,et al.  User guided entity similarity search using meta-path selection in heterogeneous information networks , 2012, CIKM.

[24]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[25]  Ji-Rong Wen,et al.  Discovering Neighborhood Pattern Queries by sample answers in knowledge base , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).