Entity set expansion in knowledge graph: a heterogeneous information network perspective

Entity set expansion (ESE) aims to expand an entity seed set to obtain more entities which have common properties. ESE is important for many applications such as dictionary construction and query suggestion. Traditional ESE methods relied heavily on the text and Web information of entities. Recently, some ESE methods employed knowledge graphs (KGs) to extend entities. However, they failed to effectively and efficiently utilize the rich semantics contained in a KG and ignored the text information of entities in Wikipedia. In this paper, we model a KG as a heterogeneous information network (HIN) containing multiple types of objects and relations. Fine-grained multi-type meta paths are proposed to capture the hidden relation among seed entities in a KG and thus to retrieve candidate entities. Then we rank the entities according to the meta path based structural similarity. Furthermore, to utilize the text description of entities in Wikipedia, we propose an extended model CoMeSE++ which combines both structural information revealed by a KG and text information in Wikipedia for ESE. Extensive experiments on real-world datasets demonstrate that our model achieves better performance by combining structural and textual information of entities.

[1]  Zhenyu Qi,et al.  Choosing Better Seeds for Entity Set Expansion by Leveraging Wikipedia Semantic Knowledge , 2012, CCPR.

[2]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[3]  Philip S. Yu,et al.  Integrating meta-path selection with user-guided object clustering in heterogeneous information networks , 2012, KDD.

[4]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[5]  Lidong Bing,et al.  Wikipedia entity expansion and attribute extraction from the web using semi-supervised learning , 2013, WSDM.

[6]  William W. Cohen,et al.  Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods , 2004, KDD.

[7]  Jiawei Han,et al.  SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble , 2017, ECML/PKDD.

[8]  William W. Cohen,et al.  Iterative Set Expansion of Named Entities Using the Web , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[9]  Sameep Mehta,et al.  Select, Link and Rank: Diversified Query Expansion and Entity Ranking Using Wikipedia , 2016, WISE.

[10]  Zhao Jun A Novel Entity Set Expansion Method Leveraging Entity Semantic Knowledge , 2013 .

[11]  Marcin Sydow,et al.  Aspect-Based Similar Entity Search in Semantic Knowledge Graphs with Diversity-Awareness and Relaxation , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[12]  Patrick Pantel,et al.  Discovering word senses from text , 2002, KDD.

[13]  Dan Roth,et al.  Learning from Negative Examples in Set-Expansion , 2011, 2011 IEEE 11th International Conference on Data Mining.

[14]  Ni Lao,et al.  Relational retrieval using a combination of path-constrained random walks , 2010, Machine Learning.

[15]  Stefan Dietze,et al.  Improving Entity Retrieval on Structured Data , 2015, SEMWEB.

[16]  See-Kiong Ng,et al.  Distributional Similarity vs. PU Learning for Entity Set Expansion , 2010, ACL.

[17]  Philip S. Yu,et al.  PathSim , 2011 .

[18]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[19]  Yeye He,et al.  SEISA: set expansion by iterative similarity aggregation , 2011, WWW.

[20]  Charles Elkan,et al.  Learning classifiers from only positive and unlabeled data , 2008, KDD.

[21]  Marcin Sydow,et al.  QBEES: query by entity examples , 2013, CIKM.

[22]  Xiaoli Li,et al.  A Heterogeneous Information Network Method for Entity Set Expansion in Knowledge Graph , 2018, PAKDD.

[23]  Kugatsu Sadamitsu,et al.  Entity Set Expansion using Topic information , 2011, ACL.

[24]  Xiaohuan Cao,et al.  A Meta Path Based Method for Entity Set Expansion in Knowledge Graph , 2022, IEEE Transactions on Big Data.

[25]  Xianpei Han,et al.  A Probabilistic Co-Bootstrapping Method for Entity Set Expansion , 2014, COLING.

[26]  Valentin Jijkoun,et al.  "More like these": growing entity classes from seeds , 2007, CIKM '07.

[27]  Philip S. Yu,et al.  PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks , 2013, TKDD.

[28]  Xiaoyong Du,et al.  Leveraging Fine-Grained Wikipedia Categories for Entity Search , 2018, WWW.

[29]  Jaiwei Han Mining heterogeneous information networks: the next frontier , 2012, KDD.

[30]  Yizhou Sun,et al.  User guided entity similarity search using meta-path selection in heterogeneous information networks , 2012, CIKM.

[31]  Gang Wang,et al.  Understanding user's query intent with wikipedia , 2009, WWW '09.

[32]  Xianpei Han,et al.  A Joint Model for Entity Set Expansion and Attribute Extraction from Web Search Queries , 2016, AAAI.

[33]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[34]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[35]  Philip S. Yu,et al.  HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[36]  William W. Cohen,et al.  Language-Independent Set Expansion of Named Entities Using the Web , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[37]  Jiawei Han,et al.  KnowSim: A Document Similarity Measure on Structured Heterogeneous Information Networks , 2015, 2015 IEEE International Conference on Data Mining.

[38]  Bin Wu,et al.  Entity Set Expansion with Meta Path in Knowledge Graph , 2017, PAKDD.

[39]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.