Knowledge Fragment Enrichment Using Domain Knowledge Base

Knowledge fragment enrichment aims to complete a user-supplied concept fragment by augmenting each concept with rich domain information. The problem has been widely studied in cognitive science but has not been intensively investigated in computer science. In this paper, we formally define the problem of knowledge fragment enrichment in a domain knowledge base and develop a probabilistic graphical model to tackle it. The proposed model captures both the dependencies among concepts in the input knowledge fragment and the probabilistic relationships between concepts and domain entities. We empirically evaluate the model on two datasets of different genres: PubMed and NSFC. On both datasets, the proposed model significantly improves the accuracy of the label prediction task, by 3–9% in terms of MAP, compared with several alternative enrichment methods.
