The Revieval of Subject Analysis: A Knowledge-based Approach facilitating Semantic Search

Semantic Search emerged as the new system paradigm in enterprise information systems. However, usually only small amounts of textual enterprise data is semantically prepared for such systems. The manual semantification of these resources typically is a time-consuming process. The automatic semantification requires deep knowledge in Natural Language Processing. Therefore, in this paper we present a novel approach that makes the underlying Subject Indexing task rather a Knowledge Engineering than a Natural Language Processing task. The approach is based on a simple but powerful and intuitive probabilistic model that allows for the easy integration of expert knowledge.

[1]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[4]  Jeffrey C. Reynar Statistical Models for Topic Segmentation , 1999, ACL.

[5]  T. Padma,et al.  Knowledge based decision support system to assist work-related risk analysis in musculoskeletal disorder , 2009, Knowl. Based Syst..

[6]  Joseba Quevedo,et al.  TIGER: Knowledge Based Gas Turbine Condition Monitoring , 1996, AI Commun..

[7]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[8]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[9]  W. J. Hutchins The concept of “aboutness” in subject indexing , 1997 .

[10]  Sunita Sarawagi,et al.  Automatic segmentation of text into structured records , 2001, SIGMOD '01.

[11]  Frank Puppe,et al.  Clinical Experiences with a Knowledge-Based System in Sonography (SonoConsult) , 2005, Wissensmanagement.

[12]  Eero Hyvönen,et al.  Semantic Autocompletion , 2006, ASWC.

[13]  Hanne Albrechtsen,et al.  Subject analysis and indexing: from automated indexing to domain analysis , 1993, The Indexer: The International Journal of Indexing: Volume 18, Issue 4.

[14]  Joachim Baumeister,et al.  Semantification of Large Corpora of Technical Documentation , 2016 .

[15]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[16]  Susie Stephens,et al.  The Enterprise Semantic Web , 2007, The Semantic Web: Real-World Applications from Industry.

[17]  Ramanathan V. Guha,et al.  Semantic search , 2003, WWW '03.