Data driven knowledge acquisition method for domain knowledge enrichment in the healthcare

Semantic computing technologies have matured to be applicable to many critical domains, such as life sciences and health care. However, the key to their success is the rich domain knowledge which consists of domain concepts and relationships, whose creation and refinement remains a challenge. In this paper, we develop a technique for enriching domain knowledge, focusing on populating the domain relationships. We determine missing relationships between the domain concepts by validating domain knowledge against real world data sources. We evaluate our approach in the healthcare domain using Electronic Medical Record(EMR) data, and demonstrate that semantic techniques can be used to semi-automate labour intensive tasks without sacrificing fidelity of domain knowledge.

[1]  Vojtech Svátek,et al.  Discovery of Lexical Entries for Non-taxonomic Relations in Ontology Learning , 2004, SOFSEM.

[2]  Vipul Kashyap,et al.  Relationships at the Heart of Semantic Web: Modeling, Discovering, and Exploiting Complex Semantic Relationships , 2004 .

[3]  Nicholas R. Jennings,et al.  An On-Line Algorithm for Semantic Forgetting , 2011, IJCAI.

[4]  Paul Buitelaar,et al.  RelExt: A Tool for Relation Extraction from Text in Ontology Extension , 2005, SEMWEB.

[5]  Amit P. Sheth,et al.  The SSN ontology of the W3C semantic sensor network incubator group , 2012, J. Web Semant..

[6]  Amit P. Sheth,et al.  Active Semantic Electronic Medical Records , 2007, The Semantic Web: Real-World Applications from Industry.

[7]  Peter W. Eklund,et al.  A Framework for Semantic Interoperability in Healthcare: A Service Oriented Architecture based on Health Informatics Standards , 2008, MIE.

[8]  Sue E Bowman Coordination of SNOMED-CT and ICD-10: Getting the Most out of Electronic Health Record Systems , 2005 .

[9]  Amit P. Sheth,et al.  Demonstration: SECURE - Semantics Empowered ResCUe Environment , 2011, SSN.

[10]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[11]  Oscar Corcho,et al.  Semantic Sensor Network XG Final Report , 2011 .

[12]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[13]  Daniel S. Weld,et al.  Automatically refining the wikipedia infobox ontology , 2008, WWW.

[14]  Chris Pierce Semantic Web Use Cases and Case Studies Case Study : A Semantic Web Content Repository for Clinical Research , 2007 .

[15]  Dominic Widdows,et al.  Using LSA and Noun Coordination Information to Improve the Recall and Precision of Automatic Hyponymy Extraction , 2003, CoNLL.

[16]  Dunja Mladenic,et al.  Ontology Extension Towards Analysis of Business News , 2010, Informatica.

[17]  Fouad Zablith,et al.  Evolva: A Comprehensive Approach to Ontology Evolution , 2009, ESWC.

[18]  Aldo Gangemi,et al.  Unsupervised Learning of Semantic Relations between Concepts of a Molecular Biology Ontology , 2005, IJCAI.

[19]  David Faure,et al.  A corpus-based conceptual clustering method for verb frames and ontology , 1998 .

[20]  F. Jones,et al.  International Classification of Diseases , 1978 .

[21]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[22]  Enrico Motta,et al.  Exploring the Semantic Web as Background Knowledge for Ontology Matching , 2008, J. Data Semant..

[23]  Barbara Rosario,et al.  Classifying Semantic Relations in Bioscience Texts , 2004, ACL.

[24]  Ronald Cornet,et al.  SNOMED CT’s Ontological Commitment , 2009 .

[25]  Amit P. Sheth,et al.  An ontological approach to focusing attention and enhancing machine perception on the Web , 2011, Appl. Ontology.

[26]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.