Clinical data analysis using ontology-guided rule learning

Currently, research in Machine Learning (ML) mainly focuses on the ability to process very large amounts of data and build accurate models. Problems related to complexity, heterogeneity, and semantics of healthcare data are often out of the main focus. Healthcare is particularly rich in background knowledge. Surprisingly, few ML methods used in healthcare can handle these sources of background knowledge, and instead treat healthcare data as a set of numbers without particular meaning. This paper explores an approach that can fill in this gap. A medical ontology (i.e., UMLS) is proposed to provide background knowledge for the ML method to understand healthcare data. The ontology-guided ML-based rule induction method is described and illustrated to analyze the clinical data supplemented with an ontology-based background knowledge.

[1]  K. Lavanya,et al.  Healthcare Information Using Machine Learning Approach , 2012 .

[2]  Janusz Wojtusiak,et al.  Using Published Medical Results and Non-homogenous Data in Rule Learning , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[3]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[4]  Kenneth A. Kaufman,et al.  Learning Patterns in Noisy Data: The AQ Approach , 2001, Machine Learning and Its Applications.

[5]  Ryszard S. Michalski,et al.  Semantic and Syntactic Attribute Types in AQ Learning , 2007 .

[6]  Thomas R. Gruber,et al.  A Translation Approach to Portable Ontologies , 1993 .

[7]  R. Michalski Attributional Calculus: A Logic and Representation Language for Natural Induction , 2004 .

[8]  Jesualdo Tomás Fernández-Breis,et al.  Integrating Ripple Down Rules with Ontologies in an Oncology Domain , 2001, AIME.

[9]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[10]  Kenneth A. Kaufman,et al.  A Method for Reasoning with Structured and Continuous Attributes in the INLEN-2 Multistrategy Knowledge Discovery System , 1996, KDD.

[11]  Kenneth A. Kaufman,et al.  The AQ21 Natural Induction Program for Pattern Discovery: Initial Version and its Novel Features , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[12]  Mark S. Tuttle,et al.  NCI Thesaurus: Using Science-Based Terminology to Integrate Cancer Research Results , 2004, MedInfo.

[13]  O Bodenreider,et al.  Biomedical ontologies in action: role in knowledge management, data integration and decision support. , 2008, Yearbook of medical informatics.

[14]  Larry Wright,et al.  Overview and Utilization of the NCI Thesaurus , 2004, Comparative and functional genomics.