Ontology - Supported Machine Learning and Decision Support in Biomedicine

Nowadays, ontologies and machine learning constitute two major technologies for domain-specific knowledge extraction which are actively used in knowledge-based systems of different kind including expert systems, decision support systems, knowledge discovery systems, etc. While the aim of these two technologies is the same - the extraction of useful knowledge - little is known about how the two sources of knowledge can be successfully integrated. Today the two technologies are used mainly separate; even though the knowledge extracted by the two is complementary and significant benefits can be obtained if the technologies were integrated. This problem is especially important for biomedicine where relevant data are often naturally complex having large dimensionality and including heterogeneous features, and where a large body of knowledge is available in the form of ontologies. In this paper we propose one approach for improving the performance of machine learning algorithms by integrating the knowledge provided by ontologies. The basic idea is to redefine the concept of similarity for complex heterogeneous data by incorporating available ontological knowledge, creating a bridge between the two technologies. Potential benefits and difficulties of this integration are discussed, two techniques for empirical evaluation and fine-tuning of feature ontologies are described, and an example from the field of paediatric cardiology is given.

[1]  Kent A. Spackman,et al.  SNOMED clinical terms: overview of the development process and project status , 2001, AMIA.

[2]  Chris F. Taylor,et al.  The MGED Ontology: a resource for semantics-based description of microarray experiments , 2006, Bioinform..

[3]  Olivier Bodenreider,et al.  Incorporating ontology-driven similarity knowledge into functional genomics: an exploratory study , 2004, Proceedings. Fourth IEEE Symposium on Bioinformatics and Bioengineering.

[4]  Rolf Drechsler,et al.  Applications of Evolutionary Computing, EvoWorkshops 2008: EvoCOMNET, EvoFIN, EvoHOT, EvoIASP, EvoMUSART, EvoNUM, EvoSTOC, and EvoTransLog, Naples, Italy, March 26-28, 2008. Proceedings , 2008, EvoWorkshops.

[5]  Christine Golbreich,et al.  The Foundational Model of Anatomy in OWL: Experience and Perspectives , 2006, OWLED.

[6]  Jeffrey M. Bradshaw,et al.  Applying KAoS Services to Ensure Policy Compliance for Semantic Web Services Workflow Composition and Enactment , 2004, SEMWEB.

[7]  Gustavo Camps-Valls,et al.  Composite kernels for hyperspectral image classification , 2006, IEEE Geoscience and Remote Sensing Letters.

[8]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[9]  Betsy L. Humphreys,et al.  Relationships in Medical Subject Headings (MeSH) , 2001 .

[10]  J. Blake,et al.  Creating the Gene Ontology Resource : Design and Implementation The Gene Ontology Consortium 2 , 2001 .

[11]  U. Salzer-Muhar,et al.  Predictors of Spontaneous Closure of Isolated Secundum Atrial Septal Defect in Children: A Longitudinal Study , 2006, Pediatrics.

[12]  Zahir Tari,et al.  On The Move to Meaningful Internet Systems 2003: OTM 2003 Workshops , 2003, Lecture Notes in Computer Science.

[13]  Nikola Kasabov,et al.  Prediction of clinical behaviour and treatment for cancers. , 2003, Applied bioinformatics.

[14]  Gary D. Bader,et al.  BioPAX - Biological Pathways Exchange Language Level 2, Version 1.0 Documentation , 2005 .

[15]  Hans-Peter Schnurr,et al.  SemanticMiner - Ontology-Based Knowledge Retrieval , 2003, J. Univers. Comput. Sci..

[16]  Pearl Pu,et al.  Searching with Semantics: An Interactive Visualization Technique for Exploring an Annotated Image Collection , 2003, OTM Workshops.

[17]  Lina Fatima Soualmia,et al.  Representing the MeSH in OWL: Towards a semi-automatic migration , 2004, KR-MED.

[18]  George Hripcsak,et al.  Inter-patient distance metrics using SNOMED CT defining relationships , 2006, J. Biomed. Informatics.

[19]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[20]  Yannis Kalfoglou,et al.  Ontology mapping: the state of the art , 2003, The Knowledge Engineering Review.

[21]  Andrew McCallum,et al.  Distributional clustering of words for text classification , 1998, SIGIR '98.

[22]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[23]  Guus Schreiber,et al.  The Semantic Web – ISWC 2004 , 2004, Lecture Notes in Computer Science.

[24]  Carol A. Bean,et al.  Relationships in the Organization of Knowledge , 2001, Information Science and Knowledge Management.

[25]  Vladimir A. Oleshchuk,et al.  Ontology based semantic similarity comparison of documents , 2003, 14th International Workshop on Database and Expert Systems Applications, 2003. Proceedings..

[26]  Ralph Bergmann,et al.  DOI: 10.1017/S000000000000000 Printed in the United Kingdom Representation in case-based reasoning , 2022 .

[27]  Alon Y. Halevy,et al.  Data integration and genomic medicine , 2007, J. Biomed. Informatics.

[28]  Francisco Azuaje,et al.  Incorporating Biological Domain Knowledge into Cluster Validity Assessment , 2006, EvoWorkshops.

[29]  Gary D. Bader,et al.  BioPAX – biological pathway data exchange format , 2006 .