Une approche hybride pour la détection automatique des relations sémantiques entre entités médicales

In this paper we tackle semantic relationships extraction from medical texts. We focus on the relations that may occur between Diseases and Treatments. We propose an approach relying on two different techniques to extract the target relations: (i) relation patterns based on human expertise and (ii) machine learning based on SVM classification. This approach takes advantage of the two techniques, relying more on manual patterns when few relation samples are available and more on feature values when sufficient examples are available. Experimentations show that our approach obtains an overall 94.07% F-measure for the extraction of cure, prevent and side effect relations.

[1]  Barbara Rosario,et al.  Classifying Semantic Relations in Bioscience Texts , 2004, ACL.

[2]  B J Stapley,et al.  Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[3]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[4]  M O Hotvedt,et al.  Continuing medical education: actually learning rather than simply listening. , 1996, JAMA.

[5]  Gerhard Weikum,et al.  Combining linguistic and statistical analysis to extract relations from web documents , 2006, KDD '06.

[6]  Mehdi Embarek,et al.  Learning Patterns for Building Resources about Semantic Relations in the Medical Domain , 2008, LREC.

[7]  Ting Wang,et al.  Automatic Extraction of Hierarchical Relations from Text , 2006, ESWC.

[8]  Pierre Zweigenbaum,et al.  Automatic extraction of semantic relations between medical entities: a rule based approach , 2011, J. Biomed. Semant..

[9]  Angus Roberts,et al.  Extracting Clinical Relationships from Patient Narratives , 2008, BioNLP.

[10]  Christopher S. G. Khoo,et al.  Automatic identification of treatment relations for medical ontology learning : an exploratory study , 2004 .

[11]  Clement J. McDonald,et al.  Extracting Structured Information from Free Text Pathology Reports , 2003, AMIA.

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[14]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[15]  Pierre Zweigenbaum,et al.  Automatic Extraction of semantic relations between medical entities: Application to the treatment relation , 2010, Semantic Mining in Biomedicine.

[16]  Jian Su,et al.  Protein-Protein Interaction Extraction: A Supervised Learning Approach} , 2005 .

[17]  Sophia Ananiadou,et al.  BioNLP: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing , 2012 .

[18]  R Engelbrecht [Expert systems for medicine--functions and developments]. , 1997, Zentralblatt fur Gynakologie.

[19]  Diana Inkpen,et al.  Extraction of Disease-Treatment Semantic Relations from Biomedical Sentences , 2010, BioNLP@ACL.

[20]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[21]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[22]  J. Cimino,et al.  Automatic knowledge acquisition from MEDLINE. , 1993, Methods of information in medicine.

[23]  Charles Sneiderman,et al.  Argument identification for arterial branching predications asserted in cardiac catheterization reports , 2000, AMIA.