Ontology-based approach to enhance medical web information extraction

The purpose of this study is to propose a framework for extracting medical information from the Web using domain ontologies. Patient–Doctor conversations have become prevalent on the Web. For instance, solutions like HealthTap or AskTheDoctors allow patients to ask doctors health-related questions. However, most online health-care consumers still struggle to express their questions efficiently due mainly to the expert/layman language and knowledge discrepancy. Extracting information from these layman descriptions, which typically lack expert terminology, is challenging. This hinders the efficiency of the underlying applications such as information retrieval. Herein, an ontology-driven approach is proposed, which aims at extracting information from such sparse descriptions using a meta-model.,A meta-model is designed to bridge the gap between the vocabulary of the medical experts and the consumers of the health services. The meta-model is mapped with SNOMED-CT to access the comprehensive medical vocabulary, as well as with WordNet to improve the coverage of layman terms during information extraction. To assess the potential of the approach, an information extraction prototype based on syntactical patterns is implemented.,The evaluation of the approach on the gold standard corpus defined in Task1 of ShARe CLEF 2013 showed promising results, an F-score of 0.79 for recognizing medical concepts in real-life medical documents.,The originality of the proposed approach lies in the way information is extracted. The context defined through a meta-model proved to be efficient for the task of information extraction, especially from layman descriptions.

[1]  Kaija Saranto,et al.  Definition, structure, content, use and impacts of electronic health records: A review of the research literature , 2008, Int. J. Medical Informatics.

[2]  A. Hoerbst,et al.  Electronic Health Records , 2010, Methods of Information in Medicine.

[3]  Noémie Elhadad,et al.  Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts , 2013, J. Biomed. Informatics.

[4]  C. V. Tellingen About hearsay - or reappraisal of the role of the anamnesis as an instrument of meaningful communication , 2007 .

[5]  Qing Zeng-Treitler,et al.  Exploring and developing consumer health vocabularies. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[6]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[7]  Nancy Longnecker,et al.  Doctor-patient communication: a review. , 2010, The Ochsner journal.

[8]  Pierre Zweigenbaum,et al.  MEANS: A medical question-answering system combining NLP techniques and semantic Web technologies , 2015, Inf. Process. Manag..

[9]  Christine Bauer,et al.  Considering context in the design of intelligent systems: Current practices and suggestions for improvement , 2016, J. Syst. Softw..

[10]  Ján Antolík Automatic Annotation of Medical Records , 2005, MIE.

[11]  M. Ebell,et al.  Analysis of questions asked by family doctors regarding patient care , 1999, BMJ.

[12]  W. Alkema,et al.  Application of text mining in the biomedical domain. , 2015, Methods.

[13]  Li Yujian,et al.  A Normalized Levenshtein Distance Metric , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Yue Gao,et al.  Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information , 2013, IEEE Transactions on Multimedia.

[15]  Sanna Salanterä,et al.  Overview of the ShARe/CLEF eHealth Evaluation Lab 2013 , 2013, CLEF.

[16]  Qinglin Guo,et al.  Question Answering System Based on Ontology and Semantic Web , 2008, RSKT.

[17]  Farhad Ameri,et al.  An ontological approach to engineering requirement representation and analysis , 2016, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[18]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[19]  Yi-Liang Zhao,et al.  Bridging the Vocabulary Gap between Health Seekers and Healthcare Knowledge , 2015, IEEE Transactions on Knowledge and Data Engineering.

[20]  C. Teutsch Patient-doctor communication. , 2003, The Medical clinics of North America.

[21]  Hong Yu,et al.  Methods for Linking EHR Notes to Education Materials , 2015, Information Retrieval Journal.

[22]  Girish Chavan,et al.  NOBLE – Flexible concept recognition for large-scale biomedical natural language processing , 2016, BMC Bioinformatics.

[23]  Meng Wang,et al.  Disease Inference from Health-Related Questions via Sparse Deep Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[24]  Raymond S. T. Lee,et al.  Computational Knowledge and Ontology , 2011 .

[25]  Pierre Zweigenbaum,et al.  Towards a Medical Question-Answering System: a Feasibility Study , 2003, MIE.

[26]  Ulrich Beez,et al.  Semantic AutoSuggest for Electronic Health Records , 2015, 2015 International Conference on Computational Science and Computational Intelligence (CSCI).

[27]  Natalia Grabar,et al.  Automatic Extraction of Layman Names for Technical Medical Terms , 2014, 2014 IEEE International Conference on Healthcare Informatics.

[28]  Jun Han,et al.  An ontological framework for situation-aware access control of software services , 2015, Inf. Syst..

[29]  Jun Han,et al.  OntCAAC: An Ontology-Based Approach to Context-Aware Access Control for Software Services , 2015, Comput. J..

[30]  Christiane Fellbaum,et al.  Medical WordNet: A New Methodology for the Construction and Validation of Information Resources for Consumer Health , 2004, COLING.