Extracting Meaningful Correlations among Heterogeneous Datasets for Medical Question Answering with Domain Knowledge

A question answering system (QAS) merely built on a predefined medical knowledge base experiences difficulties in providing suitable answers for expert users to make medical and healthcare decisions. This study proposes a comprehensive method of extracting meaningful correlations among heterogeneous datasets using a semantic analysis with domain knowledge and accordingly provide flexible answers to decision support (ATDS) in a medical QAS (MQAS). First, the potential value of the heterogeneous datasets from medical information systems is examined for building ATDS. Second, an extraction algorithm for constructing a term relational network from the questions is proposed. Then, a correlation construction method for integrating the datasets into the MQAS using domain knowledge is proposed. Finally, a novel algorithm for constructing ATDS on the basis of questions and datasets is established. Experimental results indicate that utilizing external medical domain knowledge in analyzing correlations among the datasets outperforms existing algorithms that only involved with the datasets.

[1]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[2]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[3]  Lin Padgham,et al.  Contextual question answering for the health domain , 2012, J. Assoc. Inf. Sci. Technol..

[4]  Sören Auer,et al.  SINA: Semantic interpretation of user queries for question answering on interlinked data , 2015, J. Web Semant..

[5]  Yu Hao,et al.  A Question Answering System Built on Domain Knowledge Base , 2015, WAIM.

[6]  Kemele M. Endris,et al.  Question Answering on Linked Data: Challenges and Future Directions , 2016, WWW.

[7]  Vanessa López,et al.  Core techniques of question answering systems over knowledge bases: a survey , 2017, Knowledge and Information Systems.

[8]  Hong Yu,et al.  AskHERMES: An online question answering system for complex clinical questions , 2011, J. Biomed. Informatics.

[9]  Kuldeep Singh,et al.  Capturing Knowledge in Semantically-typed Relational Patterns to Enhance Relation Linking , 2017, K-CAP.

[10]  Hanspeter Pfister,et al.  Domino: Extracting, Comparing, and Manipulating Subsets Across Multiple Tabular Datasets , 2014, IEEE Transactions on Visualization and Computer Graphics.

[11]  Jens Lehmann,et al.  Towards an open question answering architecture , 2014, SEM '14.

[12]  Sanda M. Harabagiu,et al.  Medical Question Answering for Clinical Decision Support , 2016, CIKM.

[13]  Dejan Dinevski,et al.  Biomedical question answering using semantic relations , 2015, BMC Bioinformatics.

[14]  Said Ouatik El Alaoui,et al.  A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering , 2017, J. Biomed. Informatics.

[15]  Lei Zhang,et al.  CliniQA : Highly Reliable Clinical Question Answering System , 2012, MIE.

[16]  Suraya Hamid,et al.  Accuracy evaluation of methods and techniques in Web-based question answering systems: a survey , 2018, Knowledge and Information Systems.

[17]  Jessica A. Turner,et al.  COINS: An Innovative Informatics and Neuroimaging Tool Suite Built for Large Heterogeneous Datasets , 2011, Front. Neuroinform..

[18]  Sanda M. Harabagiu,et al.  Knowledge Representations and Inference Techniques for Medical Question Answering , 2017, ACM Trans. Intell. Syst. Technol..

[19]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.