Bridging the Gap Between Consumers' Medication Questions and Trusted Answers

This paper addresses the task of answering consumer health questions about medications. To better understand the challenge and needs in terms of methods and resources, we first introduce a gold standard corpus for Medication Question Answering created using real consumer questions. The gold standard (https://github.com/abachaa/Medication_QA_MedInfo2019) consists of six hundred and seventy-four question-answer pairs with annotations of the question focus and type and the answer source. We first present the manual annotation and answering process. In the second part of this paper, we test the performance of recurrent and convolutional neural networks in question type identification and focus recognition. Finally, we discuss the research insights from both the dataset creation process and our experiments. This study provides new resources and experiments on answering consumers' medication questions and discusses the limitations and directions for future research efforts.

[1]  Pierre Zweigenbaum,et al.  Towards a Medical Question-Answering System: a Feasibility Study , 2003, MIE.

[2]  Hyoil Han,et al.  Biomedical question answering: A survey , 2010, Comput. Methods Programs Biomed..

[3]  George Hripcsak,et al.  Detection of drug‐drug interactions through data mining studies using clinical sources, scientific literature and social media , 2018, Briefings Bioinform..

[4]  Pierre Zweigenbaum,et al.  Automatic classification of doctor-patient questions for a virtual patient record query task , 2017, BioNLP.

[5]  Ryen W. White,et al.  Experiences with Web Search on Medical Concerns and Self Diagnosis , 2009, AMIA.

[6]  Asma Ben Abacha,et al.  Recognizing Question Entailment for Medical Question Answering , 2016, AMIA.

[7]  Halil Kilicoglu,et al.  Semantic annotation of consumer health questions , 2018, BMC Bioinformatics.

[8]  Diego Mollá Aliod,et al.  Question Answering in Restricted Domains: An Overview , 2007, CL.

[9]  Isabelle Stanton,et al.  Circumlocution in diagnostic medical queries , 2014, SIGIR.

[10]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[11]  Asma Ben Abacha,et al.  A question-entailment approach to question answering , 2019, BMC Bioinformatics.

[12]  David Schlangen,et al.  Resolving Underspecification using Discourse Information , 2001 .

[13]  Graeme Hirst,et al.  Analysis of Semantic Classes in Medical Text for Question Answering , 2004 .

[14]  Halil Kilicoglu,et al.  A protocol‐driven approach to automatically finding authoritative answers to consumer health questions in online resources , 2017, J. Assoc. Inf. Sci. Technol..

[15]  James Jungho Pak,et al.  2 , 2009, NEMS.

[16]  Alla Keselman,et al.  Making Texts in Electronic Health Records Comprehensible to Consumers: A Prototype Translator , 2007, AMIA.

[17]  Maria Kvist,et al.  Medical text simplification using synonym replacement: Adapting assessment of word difficulty to a compounding language , 2014, PITR@EACL.

[18]  Pierre Zweigenbaum,et al.  MEANS: A medical question-answering system combining NLP techniques and semantic Web technologies , 2015, Inf. Process. Manag..

[19]  Asma Ben Abacha,et al.  On the Role of Question Summarization and Information Source Restriction in Consumer Health Question Answering. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[20]  Halil Kilicoglu,et al.  Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions , 2016, AMIA.

[21]  Halil Kilicoglu,et al.  Automatically Classifying Question Types for Consumer Health Questions , 2014, AMIA.

[22]  Dina Demner-Fushman,et al.  MetaMap Lite: an evaluation of a new Java implementation of MetaMap , 2017, J. Am. Medical Informatics Assoc..

[23]  W. Bruce Croft,et al.  Finding similar questions in large question and answer archives , 2005, CIKM '05.