A Biomedical Question Answering System in BioASQ 2017

Question answering, the identification of short accurate answers to users questions, is a longstanding challenge widely studied over the last decades in the open domain. However, it still requires further efforts in the biomedical domain. In this paper, we describe our participation in phase B of task 5b in the 2017 BioASQ challenge using our biomedical question answering system. Our system, dealing with four types of questions (i.e., yes/no, factoid, list, and summary), is based on (1) a dictionary-based approach for generating the exact answers of yes/no questions, (2) UMLS metathesaurus and term frequency metric for extracting the exact answers of factoid and list questions, and (3) the BM25 model and UMLS concepts for retrieving the ideal answers (i.e., paragraph-sized summaries). Preliminary results show that our system achieves good and competitive results in both exact and ideal answers extraction tasks as compared with the participating systems.

[1]  Kerstin Denecke,et al.  Structuring Legacy Pathology Reports by openEHR Archetypes to Enable Semantic Querying , 2017, Methods of Information in Medicine.

[2]  Jimmy J. Lin,et al.  Omnibase: Uniform Access to Heterogeneous Data for Question Answering , 2002, NLDB.

[3]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[4]  Hyoil Han,et al.  Biomedical question answering: A survey , 2010, Comput. Methods Programs Biomed..

[5]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[6]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[7]  Ioannis A. Kakadiaris,et al.  Results of the 4th edition of BioASQ Challenge , 2016 .

[8]  Abdelmonaime Lachkar,et al.  A new and efficient method based on syntactic dependency relations features for ad hoc clinical question classification , 2017, Int. J. Bioinform. Res. Appl..

[9]  Yan Li,et al.  A generic retrieval system for biomedical literatures: USTB at BioASQ2015 Question Answering Task , 2015, CLEF.

[10]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[11]  Said Ouatik El Alaoui,et al.  A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering , 2017, J. Biomed. Informatics.

[12]  Guilherme Del Fiol,et al.  Generating disease-pertinent treatment vocabularies from MEDLINE citations , 2017, J. Biomed. Informatics.

[13]  Eric Nyberg,et al.  Learning to Answer Biomedical Questions: OAQA at BioASQ 4B , 2016 .

[14]  Jungyun Seo,et al.  KSAnswer: Question-answering System of Kangwon National University and Sogang University in the 2016 BioASQ Challenge , 2016 .

[15]  Manoj Kumar Chinnakotla,et al.  IIITH at BioASQ Challange 2015 Task 3b: Bio-Medical Question Answering System , 2015, CLEF.

[16]  Georgios Balikas,et al.  An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition , 2015, BMC Bioinformatics.

[17]  Mariana L. Neves,et al.  HPI Question Answering System in BioASQ 2016 , 2016 .

[18]  Ulf Leser,et al.  Question answering for biology. , 2015, Methods.

[19]  Bruce Elliott,et al.  Baseball , 2003 .

[20]  Said Ouatik El Alaoui,et al.  A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering , 2017, Methods of Information in Medicine.

[21]  Pierre Zweigenbaum,et al.  MEANS: A medical question-answering system combining NLP techniques and semantic Web technologies , 2015, Inf. Process. Manag..