WS4A: a Biomedical Question and Answering System based on public Web Services and Ontologies

This paper describes our system, dubbed WS4A (Web Services for All), that participated in the fourth edition of the BioASQ challenge (2016). We used WS4A to perform the Question and Answering (QA) task 4b, which consisted on the retrieval of relevant concepts, documents, snippets, RDF triples, exact answers and ideal answers for each given question. The novelty in our approach consists on the maximum exploitation of existing web services in each step of WS4A, such as the annotation of text, and the retrieval of metadata for each annotation. The information retrieved included concept identifiers, ontologies, ancestors, and most importantly, PubMed identifiers. The paper describes the WS4A pipeline and also presents the precision, recall and f-measure values obtained in task 4b. Our system achieved two second places in two subtasks on one of the five batches.

[1]  Petr Baudi Biomedical Question Answering using the YodaQA System: Prototype Notes , 2015 .

[2]  Manoj Kumar Chinnakotla,et al.  IIITH at BioASQ Challange 2015 Task 3b: Bio-Medical Question Answering System , 2015, CLEF.

[3]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[4]  Helena Sofia Pinto,et al.  The Next Generation of Similarity Measures that Fully Explore the Semantics in Biomedical Ontologies , 2013, J. Bioinform. Comput. Biol..

[5]  Ioannis A. Kakadiaris,et al.  Results of the BioASQ Tasks of the Question Answering Lab at CLEF 2015 , 2015, CLEF.

[6]  Chi Zhang,et al.  Learning to Answer Biomedical Factoid & List Questions: OAQA at BioASQ 3B , 2015, CLEF.

[7]  David Wheeler,et al.  Building Customized Data Pipelines Using the Entrez Programming Utilities (eUtils) , 2004 .

[8]  Dietrich Rebholz-Schuhmann,et al.  Text processing through Web services: calling Whatizit , 2008, Bioinform..

[9]  Brigitte Grau,et al.  LIMSI-CNRS@CLEF 2015: Tree Edit Beam Search for Multiple Choice Question Answering , 2015, CLEF.

[10]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[11]  Natalya F. Noy,et al.  BioPortal: Ontologies and Integrated Data Resources at the Click of a Mouse , 2009 .

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  Björn Rudzewitz,et al.  CoMiC: Exploring Text Segmentation and Similarity in the English Entrance Exams Task , 2015, CLEF.

[14]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[15]  W. O. Berry,et al.  Preface , 1988, Brain Research Bulletin.

[16]  Petr Baudis,et al.  Biomedical Question Answering using the YodaQA System: Prototype Notes , 2015, CLEF.

[17]  María Martín,et al.  UniProt: A hub for protein information , 2015 .