Large-Scale Semantic Indexing and Question Answering in Biomedicine

In this paper we present the methods and approaches employed in terms of our participation in the 2016 version of the BioASQ challenge. For the semantic indexing task, we extended our successful ensemble approach of last year with additional models. The official results obtained so-far demonstrate a continuing consistent advantage of our approaches against the National Library of Medicine (NLM) baselines. For the question answering task, we extended our approach on factoid questions, while we also developed approaches for the document, concept and snippet retrieval sub-tasks.

[1]  Jonathan Weese,et al.  UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[2]  Lei Tang,et al.  Large scale multi-label classification via metalabeler , 2009, WWW '09.

[3]  Manik Varma,et al.  FastXML: a fast, accurate and stable tree-classifier for extreme multi-label learning , 2014, KDD.

[4]  Dina Demner-Fushman,et al.  Recent Enhancements to the NLM Medical Text Indexer , 2014, CLEF.

[5]  Georgios Balikas,et al.  Results of the BioASQ Track of the Question Answering Lab at CLEF 2014 , 2014, CLEF.

[6]  Timothy N. Rubin,et al.  Statistical topic models for multi-label document classification , 2011, Machine Learning.

[7]  Yannis Papanikolaou,et al.  Ensemble Approaches for Large-Scale Multi-Label Classification and Question Answering in Biomedicine , 2014, CLEF.

[8]  ChengXiang Zhai,et al.  The Fudan-UIUC Participation in the BioASQ Challenge Task 2a: The Antinomyra system , 2014, CLEF.

[9]  Ioannis A. Kakadiaris,et al.  Results of the BioASQ Tasks of the Question Answering Lab at CLEF 2015 , 2015, CLEF.

[10]  Zhiyong Lu,et al.  NCBI at the 2014 BioASQ Challenge Task: Large-scale Biomedical Semantic Indexing and Question Answering , 2014, CLEF.

[11]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[12]  Grigorios Tsoumakas,et al.  Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[13]  Luis M. de Campos,et al.  CoLe and UTAI Participation at the 2014 BioASQ Semantic Indexing Challenge , 2014, CLEF.