Results of the First BioASQ Workshop

The goal of the BioASQ project is to push the research frontier towards hybrid information systems. We aim to promote systems and approaches that are able to deal with the whole diversity of the Web, especially for, but not restricted to the context of bio-medicine. This goal is pursued by the organization of challenges. The first challenge consisted of two tasks: semantic indexing and question answering. 157 systems were registered by 12 different participants for the semantic indexing task, of which between 19 and 29 participated in each batch. The question answering task was tackled by 15 systems, which were developed by three different organizations. Between 2 and 5 of these systems addressed each batch. Overall, the best systems were able to outperform the strong baselines provided in the experiments in two out of three settings. This suggests that advances over the state of the art were achieved through the BioASQ challenge but also that the benchmark in itself is very challenging. In this paper, we present the data used during the challenge as well as the technologies which were at the core of the participants' frameworks.

[1]  Zhiyong Lu,et al.  NCBI at the 2013 BioASQ challenge task: Learning to rank for automatic MeSH indexing , 2013 .

[2]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[3]  Luis M. de Campos,et al.  Two Hierarchical Text Categorization Approaches for BioASQ Semantic Indexing Challenge , 2013, BioASQ@CLEF.

[4]  Antonio Jimeno-Yepes,et al.  The NLM Medical Text Indexer System for Indexing Biomedical Literature , 2013, BioASQ@CLEF.

[5]  Georgios Paliouras,et al.  Evaluation measures for hierarchical classification: a unified view and novel approaches , 2013, Data Mining and Knowledge Discovery.

[6]  Hongfang Liu,et al.  An Incremental Approach for MEDLINE MeSH Indexing , 2013, BioASQ@CLEF.

[7]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[8]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[9]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[10]  Lei Tang,et al.  Large scale multi-label classification via metalabeler , 2009, WWW '09.

[11]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[12]  Michael Schroeder,et al.  Answering Factoid Questions in the Biomedical Domain , 2013, BioASQ@CLEF.

[13]  Grigorios Tsoumakas,et al.  Large-Scale Semantic Indexing of Biomedical Publications , 2013, BioASQ@CLEF.