Finding Mentions of Abbreviations and Their Definitions in Spanish Clinical Cases: The BARR2 Shared Task Evaluation Results

A common characteristic of content generated by healthcare professionals, regardless the actual clinical discipline or language, is the widespread and frequent use of abbreviations, acronyms, telegraphic phrases and shorthand notes. Despite the well-known issues related to the ambiguity and misinterpretation of abbreviations, their use in practice is required to simplify and enable communication-avoiding repetition of long complex specialized medical terminologies. Moreover, clinical texts typically do not provide explicit abbreviation definitions. Thus the performance of clinical natural language processing and text mining systems is significantly affected by the previous recognition and definition resolution of medical abbreviations. To promote the development of such key components, we have organized the second Biomedical Abbreviation Recognition and Resolution (BARR2) track. The overall aim of this effort was to evaluate strategies for detecting automatically mentions of abbreviations in running text, as well as returning their corresponding definition given the corresponding context from Spanish clinical case studies. For this track, we constructed the Spanish clinical case corpus (SPACCC). This collection was exhaustively annotated by hand by domain experts with abbreviation mentions together with their corresponding definitions, resulting in the BARR2 corpus. A total of 5 teams submitted 26 runs for the two BARR2 subtasks: (a) the detection of explicit occurrences of abbreviation-definition pairs and (b) the resolution of abbreviations regardless whether their definition is mentioned within the actual document. Here we summarize the BARR2 track setting, the obtained results and the methodologies used by participating systems. The BARR2 task summary, resources and evaluation tool for testing systems beyond this campaign are available at: http://temu.bsc.es/BARR2.

[1]  Alfonso Valencia,et al.  The Biomedical Abbreviation Recognition and Resolution (BARR) Track: Benchmarking, Evaluation and Importance of Abbreviation Recognition Systems Applied to Spanish Biomedical Abstracts , 2017, IberEval@SEPLN.

[2]  Elizabeth D. Liddy,et al.  Finding Answers to Complex Questions , 2004, New Directions in Question Answering.

[3]  Hans Uszkoreit,et al.  Annotation of Entities and Relations in Spanish Radiology Reports , 2017, RANLP.

[4]  Chao Li,et al.  Acronym Disambiguation Using Word Embedding , 2015, AAAI.

[5]  Paloma Martínez,et al.  A Simple Method to Extract Abbreviations Within a Document Using Regular Expressions , 2018, IberEval@SEPLN.

[6]  Beatriz Betancourt Ynfiesta,et al.  Translation of acronyms and initialisms in medical texts on cardiology , 2013 .

[7]  S M Aronson On medical abbreviations. , 1989, Rhode Island medical journal.

[8]  Daniel R. Luna,et al.  A Simple Approach to Abbreviation Resolution at BARR2, IberEval 2018 , 2018, IberEval@SEPLN.

[9]  Jon D. Patrick,et al.  ShARe/CLEF eHealth 2013 Normalization of Acronyms/Abbreviations Challenge , 2013, CLEF.

[10]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[11]  Sung-Hyon Myaeng,et al.  Using Candidate Exploration and Ranking for Abbreviation Resolution in Clinical Document , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[12]  Mario Almagro,et al.  MAMTRA-MED at Biomedical Abbreviation Recognition and Resolution - IberEval 2018 , 2018, IberEval@SEPLN.

[13]  Amy Linsky,et al.  A randomized-controlled trial of computerized alerts to reduce unapproved medication abbreviation use , 2011, J. Am. Medical Informatics Assoc..

[14]  Yaakov HaCohen-Kerner,et al.  Combined One Sense Disambiguation of Abbreviations , 2008, ACL.

[15]  Sanna Salanterä,et al.  Overview of the ShARe/CLEF eHealth Evaluation Lab 2013 , 2013, CLEF.

[16]  Jian Su,et al.  Entity Linking with Effective Acronym Expansion, Instance Selection, and Topic Modeling , 2011, IJCAI.

[17]  Aitor García Pablos,et al.  Vicomtech at BARR2: Detecting Biomedical Abbreviations with ML Methods and Dictionary-based Heuristics , 2018, IberEval@SEPLN.

[18]  Martin Krallinger,et al.  Esfuerzos para fomentar la minería de textos en biomedicina más allá del inglés: el plan estratégico nacional español para las tecnologías del lenguaje , 2017, Proces. del Leng. Natural.

[19]  Richard Reading Ambiguous abbreviations: an audit of abbreviations in paediatric note keeping , 2008 .

[20]  Prodip Das-Purkayastha,et al.  Specialist Medical Abbreviations as a Foreign Language , 2004 .

[21]  Xosé Manuel Otero López,et al.  Seguridad de medicamentos.Abreviaturas, símbolos y expresiones de dosis asociados a errores de medicación , 2004 .

[22]  N. Samaranayake,et al.  The pattern of abbreviation use in prescriptions: a way forward in eliminating error-prone abbreviations and standardisation of prescriptions. , 2014, Current drug safety.

[23]  Fernando Sánchez León ARBOREx: Abbreviation Resolution Based on Regular Expressions for BARR2 , 2018, IberEval@SEPLN.

[24]  German Rigau,et al.  IXA pipeline: Efficient and Ready to Use Multilingual NLP tools , 2014, LREC.