Towards a Learning Approach for Abbreviation Detection and Resolution

The explosion of biomedical literature and with it the -uncontrolled- creation of abbreviations presents some special challenges for both human readers and computer applications. We developed an annotated corpus of Dutch medical text, and experimented with two approaches to abbreviation detection and resolution. Our corpus is composed of abstracts from two medical journals from the Low Countries in which approximately 65 percent (NTvG) and 48 percent (TvG) of the abbreviations have a corresponding full form in the

[1]  Wei Luo,et al.  Medstract: creating large-scale information servers from biomedical texts , 2002, ACL Workshop on Natural Language Processing in the Biomedical Domain.

[2]  Kazem Taghva,et al.  Recognizing acronyms and their definitions , 1999, International Journal on Document Analysis and Recognition.

[3]  Hongfang Liu,et al.  A study of abbreviations in MEDLINE abstracts , 2002, AMIA.

[4]  James Pustejovsky,et al.  Automatic Extraction of Acronym-meaning Pairs from MEDLINE Databases , 2001, MedInfo.

[5]  Sophia Ananiadou,et al.  Term extraction using a similarity-based approach , 2001 .

[6]  Hongfang Liu,et al.  Evaluating the UMLS as a source of lexical knowledge for medical language processing , 2001, AMIA.

[7]  Youngja Park,et al.  Hybrid Text Mining for Finding Abbreviations and their Definitions , 2001, EMNLP.

[8]  Brian Roark,et al.  Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction , 2000, COLING.

[9]  Yuji Matsumoto,et al.  Fast Methods for Kernel-Based Text Analysis , 2003, ACL.

[10]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[11]  D. Bloom,et al.  Acronyms, abbreviations and initialisms , 2000, BJU international.

[12]  George Hripcsak,et al.  Mapping abbreviations to full forms in biomedical articles. , 2002, Journal of the American Medical Informatics Association : JAMIA.

[13]  H R Garner,et al.  Heuristics for Identification of Acronym-Definition Patterns within Text: Towards an Automated Construction of Comprehensive Acronym-Definition Dictionaries , 2002, Methods of Information in Medicine.

[14]  Walter Daelemans,et al.  Memory-Based Language Processing , 2009, Studies in natural language processing.

[15]  Russ B. Altman,et al.  Research Paper: Creating an Online Dictionary of Abbreviations from MEDLINE , 2002, J. Am. Medical Informatics Assoc..

[16]  C. Federiuk The effect of abbreviations on MEDLINE searching. , 1999, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[17]  Hongfang Liu,et al.  A study of abbreviations in the UMLS , 2001, AMIA.

[18]  J. Pustejovsky,et al.  Medstract : Creating Large-scale Information Servers for biomedical libraries , 2002 .