Online Biomedical Concept Annotation Using Language Model Mapping

We report the results of applying language technology to the bioinformatics problem of online concept annotation of biomedical text. We extend our concept annotator, CONANN, to find biomedical concepts using concept language models. The goal of CONANN is to improve annotation speed without losing annotation accuracy relative to offline systems, which facilitates the use of concept annotation in online environments. Intrinsic and extrinsic evaluations show accuracy competitive with a state-of-the-art biomedical text concept annotator, with a more than fourfold speed improvement.
