Extracting Radiological Findings With Normalized Anatomical Information Using a Span-Based BERT Relation Extraction Model

Medical imaging is critical to the diagnosis and treatment of numerous medical problems, including many forms of cancer. Medical imaging reports distill the findings and observations of radiologists, creating an unstructured textual representation of unstructured medical images. Large-scale use of this text-encoded information requires converting the unstructured text to a structured, semantic representation. We explore the extraction and normalization of anatomical information in radiology reports that is associated with radiological findings. We investigate this extraction and normalization task using a span-based relation extraction model that jointly extracts entities and relations using BERT. This work examines the factors that influence extraction and normalization performance, including the body part/organ system, frequency of occurrence, span length, and span diversity. It discusses approaches for improving performance and creating high-quality semantic representations of radiological phenomena.

[1]  Ross W. Filice,et al.  Deep-Learning Language-Modeling Approach for Automated, Personalized, and Iterative Radiology-Pathology Correlation. , 2019, Journal of the American College of Radiology : JACR.

[2]  Sophia Ananiadou,et al.  Mapping anatomical related entities to human body parts based on wikipedia in discharge summaries , 2019, BMC Bioinformatics.

[3]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[4]  Luciano M Prevedello,et al.  Natural Language Processing of Radiology Text Reports: Interactive Text Classification. , 2021, Radiology. Artificial intelligence.

[5]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[6]  Long Chen,et al.  Clinical concept normalization with a hybrid natural language processing system combining multilevel matching and machine learning ranking , 2020, J. Am. Medical Informatics Assoc..

[7]  A. Ulges,et al.  Span-based Joint Entity and Relation Extraction with Transformer Pre-training , 2019, ECAI.

[8]  Wei-Hung Weng,et al.  Publicly Available Clinical BERT Embeddings , 2019, Proceedings of the 2nd Clinical Natural Language Processing Workshop.

[9]  Clement J. McDonald,et al.  What can natural language processing do for clinical decision support? , 2009, J. Biomed. Informatics.

[10]  C. E. Kahn,et al.  Common Data Elements in Radiology. , 2017, Radiology.

[11]  M. Lungren,et al.  Preparing Medical Imaging Data for Machine Learning. , 2020, Radiology.

[12]  Christopher S. Hall,et al.  Automated Tracking of Follow-Up Imaging Recommendations. , 2019, AJR. American journal of roentgenology.

[13]  Amir M. Tahmasebi,et al.  Automatic Normalization of Anatomical Phrases in Radiology Reports Using Unsupervised Learning , 2018, Journal of Digital Imaging.

[14]  Steven Bethard,et al.  Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)–based ranking for concept normalization , 2020, J. Am. Medical Informatics Assoc..

[15]  Hua Xu,et al.  BERT-based Ranking for Biomedical Entity Normalization , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[16]  Kirk Roberts,et al.  RadLex Normalization in Radiology Reports , 2020, AMIA.

[17]  Sergey I. Nikolenko,et al.  Medical concept normalization in social media posts with recurrent neural networks , 2018, J. Biomed. Informatics.

[18]  Özlem Uzuner,et al.  The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records , 2020, J. Am. Medical Informatics Assoc..

[19]  Amir Tahmasebi,et al.  Context-Driven Concept Annotation in Radiology Reports: Anatomical Phrase Labeling. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[20]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[21]  Marcus A. Badgeley,et al.  Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports. , 2018, Radiology.

[22]  Philipp Daumke,et al.  Intelligent image retrieval based on radiology reports , 2012, European Radiology.