Automatic Classification of Free-Text Radiology Reports to Identify Limb Fractures using Machine Learning and the SNOMED CT Ontology

Objective: To develop and evaluate machine learning techniques that identify limb fractures and other abnormalities (e.g. dislocations) from radiology reports.

Materials and Methods: 99 free-text reports of limb radiology examinations were acquired from an Australian public hospital. Two clinicians identified fractures and abnormalities in the reports; a third, senior clinician resolved disagreements. These assessors found that 48 of the 99 reports referred to fractures or abnormalities of limb structures. Automated methods were then used to extract features from the reports for use in automatic classification. The Naive Bayes classification algorithm and two implementations of the support vector machine algorithm were formally evaluated using cross-validation over the 99 reports.

Results: The Naive Bayes classifier accurately identifies fractures and other abnormalities from the radiology reports. These results were achieved with stemmed token bigram and negation features, and with these features combined with SNOMED CT concepts related to abnormalities and disorders. The latter feature has not been used in previous work on classifying free-text radiology reports.

Discussion: Automated classification methods have proven effective at identifying fractures and other abnormalities from radiology reports (F-measure up to 92.31%). Key to the success of these techniques are features such as stemmed token bigrams, negations, and SNOMED CT concepts associated with morphologic abnormalities and disorders.

Conclusion: This investigation shows promising early results; future work will further validate and strengthen the proposed approaches.
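The pipeline described above (stemmed token bigrams and negation features, a Naive Bayes classifier, cross-validation, F-measure) can be illustrated with a minimal sketch. It is not the authors' implementation: the suffix-stripping `stem` function, the `NEGATION_CUES` list, the three-token negation window, the interleaved fold assignment, and the toy reports are all assumptions made for illustration, and SNOMED CT concept features are omitted because they require an external terminology service.

```python
import math
import re
from collections import Counter

NEGATION_CUES = {"no", "not", "without"}  # hypothetical cue list, not from the paper

def stem(token):
    """Crude suffix stripper standing in for a real stemmer (assumption)."""
    for suffix in ("ing", "ed", "es", "s", "e"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def extract_features(report):
    """Stemmed token bigrams plus NEG_ features for tokens after a negation cue."""
    tokens = [stem(t) for t in re.findall(r"[a-z]+", report.lower())]
    features = [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]
    for i, tok in enumerate(tokens):
        if tok in NEGATION_CUES:
            # Assumed window: mark up to three tokens following the cue as negated.
            features.extend(f"NEG_{t}" for t in tokens[i + 1 : i + 4])
    return features

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.prior = {c: math.log(labels.count(c) / len(labels)) for c in self.classes}
        self.counts = {c: Counter() for c in self.classes}
        for doc, label in zip(docs, labels):
            self.counts[label].update(extract_features(doc))
        self.vocab = set().union(*self.counts.values())
        return self

    def predict(self, doc):
        feats = [f for f in extract_features(doc) if f in self.vocab]

        def score(c):
            total = sum(self.counts[c].values()) + len(self.vocab)
            return self.prior[c] + sum(
                math.log((self.counts[c][f] + 1) / total) for f in feats
            )

        return max(self.classes, key=score)

def f_measure(gold, predicted, positive=1):
    """Balanced F-measure (harmonic mean of precision and recall)."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def cross_validate(docs, labels, k=2):
    """k-fold cross-validation with interleaved folds; returns the pooled F-measure."""
    predictions = [None] * len(docs)
    for fold in range(k):
        test_idx = set(range(fold, len(docs), k))
        train_idx = [i for i in range(len(docs)) if i not in test_idx]
        model = NaiveBayes().fit(
            [docs[i] for i in train_idx], [labels[i] for i in train_idx]
        )
        for i in test_idx:
            predictions[i] = model.predict(docs[i])
    return f_measure(labels, predictions)

# Invented toy reports (1 = fracture/abnormality, 0 = normal); not study data.
TOY_REPORTS = [
    "Transverse fracture of the distal radius.",
    "Fracture of the left tibia with displacement.",
    "No fracture seen. Normal alignment.",
    "No fracture or dislocation identified.",
]
TOY_LABELS = [1, 1, 0, 0]
```

Calling `NaiveBayes().fit(TOY_REPORTS, TOY_LABELS).predict("No fracture identified.")` exercises the negation features, and `cross_validate(TOY_REPORTS, TOY_LABELS)` returns a pooled F-measure. With only four invented reports that value says nothing about real performance.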
