Deep Learning for Classification of Radiology Reports with a Hierarchical Schema

Abstract Radiological reports are a valuable source of textual information, which can be exploited to improve clinical care and to support research. Such information can be extracted and put into a structured form using machine learning techniques. Some of them rely not only on the classification labels but also on the manual annotation of relevant snippets, which is a time consuming job and requires domain experts. In this paper, we apply deep learning techniques and in particular Long Short Term Memory (LSTM) networks to perform such a task relying only on the classification labels. We focus on the classification of chest computed tomography reports in Italian according to a classification schema proposed for this task by the radiologists of Spedali Civili di Brescia. Each report is classified according to such schema using a combination of neural network classifiers. The resulting system is a novel classification system, which we compare to a previous system based on standard machine learning techniques which used annotations of relevant snippets.

[1]  Ramin Khorasani,et al.  Use of Machine Learning to Identify Follow-Up Recommendations in Radiology Reports , 2018, Journal of the American College of Radiology : JACR.

[2]  Hongfang Liu,et al.  A clinical text classification paradigm using weak supervision and deep representation , 2019, BMC Medical Informatics and Decision Making.

[3]  Diego Marcheggiani,et al.  On the Effects of Low-Quality Training Data on Information Extraction from Clinical Reports , 2017, JDIQ.

[4]  Ivan Serina,et al.  Automatic classification of radiological reports for clinical care , 2018, Artif. Intell. Medicine.

[5]  Young Soo Kim,et al.  Automatic Disease Annotation From Radiology Reports Using Artificial Intelligence Implemented by a Recurrent Neural Network. , 2019, AJR. American journal of roentgenology.

[6]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[7]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[8]  Andrea Esuli,et al.  An enhanced CRFs-based system for information extraction from radiology reports , 2013, J. Biomed. Informatics.

[9]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[10]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[11]  Thomas H. Payne,et al.  A text processing pipeline to extract recommendations from radiology reports , 2013, J. Biomed. Informatics.

[12]  Magda Tsintsadze,et al.  Natural Language Processing Based Instrument for Classification of Free Text Medical Records , 2016, BioMed research international.

[13]  C. Langlotz,et al.  Deep Learning to Classify Radiology Free-Text Reports. , 2017, Radiology.

[14]  Nan Ye,et al.  Conditional random field with high-order dependencies for sequence labeling and segmentation , 2014, J. Mach. Learn. Res..

[15]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[16]  Cynthia Brandt,et al.  Semi-supervised clinical text classification with Laplacian SVMs: An application to cancer case management , 2013, J. Biomed. Informatics.

[17]  Meliha Yetisgen-Yildiz,et al.  Classifying tumor event attributes in radiology reports , 2017, J. Assoc. Inf. Sci. Technol..

[18]  Loes M. M. Braun,et al.  Natural Language Processing in Radiology: A Systematic Review. , 2016, Radiology.