Cross-Sectional Relatedness Between Sentences in Breast Radiology Reports: Development of an SVM Classifier and Evaluation Against Annotations of Five Breast Radiologists

This study introduces the notion of cross-sectional relatedness: an informational dependence relation between a sentence in the conclusion section of a breast radiology report and a sentence in the findings section of the same report. It assesses inter-rater agreement among breast radiologists, and it develops and evaluates a support vector machine (SVM) classifier that detects cross-sectional relatedness automatically. A standard reference is created manually by the first author from 444 breast radiology reports. A subset of 37 reports is annotated by five breast radiologists, and inter-rater agreement is computed among their annotations and the standard reference. Thirteen numerical features are developed to characterize pairs of sentences; the optimal feature set is sought through forward selection. Inter-rater agreement, expressed as an F-measure, is 0.623. The SVM classifier achieves an F-measure of 0.699 against the standard reference in a 12-fold cross-validation protocol. Report length does not correlate with the classifier’s performance (correlation coefficient = −0.073). Against the annotations of the breast radiologists, the SVM classifier achieves an average F-measure of 0.505. The mediocre inter-rater agreement is possibly caused by (1) a definition that is insufficiently actionable, (2) the fine-grained nature of cross-sectional relatedness, which is defined at the sentence level rather than, for instance, the paragraph level, and (3) the higher-than-average complexity of the 37-report sample. The SVM classifier performs better against the standard reference than against the breast radiologists’ annotations, which supports (3). The SVM’s performance against the standard reference is satisfactory. Because the optimal feature set is not breast specific, the results may transfer to non-breast anatomies. Applications include a smart report viewing environment and data mining.
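The two procedures named in the abstract, F-measure as an agreement score and forward feature selection, can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: agreement is taken as the mean F-measure over all pairs of annotators, each annotation is a set of (conclusion sentence, findings sentence) index pairs, and forward selection stops as soon as no remaining feature improves the score. In the study the score would presumably be the cross-validated F-measure of an SVM trained on the candidate feature subset; here `score` is a hypothetical callable, and the feature names are invented for illustration.

```python
from itertools import combinations


def f_measure(predicted, reference):
    """Balanced F-measure between two sets of related sentence pairs."""
    if not predicted and not reference:
        return 1.0  # assumption: two empty annotations count as full agreement
    true_pos = len(predicted & reference)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(reference)
    return 2 * precision * recall / (precision + recall)


def mean_pairwise_f(annotations):
    """Average F-measure over all unordered pairs of annotators."""
    pairs = list(combinations(annotations, 2))
    return sum(f_measure(a, b) for a, b in pairs) / len(pairs)


def forward_select(features, score):
    """Greedy forward selection: add the best feature until the score stops improving."""
    selected, best = [], float("-inf")
    remaining = list(features)
    while remaining:
        cand_score, cand = max(
            ((score(selected + [f]), f) for f in remaining),
            key=lambda item: item[0],
        )
        if cand_score <= best:
            break  # no remaining feature improves the current subset
        selected.append(cand)
        remaining.remove(cand)
        best = cand_score
    return selected, best


# Toy annotations: each set holds (conclusion_idx, findings_idx) pairs.
rater_a = {(1, 2), (1, 3), (2, 5)}
rater_b = {(1, 2), (2, 5)}
rater_c = {(1, 2), (2, 4)}
agreement = mean_pairwise_f([rater_a, rater_b, rater_c])

# Toy stand-in for a cross-validated F-measure; names and gains are hypothetical.
toy_gain = {"word_overlap": 0.5, "sentence_position": 0.2, "noise": -0.1}
chosen, chosen_score = forward_select(toy_gain, lambda subset: sum(toy_gain[f] for f in subset))
```

Because the F-measure is symmetric in its two arguments, the pairwise average does not depend on which annotator is treated as the reference. The greedy search evaluates at most 13 + 12 + ... + 1 subsets for thirteen features, far fewer than the 8191 non-empty subsets an exhaustive search would require.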
