Content-based retrieval for lung nodule diagnosis using learned distance metric

Similarity metric of the lung nodules can be useful in differentiating between benign and malignant lung nodule lesions on computed tomography (CT). Unlike previous computerized schemes, which focus on the features extracting, we concentrate on similarity metric of the lung nodules. In this study, we first assemble a lung nodule dataset which is from LIDC-IDRI lung CT images. This dataset includes 746 lung nodules in which 375 domain radiologists identified malignant nodules and 371 domain radiologists-identified benign nodules. Each nodule is represented by a vector of 26 texture features. We then propose a content-based image retrieval (CBIR) scheme to classify between benign and malignant lung nodules with a learned Mahalanobis distance metric. The Mahalanobis distance metric as a similarity metric can preserve semantic relevance and visual similarity of lung nodules. The CBIR approach uses this Mahalanobis distance to search for most similar reference nodules for each queried nodule. The majority of votes are then computed to predict the likelihood of the queried nodule depicting a malignant lesion. For the classification accuracy, the area under the ROC curve (AUC) can achieve as 0.942±0.008. The recall and precision of benign nodules are 0.860 and 0.889, respectively. The recall and precision of malignant nodules are 0.893 and 0.866, respectively.

[1]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[2]  Jun Yu,et al.  Semantic preserving distance metric learning and applications , 2014, Inf. Sci..

[3]  Max A. Viergever,et al.  Computer-aided diagnosis in chest radiography: a survey , 2001, IEEE Transactions on Medical Imaging.

[4]  Richard C. Pais,et al.  The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. , 2011, Medical physics.

[5]  K. Doi,et al.  Investigation of psychophysical measure for evaluation of similar images for mammographic masses: preliminary results. , 2005, Medical physics.

[6]  A. Jemal,et al.  Cancer statistics, 2015 , 2015, CA: a cancer journal for clinicians.

[7]  He Ma,et al.  Similarity measurement of lung masses for medical image retrieval using kernel based semisupervised distance metric. , 2016, Medical physics.

[8]  Rong Jin,et al.  A Boosting Framework for Visuality-Preserving Distance Metric Learning and Its Application to Medical Image Retrieval , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Wen He,et al.  Assessing the use of digital radiography and a real-time interactive pulmonary nodule analysis system for large population lung cancer screening. , 2012, European journal of radiology.

[10]  Alexander McGregor,et al.  Lung cancer screening using low-dose computed tomography in at-risk individuals: the Toronto experience. , 2010, Lung cancer.

[11]  David Dagan Feng,et al.  Content-Based Medical Image Retrieval: A Survey of Applications to Multidimensional and Multimodality Data , 2013, Journal of Digital Imaging.

[12]  R. Pozzi Mucelli,et al.  Acoustic Radiation Force Impulse (ARFI) ultrasound imaging of solid focal liver lesions. , 2012, European journal of radiology.

[13]  B. van Ginneken,et al.  Computer-aided diagnosis in high resolution CT of the lungs. , 2003, Medical physics.

[14]  Hong Liu,et al.  Assessment of performance and reproducibility of applying a content-based image retrieval scheme for classification of breast lesions. , 2015, Medical physics.

[15]  Inderjit S. Dhillon,et al.  Information-theoretic metric learning , 2006, ICML '07.

[16]  Rong Jin,et al.  Distance Metric Learning: A Comprehensive Survey , 2006 .