Content-Based Image Retrieval System for Pulmonary Nodules Using Optimal Feature Sets and Class Membership-Based Retrieval

Lung cancer manifests as lung nodules, whose diagnosis is essential for planning treatment. Automated retrieval of nodule cases can assist trainee radiologists in self-learning and differential diagnosis. This paper presents a content-based image retrieval (CBIR) system for lung nodules that uses optimal feature sets and learning to enhance retrieval performance. Classifiers with many features suffer from the curse of dimensionality. We found that retrieval behaves similarly: the optimal feature set selected using the minimal-redundancy-maximal-relevance (mRMR) feature selection technique improves the precision of simple distance-based retrieval (SDR). Classifier performance is always superior to that of SDR, which inclines researchers toward conventional classifier-based retrieval (CCBR). While CCBR improves average precision and yields 100% precision when the query is classified correctly, a misclassified query yields zero retrieval precision. Class membership-based retrieval (CMR) has been found to bridge this gap for texture-based retrieval. Here, CMR is proposed for nodule retrieval using shape-, margin-, and texture-based features. We again find that an optimal feature set matters both for the classifier used in CMR and for the features used for retrieval, and the two optimal sets may differ. The proposed system is evaluated on two independent databases from two continents, the public LIDC/IDRI database and the private PGIMER-IITKGP database, using three distance metrics: Canberra, city block, and Euclidean. The proposed CMR-based retrieval system with optimal feature sets performs better than CCBR and SDR with optimal features in terms of average precision. In addition to the average and standard deviation of precision, the fraction of queries with zero retrieval precision is also measured.
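The distance-based retrieval and the three reported measures can be illustrated with a minimal sketch. This is not the paper's implementation: the two-dimensional feature vectors, the "benign"/"malignant" labels, the function names, and the choice of k = 2 are all assumptions made for illustration, and no feature selection or class-membership weighting is included.

```python
import math
from statistics import mean, pstdev


def canberra(a, b):
    # Terms whose denominator is zero are skipped (a common convention).
    total = 0.0
    for x, y in zip(a, b):
        denom = abs(x) + abs(y)
        if denom > 0:
            total += abs(x - y) / denom
    return total


def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def retrieve(query, database, distance, k):
    """Simple distance-based retrieval: indices of the k nearest items."""
    ranked = sorted(range(len(database)),
                    key=lambda i: distance(query, database[i][0]))
    return ranked[:k]


def precision(query_label, database, retrieved):
    """Fraction of retrieved items that share the query's label."""
    hits = sum(1 for i in retrieved if database[i][1] == query_label)
    return hits / len(retrieved)


def evaluate(queries, database, distance, k):
    """Return (average precision, std of precision, zero-precision fraction)."""
    scores = [precision(label, database, retrieve(feat, database, distance, k))
              for feat, label in queries]
    zero_fraction = sum(1 for s in scores if s == 0) / len(scores)
    return mean(scores), pstdev(scores), zero_fraction


# Hypothetical toy data: (feature vector, label) pairs.
database = [([0.10, 0.20], "benign"), ([0.15, 0.25], "benign"),
            ([0.90, 0.80], "malignant"), ([0.85, 0.90], "malignant")]
queries = [([0.12, 0.22], "benign"), ([0.88, 0.85], "malignant"),
           ([0.10, 0.20], "malignant")]  # last query sits among the wrong class

for metric in (canberra, city_block, euclidean):
    print(metric.__name__, evaluate(queries, database, metric, k=2))
```

In this toy setup the third query is surrounded by items of the other class, so it contributes a zero-precision retrieval, which is the failure case that the zero-precision fraction is meant to expose alongside the average.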
