Multimodal Sparse Representation-Based Classification for Lung Needle Biopsy Images

Lung needle biopsy image classification is a critical task for computer-aided lung cancer diagnosis. In this study, a novel method, multimodal sparse representation-based classification (mSRC), is proposed for classifying lung needle biopsy images. In the data acquisition procedure of our method, the cell nuclei are automatically segmented from the images captured by needle biopsy specimens. Then, features of three modalities (shape, color, and texture) are extracted from the segmented cell nuclei. After this procedure, mSRC goes through a training phase and a testing phase. In the training phase, three discriminative subdictionaries corresponding to the shape, color, and texture information are jointly learned by a genetic algorithm guided multimodal dictionary learning approach. The dictionary learning aims to select the topmost discriminative samples and encourage large disagreement among different subdictionaries. In the testing phase, when a new image comes, a hierarchical fusion strategy is applied, which first predicts the labels of the cell nuclei by fusing three modalities, then predicts the label of the image by majority voting. Our method is evaluated on a real image set of 4372 cell nuclei regions segmented from 271 images. These cell nuclei regions can be divided into five classes: four cancerous classes (corresponding to four types of lung cancer) plus one normal class (no cancer). The results demonstrate that the multimodal information is important for lung needle biopsy image classification. Moreover, compared to several state-of-the-art methods (LapRLS, MCMI-AB, mcSVM, ESRC, KSRC), the proposed mSRC can achieve significant improvement (mean accuracy of 88.1%, precision of 85.2%, recall of 92.8%, etc.), especially for classifying different cancerous types.

[1]  Yu-Bin Yang,et al.  Lung cancer cell identification based on artificial neural network ensembles , 2002, Artif. Intell. Medicine.

[2]  Michael C. Lee,et al.  Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction , 2010, Artif. Intell. Medicine.

[3]  Stefano Diciotti,et al.  Automated Segmentation Refinement of Small Lung Nodules in CT Scans by Local Shape Analysis , 2011, IEEE Transactions on Biomedical Engineering.

[4]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Dimitris N. Metaxas,et al.  Automated detection of prostatic adenocarcinoma from high-resolution ex vivo MRI , 2005, IEEE Transactions on Medical Imaging.

[6]  J. Pearlman,et al.  Early lung cancer detection based on registered perfusion MRI. , 2006, Oncology reports.

[7]  Qiao Wei,et al.  Segmentation of Lung Lobes in High-Resolution Isotropic CT Images , 2009, IEEE Transactions on Biomedical Engineering.

[8]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  R. Bharat Rao,et al.  Bayesian Co-Training , 2007, J. Mach. Learn. Res..

[10]  Ting Wang,et al.  Kernel Sparse Representation-Based Classifier , 2012, IEEE Transactions on Signal Processing.

[11]  João Paulo Papa,et al.  Automatic Segmentation and Classification of Human Intestinal Parasites From Microscopy Images , 2013, IEEE Transactions on Biomedical Engineering.

[12]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[13]  Nathalie Harder,et al.  Feature Selection for Evaluating Fluorescence Microscopy Images in Genome-Wide Cell Screens , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Yang Gao,et al.  Multi-class Multi-instance Learning for Lung Cancer Image Classification Based on Bag Feature Selection , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[15]  Giorgio Valentini,et al.  Support vector machines for candidate nodules classification , 2005, Neurocomputing.

[16]  Daoqiang Zhang,et al.  Ensemble sparse classification of Alzheimer's disease , 2012, NeuroImage.

[17]  Yang Yu,et al.  Diversity Regularized Machine , 2011, IJCAI.

[18]  Bunyarit Uyyanonvara,et al.  An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation , 2012, IEEE Transactions on Biomedical Engineering.

[19]  Jacob D. Furst,et al.  A model for the relationship between semantic and content based similarity using LIDC , 2010, Medical Imaging.

[20]  David Zhang,et al.  Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[21]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[22]  Joel A. Tropp,et al.  Greed is good: algorithmic results for sparse approximation , 2004, IEEE Transactions on Information Theory.

[23]  Kensaku Mori,et al.  Recognition of bronchus in three-dimensional X-ray CT images with applications to virtualized bronchoscopy system , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[24]  Zhi-Hua Zhou,et al.  Analyzing Co-training Style Algorithms , 2007, ECML.

[25]  Henning Müller,et al.  Fusing visual and clinical information for lung tissue classification in high-resolution computed tomography , 2010, Artif. Intell. Medicine.

[26]  Guillermo Sapiro,et al.  Online dictionary learning for sparse coding , 2009, ICML '09.

[27]  George Lee,et al.  Multi-modal data fusion schemes for integrated classification of imaging and non-imaging biomedical data , 2011, 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[28]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[29]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[30]  Daoqiang Zhang,et al.  Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.

[31]  Trevor Darrell,et al.  Multi-View Learning in the Presence of View Disagreement , 2008, UAI 2008.

[32]  John J. Grefenstette,et al.  Optimization of Control Parameters for Genetic Algorithms , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[33]  Elisabeth Brambilla,et al.  Pathology and genetics of tumours of the lung , pleura, thymus and heart , 2004 .

[34]  Yinghuan Shi,et al.  Transductive cost-sensitive lung cancer image classification , 2012, Applied Intelligence.

[35]  Joseph F. Murray,et al.  Dictionary Learning Algorithms for Sparse Representation , 2003, Neural Computation.

[36]  Zhi-Hua Zhou,et al.  Semi-supervised learning by disagreement , 2010, Knowledge and Information Systems.