Coarse-to-fine classification via parametric and nonparametric models for computer-aided diagnosis

Classification is one of the core problems in Computer-Aided Diagnosis (CAD), targeting for early cancer detection using 3D medical imaging interpretation. High detection sensitivity with desirably low false positive (FP) rate is critical for a CAD system to be accepted as a valuable or even indispensable tool in radiologists' workflow. Given various spurious imagery noises which cause observation uncertainties, this remains a very challenging task. In this paper, we propose a novel, two-tiered coarse-to-fine (CTF) classification cascade framework to tackle this problem. We first obtain classification-critical data samples (e.g., implicit samples on the decision boundary) extracted from the holistic data distributions using a robust parametric model (e.g., [13]); then we build a graph-embedding based nonparametric classifier on sampled data, which can more accurately preserve or formulate the complex classification boundary. These two steps can also be considered as effective "sample pruning" and "feature pursuing + kNN/template matching", respectively. Our approach is validated comprehensively in colorectal polyp detection and lung nodule detection CAD systems, as the top two deadly cancers, using hospital scale, multi-site clinical datasets. The results show that our method achieves overall better classification/detection performance than existing state-of-the-art algorithms using single-layer classifiers, such as the support vector machine variants [17], boosting [15], logistic regression [11], relevance vector machine [13], k-nearest neighbor [9] or spectral projections on graph [2].

[1]  Frank Nielsen,et al.  Total Bregman divergence and its applications to shape retrieval , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[3]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[4]  Gregory G. Slabaugh,et al.  A Robust and Fast System for CTC Computer-Aided Detection of Colorectal Lesions , 2010, Algorithms.

[5]  Yuxiao Hu,et al.  Learning a Spatially Smooth Subspace for Face Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[7]  Stephen Lin,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[9]  Ilaria Gori,et al.  Comparing and combining algorithms for computer-aided detection of pulmonary nodules in computed tomography scans: The ANODE09 study , 2010, Medical Image Anal..

[10]  Francesco Bertoni,et al.  Hierarchical clustering analysis of pathologic and molecular data identifies prognostically and biologically distinct groups of colorectal carcinomas , 2011, Modern Pathology.

[11]  Ben Taskar,et al.  Learning associative Markov networks , 2004, ICML.

[12]  Matthew T. Freedman,et al.  Classification of lung nodules in diagnostic CT: an approach based on 3D vascular features, nodule density distribution, and shape features , 2003, SPIE Medical Imaging.

[13]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[14]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[15]  Huan Liu,et al.  Spectral feature selection for supervised and unsupervised learning , 2007, ICML '07.

[16]  Jitendra Malik,et al.  SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  Lior Wolf,et al.  Feature selection for unsupervised and supervised inference: the emergence of sparsity in a weighted-based approach , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[18]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[19]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[21]  Shijun Wang,et al.  Improved classifier for computer-aided polyp detection in CT colonography by nonlinear dimensionality reduction. , 2008, Medical physics.

[22]  Christos Boutsidis,et al.  Unsupervised feature selection for principal components analysis , 2008, KDD.

[23]  Mohammed J. Zaki,et al.  VOKNN: Voting-based Nearest Neighbor Approach for Scalable SVM Training , 2010 .

[24]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[25]  丸山 徹 Convex Analysisの二,三の進展について , 1977 .

[26]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[27]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[28]  Jiawei Han,et al.  Sparse Projections over Graph , 2008, AAAI.

[29]  Max A. Viergever,et al.  Computer-aided diagnosis in chest radiography: a survey , 2001, IEEE Transactions on Medical Imaging.

[30]  Christos Davatzikos,et al.  GRAM: A framework for geodesic registration on anatomical manifolds , 2010, Medical Image Anal..

[31]  Jacob D. Furst,et al.  Texture versus shape analysis for lung nodule similarity in computed tomography studies , 2008, SPIE Medical Imaging.

[32]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[33]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[34]  Nizar Bouguila,et al.  A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Siddheswar Ray,et al.  Determination of Number of Clusters in K-Means Clustering and Application in Colour Image Segmentation , 2000 .

[36]  Frans Vos,et al.  Computer-Aided Detection of Polyps in CT Colonography Using Logistic Regression , 2010, IEEE Transactions on Medical Imaging.

[37]  Murat Dundar,et al.  Bayesian multiple instance learning: automatic feature selection and inductive transfer , 2008, ICML '08.

[38]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[39]  Hiroyuki Yoshida,et al.  Computerized detection of pulmonary nodules in chest radiographs based on morphological features and wavelet snake model , 2002, Medical Image Anal..

[40]  Bram van Ginneken,et al.  A large-scale evaluation of automatic pulmonary nodule detection in chest CT using local image features and k-nearest-neighbour classification , 2009, Medical Image Anal..

[41]  Jinbo Bi,et al.  Dimensionality Reduction via Sparse Support Vector Machines , 2003, J. Mach. Learn. Res..

[42]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[43]  Frank Nielsen,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. 1 Total Bregman Divergence and its Applications to DTI Analysis , 2022 .

[44]  Luís A. Alexandre,et al.  A Multiclassifier Approach for Lung Nodule Classification , 2006, ICIAR.

[45]  Jinbo Bi,et al.  Effective 3D object detection and regression using probabilistic segmentation features in CT images , 2011, CVPR 2011.

[46]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[47]  Dominic Mazzoni,et al.  Multiclass reduced-set support vector machines , 2006, ICML.

[48]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.