Distance weighted 'inside disc' classifier for computer-aided diagnosis of colonic polyps

Feature classification plays an important role in computer-aided diagnosis (CADx) of suspicious lesions or polyps, the subject of this study. As one of the simplest machine learning algorithms, the k-nearest neighbor (k-NN) classifier has been widely used in many classification problems. However, the k-NN classifier has a drawback: the majority classes tend to dominate the prediction for a new sample. To mitigate this drawback, efforts have been devoted to assigning a weight to each neighbor to reduce the influence of the “majority” classes. As a result, various weighted k-NN (wk-NN) strategies have been explored. In this paper, we explore an alternative strategy, called the “distance weighted inside disc” (DWID) classifier. It differs from k-NN and wk-NN in that it classifies the test point by assigning a corresponding label (instead of a weight), considering only those points inside the disc centered at the test point rather than the k nearest points. We evaluated the new DWID classifier against the k-NN, wk-NN, support vector machine (SVM) and random forest (RF) classifiers in experiments on a database of 153 polyps, comprising 116 neoplastic (malignant) polyps and 37 hyperplastic (benign) polyps, for the CADx task of differentiating benign from malignant polyps. The evaluation outcomes were documented quantitatively by receiver operating characteristic (ROC) analysis and the area under the ROC curve (AUC), a well-established evaluation criterion for classifiers. According to the AUC values, the new classifier showed a noticeable gain in polyp differentiation compared to the k-NN and wk-NN, as well as the SVM and RF classifiers. The new classifier also showed a noticeable reduction in computing time.
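The "inside disc" idea can be illustrated with a minimal sketch. The abstract does not give the weighting formula, the choice of radius, or the handling of an empty disc, so everything specific here is an assumption made for illustration: the function name `dwid_predict`, the inverse-distance weight `1 / d`, the Euclidean metric, the user-supplied `radius`, and the fallback to the single nearest training point when no point lies inside the disc. It should not be read as the authors' implementation.

```python
import math
from collections import defaultdict


def dwid_predict(train_points, train_labels, query, radius, eps=1e-12):
    """Label `query` using only training points inside a disc around it.

    Assumptions (not taken from the paper): Euclidean distance, weight
    1 / (d + eps) per point, and a nearest-point fallback for an empty disc.
    """
    votes = defaultdict(float)            # accumulated weight per class label
    nearest_label, nearest_dist = None, math.inf

    for point, label in zip(train_points, train_labels):
        d = math.dist(point, query)       # Euclidean distance (Python 3.8+)
        if d < nearest_dist:              # track nearest point for the fallback
            nearest_label, nearest_dist = label, d
        if d <= radius:                   # only points inside the disc vote
            votes[label] += 1.0 / (d + eps)

    if not votes:                         # empty disc: assumed fallback
        return nearest_label
    return max(votes, key=votes.get)      # label with the largest total weight


# Toy data: one "benign" point very close to the query, three "malignant"
# points farther away but still inside the disc of radius 0.5.
points = [(0.1, 0.0), (0.4, 0.0), (0.0, 0.4), (-0.4, 0.0)]
labels = ["benign", "malignant", "malignant", "malignant"]
prediction = dwid_predict(points, labels, query=(0.0, 0.0), radius=0.5)
```

In this toy case the three malignant points outnumber the benign one inside the disc, but their combined inverse-distance weight (about 7.5) is smaller than the weight of the single close benign point (about 10), so the weighted vote goes to "benign". An unweighted majority vote over the same four points would return "malignant", which is the majority-class effect the abstract describes.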
