Multi-feature based benchmark for cervical dysplasia classification evaluation

Cervical cancer is one of the most common types of cancer in women worldwide. Most deaths due to the disease occur in less developed areas of the world. In this work, we introduce a new image dataset along with expert annotated diagnoses for evaluating image-based cervical disease classification algorithms. A large number of Cervigram® images are selected from a database provided by the US National Cancer Institute. For each image, we extract three complementary pyramid features: Pyramid histogram in L*A*B* color space (PLAB), Pyramid Histogram of Oriented Gradients (PHOG), and Pyramid histogram of Local Binary Patterns (PLBP). Other than hand-crafted pyramid features, we investigate the performance of convolutional neural network (CNN) features for cervical disease classification. Our experimental results demonstrate the effectiveness of both our hand-crafted and our deep features. We intend to release this multi-feature dataset and our extensive evaluations using seven classic classifiers can serve as the baseline.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Razvan Pascanu,et al.  M L ] 2 0 A ug 2 01 3 Pylearn 2 : a machine learning research library , 2014 .

[3]  B. Dziura,et al.  Assisted primary screening using the automated ThinPrep Imaging System. , 2005, American journal of clinical pathology.

[4]  Jia Gu,et al.  Automated image analysis of uterine cervical images , 2007, SPIE Medical Imaging.

[5]  Majid Mirmehdi,et al.  Comparative Exudate Classification Using Support Vector Machines and Neural Networks , 2002, MICCAI.

[6]  David C Wilbur,et al.  The Becton Dickinson FocalPoint GS Imaging System: clinical trials demonstrate significantly improved sensitivity for the detection of important cervical lesions. , 2009, American journal of clinical pathology.

[7]  L Gaffikin,et al.  A critical assessment of screening methods for cervical neoplasia , 2005, International journal of gynaecology and obstetrics: the official organ of the International Federation of Gynaecology and Obstetrics.

[8]  Sameer Antani,et al.  Digital Tools for Collecting Data from Cervigrams for Research and Training in Colposcopy , 2006, Journal of lower genital tract disease.

[9]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[10]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[11]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[12]  L. Bruni,et al.  Human papillomavirus (HPV) and related cancers in the Global Alliance for Vaccines and Immunization (GAVI) countries. A WHO/ICO HPV Information Centre Report. , 2012, Vaccine.

[13]  Liana G. Apostolova,et al.  Comparison of AdaBoost and Support Vector Machines for Detecting Alzheimer's Disease Through Automated Hippocampal Segmentation , 2010, IEEE Transactions on Medical Imaging.

[14]  Antonio Criminisi,et al.  Image Quality Transfer via Random Forest Regression: Applications in Diffusion MRI , 2014, MICCAI.

[15]  Qiang Ji,et al.  Classifying cervix tissue patterns with texture analysis , 2000, Pattern Recognit..

[16]  Jeff Heflin,et al.  Multimodal Entity Coreference for Cervical Dysplasia Diagnosis , 2015, IEEE Transactions on Medical Imaging.

[17]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Yanxi Liu,et al.  Cervical Cancer Detection Using SVM Based Feature Screening , 2004, MICCAI.

[19]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[20]  Noor Azuan Abu Osman,et al.  Intelligent Screening Systems for Cervical Cancer , 2014, TheScientificWorldJournal.

[21]  L. Mango,et al.  Design and methods of a population-based natural history study of cervical neoplasia in a rural province of Costa Rica: the Guanacaste Project. , 1997, Revista panamericana de salud publica = Pan American journal of public health.

[22]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[23]  Xiaolei Huang,et al.  A Data Driven Approach to Cervigram Image Analysis and Classification , 2013 .

[24]  Junzhou Huang,et al.  Discriminative sparse representations for cervigram image segmentation , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[25]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[26]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[27]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[28]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[29]  Hayit Greenspan,et al.  Automatic detection of specular reflections in uterine cervix images , 2006, SPIE Medical Imaging.

[30]  Pietro Perona,et al.  Quickly Boosting Decision Trees - Pruning Underachieving Features Early , 2013, ICML.

[31]  W. Eric L. Grimson,et al.  Performance Issues in Shape Classification , 2002, MICCAI.

[32]  Jenq-Neng Hwang,et al.  Integration of neural networks and decision tree classifiers for automated cytology screening , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[33]  Robert M. Nishikawa,et al.  A study on several Machine-learning methods for classification of Malignant and benign clustered microcalcifications , 2005, IEEE Transactions on Medical Imaging.

[34]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..