Multi-test cervical cancer diagnosis with missing data estimation

Cervical cancer is one of the most common types of cancer in women worldwide. Existing screening programs for cervical cancer suffer from low sensitivity. Using images of the cervix (cervigrams) as an aid in detecting pre-cancerous changes has good potential to improve sensitivity and help reduce the number of cervical cancer cases. In this paper, we present a method that uses multi-modality information extracted from the multiple tests performed during a patient’s visit to classify that visit as either low-risk or high-risk. Our algorithm integrates image features and text features to make a diagnosis. We also present two strategies for estimating missing values in the text features: Image Classifier Supervised Mean Imputation (ICSMI) and Image Classifier Supervised Linear Interpolation (ICSLI). We evaluate our method on a large medical dataset and compare it with several alternative approaches. The results show that the proposed method with the ICSLI strategy achieves the best result, with 83.03% specificity and 76.36% sensitivity. When higher specificity is desired, our method can achieve 90% specificity with 62.12% sensitivity.
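The reported operating points (83.03% specificity with 76.36% sensitivity, or 90% specificity with 62.12% sensitivity) reflect the usual trade-off when the decision threshold on a classifier's risk score is moved. The sketch below only illustrates how the two metrics are computed and how they shift with the threshold. The function name, the toy labels, and the scores are hypothetical and are not taken from the paper's data or code.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    y_true: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for a given score threshold.

    y_true uses 1 for high-risk visits and 0 for low-risk visits.
    A visit is predicted high-risk when its score is >= threshold.
    """
    tp = fn = tn = fp = 0
    for label, score in zip(y_true, scores):
        predicted_high = score >= threshold
        if label == 1:
            if predicted_high:
                tp += 1
            else:
                fn += 1
        else:
            if predicted_high:
                fp += 1
            else:
                tn += 1
    # Guard against empty classes to avoid division by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Toy data (hypothetical): four high-risk and four low-risk visits.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.2, 0.7]

for threshold in (0.5, 0.75):
    sens, spec = sensitivity_specificity(y_true, scores, threshold)
    print(f"threshold={threshold}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

On this toy data, raising the threshold from 0.5 to 0.75 increases specificity while lowering sensitivity, which is the same kind of trade-off as the two operating points quoted above.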
