Multimodal Deep Learning for Cervical Dysplasia Diagnosis

To improve the diagnostic accuracy of cervical dysplasia, it is important to fuse the multimodal information collected during a patient’s screening visit. However, current multimodal frameworks suffer from low sensitivity at high specificity levels because they are limited in learning correlations among highly heterogeneous modalities. In this paper, we design a deep learning framework for cervical dysplasia diagnosis that leverages multimodal information. We first employ a convolutional neural network (CNN) to convert the low-level image data into a feature vector that can be fused with the non-image modalities. We then jointly learn the non-linear correlations among all modalities in a deep neural network. The resulting framework is an end-to-end deep network that learns better complementary features from the image and non-image modalities. It automatically gives the final diagnosis for cervical dysplasia with 87.83% sensitivity at 90% specificity on a large dataset, significantly outperforming both methods that use any single source of information alone and previous multimodal frameworks.
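The fusion step described above can be illustrated with a minimal NumPy sketch of a forward pass. It is not the paper's architecture: the abstract gives no layer sizes, CNN backbone, or training details, so the dimensions, the random stand-in for the CNN output, the initialisation, and the names `init_layer` and `fuse_and_classify` are all assumptions made for illustration. The sketch only shows the general idea of concatenating an image feature vector with non-image features and passing the result through shared fully connected layers.

```python
import numpy as np


def relu(x):
    return np.maximum(x, 0.0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def init_layer(rng, n_in, n_out):
    # Hypothetical initialisation (scaled Gaussian weights, zero bias);
    # the abstract does not say how the network is initialised.
    w = rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))
    b = np.zeros(n_out)
    return w, b


def fuse_and_classify(image_features, clinical_features, params):
    """Concatenate both modalities, then apply joint fully connected layers."""
    # Fusion: one vector per patient holding image and non-image features.
    x = np.concatenate([image_features, clinical_features], axis=1)
    (w1, b1), (w2, b2) = params
    hidden = relu(x @ w1 + b1)    # joint non-linear layer over all modalities
    logits = hidden @ w2 + b2     # single output unit per patient
    return sigmoid(logits).ravel()  # probability-like score in [0, 1]


rng = np.random.default_rng(0)

# Assumed sizes, chosen only to keep the example small.
batch, image_dim, clinical_dim, hidden_dim = 4, 16, 5, 8

# Stand-in for the feature vector a CNN would produce from each image.
image_features = rng.normal(size=(batch, image_dim))
# Stand-in for non-image test results recorded at the screening visit.
clinical_features = rng.normal(size=(batch, clinical_dim))

params = [
    init_layer(rng, image_dim + clinical_dim, hidden_dim),
    init_layer(rng, hidden_dim, 1),
]

probs = fuse_and_classify(image_features, clinical_features, params)
print(probs.shape)
```

In an end-to-end setting, the CNN that produces `image_features` and the joint layers would be trained together, so that gradients from the diagnosis loss also shape the image features. The sketch omits training entirely.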
