Multi-Modal Fusion Learning For Cervical Dysplasia Diagnosis

Fusion of multi-modal information from a patient’s screening tests can help improve the diagnostic accuracy of cervical dysplasia. In this paper, we present a novel multi-modal deep learning fusion network, called MultiFuseNet, for cervical dysplasia diagnosis, utilizing multi-modal data from cervical screening results. To exploit the relations among different image modalities, we propose an Attention Mutual-Enhance (AME) module to fuse features of each modality at the feature extraction stage. Specifically, we first develop the Fused Faster R-CNN with AME modules for automatic cervix region detection and fused image feature learning, and then incorporate non-image information into the learning model to jointly learn non-linear correlations among all the modalities. To effectively train the Fused Faster R-CNN, we employ an alternating training scheme. Experimental results show the effectiveness of our method, which achieves an average accuracy of 87.4% (88.6% sensitivity and 86.1% specificity) on a large dataset, outperforming the methods using any single modality alone and the known multi-modal methods.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Youngjin Yoo,et al.  Hierarchical Multimodal Fusion of Deep-Learned Lesion and Tissue Integrity Features in Brain MRIs for Distinguishing Neuromyelitis Optica from Multiple Sclerosis , 2017, MICCAI.

[4]  Jeff Heflin,et al.  Multimodal Entity Coreference for Cervical Dysplasia Diagnosis , 2015, IEEE Transactions on Medical Imaging.

[5]  Jia Gu,et al.  Automated image analysis of uterine cervical images , 2007, SPIE Medical Imaging.

[6]  Tao Xu,et al.  A New Image Data Set and Benchmark for Cervical Dysplasia Classification Evaluation , 2015, MLMI.

[7]  Tao Xu,et al.  Adjustable adaboost classifier and pyramid features for image-based cervical cancer diagnosis , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[8]  Xin Yang,et al.  Joint Detection and Diagnosis of Prostate Cancer in Multi-parametric MRI Based on Multimodal Convolutional Neural Networks , 2017, MICCAI.

[9]  Noor Azuan Abu Osman,et al.  Intelligent Screening Systems for Cervical Cancer , 2014, TheScientificWorldJournal.

[10]  Seong-Whan Lee,et al.  Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis , 2014, NeuroImage.

[11]  Xiaogang Wang,et al.  Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jose Jeronimo,et al.  Comparative performance analysis of cervix ROI extraction and specular reflection removal algorithms for uterine cervix image analysis , 2007, SPIE Medical Imaging.

[13]  Tao Xu,et al.  Multimodal Deep Learning for Cervical Dysplasia Diagnosis , 2016, MICCAI.

[14]  A. Jemal,et al.  Cancer statistics, 2017 , 2017, CA: a cancer journal for clinicians.

[15]  Tao Xu,et al.  Multi-test cervical cancer diagnosis with missing data estimation , 2015, Medical Imaging.

[16]  Margherita Branca,et al.  Performance characteristics of Pap test, VIA, VILI, HR-HPV testing, cervicography, and colposcopy in diagnosis of significant cervical pathology , 2012, Virchows Archiv.