Supervised deep learning embeddings for the prediction of cervical cancer diagnosis

Cervical cancer remains a significant cause of mortality worldwide, even though it can be prevented and cured by removing affected tissue at an early stage. Providing universal and efficient access to cervical screening programs is a challenge that requires, among other steps, identifying vulnerable individuals in the population. In this work, we present an automated computational strategy for predicting the outcome of a patient's biopsy from risk patterns in individual medical records. We propose a machine learning technique that jointly optimizes dimensionality reduction and classification models in a fully supervised way. We also build a model that highlights relevant properties in the low-dimensional space to ease the classification of patients. We instantiated the proposed approach with deep learning architectures and achieved accurate prediction results (top area under the curve, AUC = 0.6875) that outperform previously developed methods such as denoising autoencoders. Additionally, we explored some clinical findings from the embedding spaces and validated them against the medical literature, making them reliable for physicians and biomedical researchers.

Subjects: Bioinformatics, Computational Biology, Artificial Intelligence, Data Mining and Machine Learning
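The joint optimization of dimensionality reduction and classification described in the abstract can be illustrated with a small sketch. The example below is not the authors' model: it swaps the deep architectures for a linear encoder, a linear decoder, and a logistic classifier written in plain NumPy. It also assumes that the joint objective is a reconstruction error plus a weighted classification loss. The function name `train_joint`, the weight `lam`, the embedding size `dim`, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def train_joint(X, y, dim=2, lam=1.0, lr=0.1, epochs=500, seed=0):
    """Jointly fit a linear embedding and a logistic classifier on top of it.

    Assumed objective (illustrative only, not taken from the paper):
        loss = mean squared reconstruction error + lam * binary cross-entropy
    Both terms send gradients into the shared encoder weights W_enc.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W_enc = rng.normal(scale=0.1, size=(d, dim))  # encoder: input -> embedding
    W_dec = rng.normal(scale=0.1, size=(dim, d))  # decoder: embedding -> input
    w_clf = np.zeros(dim)                         # classifier weights on embedding
    b_clf = 0.0
    eps = 1e-12
    history = []

    for _ in range(epochs):
        # Forward pass
        Z = X @ W_enc
        X_hat = Z @ W_dec
        p = sigmoid(Z @ w_clf + b_clf)

        recon = np.mean((X_hat - X) ** 2)
        bce = -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
        history.append(recon + lam * bce)

        # Backward pass (manual gradients of the joint loss)
        d_xhat = 2.0 * (X_hat - X) / (n * d)   # gradient w.r.t. reconstruction
        d_logit = lam * (p - y) / n            # gradient w.r.t. classifier logit
        g_dec = Z.T @ d_xhat
        g_clf = Z.T @ d_logit
        g_b = d_logit.sum()
        # The embedding receives gradient from both the decoder and the classifier
        d_Z = d_xhat @ W_dec.T + np.outer(d_logit, w_clf)
        g_enc = X.T @ d_Z

        # Plain gradient descent step
        W_enc -= lr * g_enc
        W_dec -= lr * g_dec
        w_clf -= lr * g_clf
        b_clf -= lr * g_b

    return W_enc, w_clf, b_clf, history


# Synthetic stand-in for a table of patient risk factors (hypothetical data)
rng = np.random.default_rng(42)
X_demo = rng.normal(size=(200, 5))
y_demo = (X_demo[:, 0] + X_demo[:, 1] > 0).astype(float)

W_enc, w_clf, b_clf, history = train_joint(X_demo, y_demo)
embedding = X_demo @ W_enc  # low-dimensional representation of each record
```

Because the classifier's gradient flows back into `W_enc`, the embedding is shaped by the labels as well as by the reconstruction error, which is the sense in which the reduction is supervised. Setting `lam=0` in this sketch reduces it to an unsupervised linear autoencoder.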
