Intraoperative Diagnosis Support Tool for Serous Ovarian Tumors Based on Microarray Data Using Multicategory Machine Learning

Objectives: Serous borderline ovarian tumors (SBOTs) are a subtype of serous ovarian carcinoma with atypical proliferation. Frozen-section diagnosis is used intraoperatively to support fertility-sparing surgery, although its accuracy for diagnosing SBOTs ranges from 48% to 79%. Using DNA microarray technology, we designed multicategory classification models to support frozen-section diagnosis within 30 minutes.

Materials and Methods: We systematically evaluated 6 machine learning algorithms and 3 feature selection methods using 5-fold cross-validation and a grid search on microarray data obtained from the National Center for Biotechnology Information. To validate the models and the selected biomarkers, expression profiles were analyzed in tissue samples obtained from the Yonsei University College of Medicine.

Results: The optimal machine learning model achieved an accuracy of 97.3%. In addition, 5 features, including the expression of the putative biomarkers SNTN and AOX1, were selected to differentiate between the normal, SBOT, and serous ovarian carcinoma groups. The differences in SNTN and AOX1 expression levels were validated by real-time quantitative reverse-transcription polymerase chain reaction, Western blotting, and immunohistochemistry. A multinomial logistic regression model using SNTN and AOX1 alone was used to construct a simple-to-use equation that gave a diagnostic test accuracy of 91.9%.

Conclusions: We identified 2 biomarkers, SNTN and AOX1, that are likely involved in the pathogenesis and progression of ovarian tumors. Applying the equation together with expression analysis of SNTN and AOX1 would offer a new, accurate tool for diagnosing ovarian tumor subclasses that complements frozen-section diagnosis and can be completed within 30 minutes.
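The abstract does not give the fitted equation, so the following is only a minimal sketch of the general form a three-class multinomial logistic regression on two expression values takes. The coefficients, the class labels, and the function names `class_probabilities` and `predict` are hypothetical placeholders chosen for illustration; they are not values or identifiers from the study.

```python
import math

# HYPOTHETICAL coefficients (intercept, SNTN weight, AOX1 weight) per class.
# These are illustrative placeholders, NOT the values fitted in the study.
COEFFS = {
    "normal": (0.0, 0.0, 0.0),       # reference class: all coefficients zero
    "SBOT": (-1.0, 0.8, -0.3),
    "carcinoma": (0.5, -0.6, -0.9),
}


def class_probabilities(sntn, aox1):
    """Return softmax probabilities for each class given two expression values."""
    # Linear score per class: b0 + b1 * SNTN + b2 * AOX1
    scores = {
        label: b0 + b1 * sntn + b2 * aox1
        for label, (b0, b1, b2) in COEFFS.items()
    }
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores.values())
    exps = {label: math.exp(score - top) for label, score in scores.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


def predict(sntn, aox1):
    """Return the class with the highest predicted probability."""
    probs = class_probabilities(sntn, aox1)
    return max(probs, key=probs.get)
```

With real fitted coefficients, a call such as `predict(sntn_level, aox1_level)` would return one of the three groups; with the placeholder values above, the output has no diagnostic meaning.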
