Ovarian Cancer Classification Using Hybrid Synthetic Minority Over-Sampling Technique and Neural Network

Every woman is at risk of ovarian cancer; about 90 percent of women who develop it are over 40 years of age, and the highest number of cases occurs at 60 years and above. Early and correct diagnosis of ovarian cancer can allow proper treatment and, as a result, reduce the mortality rate. In this paper, we propose a hybrid of the Synthetic Minority Over-Sampling Technique (SMOTE) and an Artificial Neural Network (ANN) to diagnose ovarian cancer from a publicly available ovarian dataset. The dataset was first preprocessed using SMOTE and then classified with a neural network. The study shows that the performance of neural networks in cancer classification is improved by using SMOTE preprocessing to reduce the effect of class imbalance in the dataset. To justify the performance of the proposed approach, we compared our results with those of the standard neural network algorithms. Performance was evaluated in terms of accuracy, precision, recall, F-measure, and ROC Area Margin Curve. The results showed that SMOTE + MLP (with above 96% accuracy) performed better than SMOTE + RBF and the standard RBF and MLP.
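The abstract does not say how the SMOTE step was implemented, so the following is only a minimal sketch of the general SMOTE idea: each synthetic minority sample is placed at a random point on the line segment between a minority sample and one of its k nearest minority-class neighbours. The function name `smote`, its parameters, and the toy data are assumptions made for illustration; they are not the paper's dataset or code, and the neural network classifier is not shown.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (illustrative sketch)."""
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    if k < 1:
        raise ValueError("need at least two minority samples")
    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # k nearest neighbours of `base` inside the minority class, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy, made-up feature vectors (hypothetical; not the ovarian dataset)
majority = [(float(i), float(i % 3)) for i in range(10)]
minority = [(10.0, 10.0), (11.0, 10.5), (10.5, 11.5)]

# Oversample the minority class until both classes have the same size
new_points = smote(minority, n_synthetic=len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

In a pipeline like the one described, oversampling of this kind would normally be applied to the training portion only, so that synthetic points do not leak into the data used to measure accuracy, precision, recall, and F-measure.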
