Analysing Big Data to Build Knowledge Based System for Early Detection of Ovarian Cancer

Big data analysis plays a crucial role in the health care for early diagnosis of fatal disease. The data mining techniques are widely used for data analysis problem to discover valuable knowledge from a large amount of data. This paper uses the data mining methods such as feature selection and classification to provide a predictive model for ovarian cancer detection. A huge amount of dataset is gathered to build knowledge based system. Rough set theory is utilized to find the data reliance and reduce the feature set contained in the data set. The Hybrid Particle Genetic Swarm Optimization (PGSO) is used to optimize the selected features to efficiently classify the ovarian cancer, either normal or early or different stages of ovarian cancer. Multi class SVM is adopted as the classifier to classify normal or different stages of ovarian cancer using the optimized feature set. The experiment is done on different ovarian cancer dataset and the proposed system has obtained better results for all datasets.

[1]  Dr. S. P. Rajagopalan,et al.  An automatic Oral Cancer Classification using Data Mining Techniques , 2013 .

[2]  R. Shanmugalakshmi,et al.  Multi-Objective Firefly Algorithm for Multi-Class Gene Selection , 2015 .

[3]  Li-Yeh Chuang,et al.  A Hybrid BPSO-CGA Approach for Gene Selection and Classification of Microarray Data , 2012, J. Comput. Biol..

[4]  Gugulothu Narsimha,et al.  Diagnosis of Lung Cancer Prediction System Using Data Mining Classification Techniques , 2013 .

[5]  Nishchal K. Verma,et al.  Arrhythmia classification using SVM with selected features , 2012 .

[6]  Shweta Kharya,et al.  Using data mining techniques for diagnosis and prognosis of cancer disease , 2012, ArXiv.

[7]  N. Mohammadzadeh,et al.  Using Intelligent Data Analysis in Cancer Care: Benefits and Challenges , 2014 .

[8]  Anu Peisker,et al.  Data Analytics for Rural Development , 2015 .

[9]  Manjula Sanjay Koti,et al.  Knowledge Discovery in Medical Data by using Rough Set Rule Induction Algorithms , 2014 .

[10]  Geok See Ng,et al.  Ovarian cancer diagnosis using complementary learning fuzzy neural network , 2005 .

[11]  Karam M. Sallam,et al.  A TOPSIS based Method for Gene Selection for Cancer Classification , 2013 .

[12]  K. Usha Rani,et al.  ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS , 2011 .

[13]  Hesham Arafat,et al.  Using Intelligent Techniques for Breast Cancer Classification , 2012 .

[14]  Reza Effatnejad,et al.  Unit Commitment in Power System t by Combination of Dynamic Programming (DP), Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) , 2014 .

[15]  Yahya Slimani,et al.  A Novel RFE-SVM-based Feature Selection Approach for Classification , 2012 .

[16]  Kyoo-Sung Noh Plan for Vitalisation of Application of Big Data for e-Learning in South Korea , 2015 .

[17]  ChulSu Lim,et al.  Creating Values from a Noisy Accumulated Contents Based on Data Analysis , 2015 .

[18]  Heena Farooq Bhat,et al.  Modified One-Against-All Algorithm Based on Support Vector Machine , 2014 .