Early Detection and Prevention of Cancer using Data Mining Techniques

Cancer is one of the leading causes of death worldwide. Early detection and prevention of cancer plays a very important role in reducing deaths caused by cancer. Identification of genetic and environmental factors is very important in developing novel methods to detect and prevent cancer. Therefore a novel multi layered method combining clustering and decision tree techniques to build a cancer risk prediction system is proposed here which predicts lung, breast, oral, cervix, stomach and blood cancers and is also user friendly, time and cost saving. This research uses data mining technology such as classification, clustering and prediction to identify potential cancer patients. The gathered data is preprocessed, fed into the database and classified to yield significant patterns using decision tree algorithm. Then the data is clustered using Kmeans clustering algorithm to separate cancer and non cancer patient data. Further the cancer cluster is subdivided into six clusters. Finally a prediction system is developed to analyze risk levels which help in prognosis. This research helps in detection of a person’s predisposition for cancer before going for clinical and lab tests which is cost and time consuming.

[1]  Zakaria Suliman Zubi,et al.  Improves Treatment Programs of Lung Cancer Using Data Mining Techniques , 2014 .

[2]  T. Sridevi,et al.  An Intelligent Classifier for Breast Cancer Diagnosis based on K-Means Clustering and Rough Set , 2014 .

[3]  Hany A. Elsalamony,et al.  Diagnosis of Breast Cancer using Decision Tree Models and SVM , 2013 .

[4]  R. Shanmugalakshmi,et al.  Childhood Cancer-a Hospital based study using Decision Tree Techniques , 2011 .

[5]  V. Ranjani,et al.  Data Mining Applications In Healthcare Sector: A Study , 2013 .

[6]  Neelamadhab Padhy,et al.  The Survey of Data Mining Applications And Feature Scope , 2012, ArXiv.

[7]  Xiuzhen Cheng,et al.  Developing Prognostic Systems of Cancer Patients by Ensemble Clustering , 2009, Journal of biomedicine & biotechnology.

[8]  Amiya Kumar Rath,et al.  A hybridized K-means clustering approach for high dimensional dataset , 2010 .

[9]  Shardul Pandya,et al.  A comparative study of data mining and statistical learning techniques for prediction of cancer survivability , 2012 .

[10]  B. Milović,et al.  Prediction and Decision Making in Health Care using Data Mining , 2012 .

[11]  Saudi Arabia,et al.  A study of digital mammograms by using clustering algorithms , 2012 .

[12]  R. Geetha Ramani,et al.  Data Mining in Clinical Data Sets: A Review , 2012 .

[13]  Rajneet Kaur Using Some Data Mining Techniques to Predict the Survival Year of Lung Cancer Patient , 2013 .

[14]  Labeed K Abdulgafoor,et al.  Detection of Brain Tumor using Modified K Means Algorithm and SVM , 2013 .

[15]  Dr. P. Sumathi,et al.  A Survey on Data Mining using Clustering Techniques , 2013 .

[16]  Nasru Minallah,et al.  Classification and Regression Analysis of the Prognostic Breast Cancer using Generation Optimizing Algorithms , 2013 .

[17]  Gugulothu Narsimha,et al.  Diagnosis of Lung Cancer Prediction System Using Data Mining Classification Techniques , 2013 .

[18]  R. Anitha,et al.  Classifiers based Approach for Pre-Diagnosis of Lung Cancer Disease , 2014 .