A Comparative Study of Breast Cancer Diagnosis Using Supervised Machine Learning Techniques

Cancer is a class of diseases, which is driven by change in cells of the body and increase beyond normal growth and control. Breast cancer is one of the frequent types of cancer. Prognosis of breast cancer recurrence is highly required to raise the survival rate of patient suffering from breast cancer. With the advancement of technology and machine learning techniques, the cancer diagnosis and detection accuracy has improved. Machine learning (ML) techniques offer various probabilistic and statistical methods that allow intelligent systems to learn from reoccurring past experiences to detect and identify patterns from a dataset. The research work presented an overview of evolve the machine learning techniques in cancer disease by applying learning algorithms on breast cancer Wisconsin data –Linear regression, Random Forest, Multi-layer Perceptron and Decision Trees (DT). The result outcome shows that Multilayer perceptron performs better than other techniques.

[1]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[2]  Aik Choon Tan,et al.  Ensemble machine learning on gene expression data for cancer classification. , 2003, Applied bioinformatics.

[3]  Ya Zhang,et al.  A gene signature for breast cancer prognosis using support vector machine , 2012, 2012 5th International Conference on BioMedical Engineering and Informatics.

[4]  Angeline Christobel,et al.  An Empirical Comparison of Data Mining Classification Methods , 2011 .

[5]  Kenli Li,et al.  A Parallel Random Forest Algorithm for Big Data in a Spark Cloud Computing Environment , 2017, IEEE Transactions on Parallel and Distributed Systems.

[6]  S. Muruganandam,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2020 .

[7]  Ferenc Szeifert,et al.  Supervised fuzzy clustering for the identification of fuzzy classifiers , 2003, Pattern Recognit. Lett..

[8]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[9]  Dr.G. Wiselin Jiji,et al.  An Efficient CBIR Approach for Diagnosing the Stages of Breast Cancer Using KNN Classifier , 2012 .

[10]  V. Marx Biology: The big challenges of big data , 2013, Nature.

[11]  S. Pal,et al.  Data Mining Techniques: To Predict and Resolve Breast Cancer Survivability , 2017 .

[12]  Natasha A. Khovanova,et al.  Handling limited datasets with neural networks in medical applications: A small-data approach , 2017, Artif. Intell. Medicine.

[13]  Abdel-Badeeh M. Salem,et al.  Clustering-based approach for detecting breast cancer recurrence , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[14]  Zhongming Zhao,et al.  Classification of Cancer Primary Sites Using Machine Learning and Somatic Mutations , 2015, BioMed research international.

[15]  Miriam Seoane Santos,et al.  Predicting Breast Cancer Recurrence Using Machine Learning Techniques , 2016, ACM Comput. Surv..

[16]  H. Joensuu,et al.  Artificial Neural Networks Applied to Survival Prediction in Breast Cancer , 1999, Oncology.

[17]  K. Usha Rani,et al.  ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA , 2012 .

[18]  Zhi-Hua Zhou,et al.  Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble , 2003, IEEE Transactions on Information Technology in Biomedicine.

[19]  J. Shavlik,et al.  Breast cancer risk estimation with artificial neural networks revisited , 2010, Cancer.

[20]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[21]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[22]  Luís M. Silva,et al.  High-Content Analysis of Breast Cancer Using Single-Cell Deep Transfer Learning , 2016, Journal of biomolecular screening.

[23]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[24]  K.S. Nikita,et al.  Classification of medical data with a robust multi-level combination scheme , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[25]  R nbspPatelBrijain,et al.  A Survey on Decision Tree Algorithm for Classification , 2014 .

[26]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.