Accuracy Enhanced Lung Cancer Prognosis for Improving Patient Survivability Using Proposed Gaussian Classifier System

Statistical classifier and good accuracy is an essential part of the research in medical data mining. Accurate prediction of lung cancer is an essential step for making effective clinical decisions. After identifying the lung cancer, minimum scopes are available in the medications for patient living in the world. Hemoglobin level and TNM stage wise patients survival period has to be varied. Some group of people survival period is minimal and another group of people survival time is lengthy. This study is aimed to develop a prediction model with new clinical variables to predict lung cancer patients. It’s based on revised 8th edition study of TNM in lung cancer. These new attributes are collected from SEER databases, Indian cancer hospitals and research centers. The collected new attributes are classified using supervised machine learning algorithms of linear regression, Naïve Bayes classifier and proposed algorithms of Gaussian K-Base NB classifier. In particular, for TNM stage 1 group of people with normal hemoglobin level (NHBL), that group of lung cancer patient quality of life is highly enhanced. Which proved by using supervised machine learning algorithms. The proposed algorithm classified the database in terms of with respect to tumor size and HB level and the results are confirmed in the R environment. The continuous attribute classification method to prove first level of TNM in lung cancer patient along with standard hemoglobin has to be maintained that the people survivability rate is higher than the smaller level of hemoglobin people survival rate. The Gaussian K-Base NB classifier is more effective than the existing machine learning algorithms for lung cancer prediction model. The proposed classification accuracy has measured using ROC methods.

[1]  F. Detterbeck,et al.  The eighth edition TNM stage classification for lung cancer: What does it mean on main street? , 2018, The Journal of thoracic and cardiovascular surgery.

[2]  Yang Zhang,et al.  Identification of TNM stage-specific genes in lung adenocarcinoma by genome-wide expression profiling , 2013, Oncology letters.

[3]  B. Laird,et al.  Prognosis in advanced lung cancer--A prospective study examining key clinicopathological factors. , 2015, Lung cancer.

[4]  Yan Xing,et al.  Relationship Between Tumor Size and Survival in Non–Small-Cell Lung Cancer (NSCLC): An Analysis of the Surveillance, Epidemiology, and End Results (SEER) Registry , 2015, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[5]  G. Tonini,et al.  Hemoglobin levels and quality of life in patients with symptomatic chemotherapy-induced anemia: the eAQUA study , 2016, Cancer management and research.

[6]  T. Kadir,et al.  Bayesian Networks for Clinical Decision Support in Lung Cancer Care , 2013, PloS one.

[7]  Bahram Goliaei,et al.  Prediction of lung tumor types based on protein attributes by machine learning algorithms , 2013, SpringerPlus.

[8]  Mostafa Langarizadeh,et al.  Applying Naive Bayesian Networks to Disease Prediction: a Systematic Review , 2016, Acta informatica medica : AIM : journal of the Society for Medical Informatics of Bosnia & Herzegovina : casopis Drustva za medicinsku informatiku BiH.

[9]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.

[10]  Reza Javidan,et al.  Predicting lung cancer survivability using ensemble learning methods , 2017, 2017 Intelligent Systems Conference (IntelliSys).

[11]  Matthew Toews,et al.  Predicting survival time of lung cancer patients using radiomic analysis , 2017, Oncotarget.

[12]  D. Libby,et al.  Tumor size predicts survival within stage IA non-small cell lung cancer. , 2003, Chest.

[13]  S. Sasikala,et al.  Multi Filtration Feature Selection (MFFS) to improve discriminatory ability in clinical data set , 2016 .

[14]  D. Schrump,et al.  Lung cancer staging in the genomics era. , 2006, Thoracic surgery clinics.

[15]  Lalith Polepeddi,et al.  Colon cancer survival prediction using ensemble data mining on SEER data , 2013, 2013 IEEE International Conference on Big Data.

[16]  Tolga Yuksel,et al.  A New Feature Ensemble with a Multistage Classification Scheme for Breast Cancer Diagnosis , 2017, Journal of healthcare engineering.

[17]  Pedro Larrañaga,et al.  Feature selection in Bayesian classifiers for the prognosis of survival of cirrhotic patients treated with TIPS , 2005, J. Biomed. Informatics.

[18]  James A. Bartholomai,et al.  Prediction of lung cancer patient survival via supervised machine learning classification techniques , 2017, Int. J. Medical Informatics.

[19]  J. Crowley,et al.  The IASLC Lung Cancer Staging Project: Proposals for Revision of the TNM Stage Groupings in the Forthcoming (Eighth) Edition of the TNM Classification for Lung Cancer , 2016, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[20]  Jaw-Yuan Wang,et al.  The impact of white blood cell count and hemoglobin level on the response to radiotherapy in patients with colorectal cancer , 2014 .

[21]  Y. Zhang,et al.  Correlations of Hemoglobin Level and Perioperative Blood Transfusion with the Prognosis of Gastric Cancer: A Retrospective Study , 2017, Medical science monitor : international medical journal of experimental and clinical research.

[22]  Andrew Janowczyk,et al.  Prediction of recurrence in early stage non-small cell lung cancer using computer extracted nuclear features from digital H&E images , 2017, Scientific Reports.

[23]  D. Gerber,et al.  Baseline tumour measurements predict survival in advanced non-small cell lung cancer , 2013, British Journal of Cancer.

[24]  P. Lambin,et al.  Externally validated HPV-based prognostic nomogram for oropharyngeal carcinoma patients yields more accurate predictions than TNM staging. , 2014, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[25]  E. Richardsen,et al.  CD45RO+ Memory T Lymphocytes — a Candidate Marker for TNM-Immunoscore in Squamous Non–Small Cell Lung Cancer1 , 2015, Neoplasia.

[26]  Y. Zhang,et al.  Pre-treatment hemoglobin levels are an independent prognostic factor in patients with non-small cell lung cancer. , 2018, Molecular and clinical oncology.