Learning rich geographical representations: Predicting colorectal cancer survival in the state of Iowa

Neural networks are capable of learning rich, nonlinear feature representations shown to be beneficial in many predictive tasks. In this work, we use these models to explore the use of geographical features in predicting colorectal cancer survival curves for patients in the state of Iowa, spanning the years 1989 to 2013. Specifically, we compare model performance using a newly defined metric – area between the curves (ABC) – to assess (a) whether survival curves can be reasonably predicted for colorectal cancer patients in the state of Iowa, (b) whether geographical features improve predictive performance, and (c) whether a simple binary representation or richer, spectral clustering-based representation perform better. Our findings suggest that survival curves can be reasonably estimated on average, with predictive performance deviating at the five-year survival mark. We also find that geographical features improve predictive performance, and that the best performance is obtained using richer, spectral analysis-elicited features.

[1]  M De Laurentiis,et al.  A technique for using neural network analysis to perform survival analysis of censored data. , 1994, Cancer letters.

[2]  Paolo Emilio Puddu,et al.  Artificial neural networks versus proportional hazards Cox models to predict 45-year all-cause mortality in the Italian Rural Areas of the Seven Countries Study , 2012, BMC Medical Research Methodology.

[3]  Tianbao Yang,et al.  Predicting Traffic Accidents Through Heterogeneous Urban Data : A Case Study , 2017 .

[4]  Kyle Luh,et al.  Community Detection Using Spectral Clustering on Sparse Geosocial Data , 2012, SIAM J. Appl. Math..

[5]  Fan Zhang,et al.  A traffic flow approach to early detection of gathering events , 2016, SIGSPATIAL/GIS.

[6]  Dharminder Kumar,et al.  DATA MINING CLASSIFICATION TECHNIQUES APPLIED FOR BREAST CANCER DIAGNOSIS AND PROGNOSIS , 2011 .

[7]  Vanessa Frías-Martínez,et al.  Spectral clustering for sensing urban land use using Twitter activity , 2014, Engineering applications of artificial intelligence.

[8]  Savita Goel,et al.  A study on prediction of breast cancer recurrence using data mining techniques , 2017, 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence.

[9]  Gerard Rushton,et al.  Who does not receive treatment for cancer? , 2013, Journal of oncology practice.

[10]  Saeedeh Pourahmad,et al.  Five Years Survival of Patients After Liver Transplantation and Its Effective Factors by Neural Network and Cox Poroportional Hazard Regression Models , 2015, Hepatitis monthly.

[11]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[12]  Smaranda Belciug,et al.  A two stage decision model for breast cancer detection , 2010 .

[13]  Alberto Maria Segre,et al.  A Large-Scale Exploration of Factors Affecting Hand Hygiene Compliance Using Linear Predictive Models , 2017, 2017 IEEE International Conference on Healthcare Informatics (ICHI).

[14]  Chetan Tiwari,et al.  Using Spatially Adaptive Filters to Map Late Stage Colorectal Cancer Incidence in Iowa , 2004, SDH.

[15]  D.,et al.  Regression Models and Life-Tables , 2022 .

[16]  Nidhi Mittal,et al.  A SURVEY ON PREDICTIVE ANALYSIS OF CANCER SURVIVABILITY RATE USING MACHINE LEARNING ALGORITHM , 2017 .

[17]  Smaranda Belciug,et al.  A hybrid neural network/genetic algorithm applied to breast cancer detection and recurrence , 2013, Expert Syst. J. Knowl. Eng..

[18]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[19]  Neng Wan,et al.  Spatial Access to Health Care Services and Disparities in Colorectal Cancer Stage at Diagnosis in Texas , 2013 .

[20]  I. Sandhu,et al.  ARTIFICIAL NEURAL NETWORK : AS EMERGING DIAGNOSTIC TOOL FOR BREAST CANCER , 2015 .

[21]  Chih-Lin Chi,et al.  Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets , 2007, AMIA.

[22]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.