Can We Classify the Participants of a Longitudinal Epidemiological Study from Their Previous Evolution?

Medical research can greatly benefit from advances in data mining. We propose a mining approach for cohort analysis in a longitudinal population-based epidemiological study, and show that modelling and exploiting the evolution of cohort participants over time improves classification quality towards an outcome (a disease). Our mining workflow encompasses steps for tracing the evolution of the cohort participants and for using evolution features in classification. We show that our approach separates better between classes and that change in the values of variables is predictive. We report on results for the liver disorder hepatic steatosis (high fat accumulation in the liver), but our approach is appropriate for classification of longitudinal epidemiological data on further disorders.

[1]  M. Stepanova,et al.  Association of Sleep Disorders with Nonalcoholic Fatty Liver Disease (NAFLD): A Population-based Study. , 2013, Journal of clinical and experimental hepatology.

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  Myra Spiliopoulou,et al.  Mining Longitudinal Epidemiological Data to Understand a Reversible Disorder , 2014, IDA.

[4]  Jimeng Sun,et al.  A System for Mining Temporal Physiological Data Streams for Advanced Prognostic Decision Support , 2010, 2010 IEEE International Conference on Data Mining.

[5]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[6]  Myra Spiliopoulou,et al.  Using Participant Similarity for the Classification of Epidemiological Data on Hepatic Steatosis , 2014, 2014 IEEE 27th International Symposium on Computer-Based Medical Systems.

[7]  Mykola Pechenizkiy,et al.  Heart failure hospitalization prediction in remote patient management systems , 2010, 2010 IEEE 23rd International Symposium on Computer-Based Medical Systems (CBMS).

[8]  Myra Spiliopoulou,et al.  Learning and inspecting classification rules from longitudinal epidemiological data to identify predictive features on hepatic steatosis , 2014, Expert Syst. Appl..

[9]  Michael Frankfurter,et al.  Applied Longitudinal Data Analysis Modeling Change And Event Occurrence , 2016 .

[11]  Dheeraj Raju,et al.  Exploring factors associated with pressure ulcers: a data mining approach. , 2015, International journal of nursing studies.

[12]  W. Rathmann,et al.  Cohort profile: the study of health in Pomerania. , 2011, International journal of epidemiology.