ICE: Interactive Classification Rule Exploration on Epidemiological Data

Personalized medicine benefits from the identification of subpopulations that exhibit higher prevalence of a disease than the general population: such subpopulations can become the target of more intensive investigations to identify risk factors and to develop dedicated therapies. Classification rule discovery algorithms are an appropriate tool for discovering such subpopulations: they scale well, even for multi-dimensional data and deliver comprehensible patterns. However, they may generate hundreds of rules and thus call for exploration methods. In this study, we extend the tool Interactive Medical Miner for the discovery of classification rules, into the Interactive Classification rule Explorer ICE, which offers functionalities for rule exploration, grouping, rule visualization and statistics. We report on our first results for the classification of cohort data on goiter, a disorder of the thyroid gland.

[1]  Iain E. Buchan,et al.  Requirements Engineering for E-science: Experiences in Epidemiology , 2009, IEEE Software.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Johannes Fürnkranz,et al.  Foundations of Rule Learning , 2012, Cognitive Technologies.

[4]  Myra Spiliopoulou,et al.  Interactive Medical Miner: Interactively Exploring Subpopulations in Epidemiological Datasets , 2014, ECML/PKDD.

[5]  Vincenzo Forgetta,et al.  Software Application Profiles: useful and novel software for epidemiological data analysis. , 2016, International journal of epidemiology.

[6]  Byoung-Tak Zhang,et al.  Rule-based in vitro molecular classification and visualization , 2013, BioChip Journal.

[7]  Yoones A. Sekhavat,et al.  Visualizing Association Rules Using Linked Matrix, Graph, and Detail Views , 2013 .

[8]  Alok N. Choudhary,et al.  Identifying hotspots in five year survival electronic health records of older adults , 2016, 2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS).

[9]  Myra Spiliopoulou,et al.  Learning and inspecting classification rules from longitudinal epidemiological data to identify predictive features on hepatic steatosis , 2014, Expert Syst. Appl..

[10]  W. Rathmann,et al.  Cohort profile: the study of health in Pomerania. , 2011, International journal of epidemiology.

[11]  Henry Völzke,et al.  Association of socioeconomic status with iodine supply and thyroid disorders in northeast Germany. , 2013, Thyroid : official journal of the American Thyroid Association.

[12]  Zhiyuan Zhang,et al.  Iterative cohort analysis and exploration , 2015, Inf. Vis..