Contrasting temporal trend discovery for large healthcare databases

With the increased acceptance of electronic health records, we can observe the increasing interest in the application of data mining approaches within this field. This study introduces a novel approach for exploring and comparing temporal trends within different in-patient subgroups, which is based on associated rule mining using Apriori algorithm and linear model-based recursive partitioning. The Nationwide Inpatient Sample (NIS), Healthcare Cost and Utilization Project (HCUP), Agency for Healthcare Research and Quality was used to evaluate the proposed approach. This study presents a novel approach where visual analytics on big data is used for trend discovery in form of a regression tree with scatter plots in the leaves of the tree. The trend lines are used for directly comparing linear trends within a specified time frame. Our results demonstrate the existence of opposite trends in relation to age and sex based subgroups that would be impossible to discover using traditional trend-tracking techniques. Such an approach can be employed regarding decision support applications for policy makers when organizing campaigns or by hospital management for observing trends that cannot be directly discovered using traditional analytical techniques.

[1]  Roberto J. Bayardo,et al.  Mining the most interesting rules , 1999, KDD '99.

[2]  Kwang Sun Ryu,et al.  Discovering Medical Knowledge using Association Rule Mining in Young Adults with Acute Myocardial Infarction , 2013, Journal of Medical Systems.

[3]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[4]  J S Alpert,et al.  A community-wide perspective of gender differences and temporal trends in the use of diagnostic and revascularization procedures for acute myocardial infarction. , 1993, The American journal of cardiology.

[5]  Wouter Duivesteijn,et al.  Exceptional Model Mining , 2008, Data Mining and Knowledge Discovery.

[6]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[7]  Gediminas Adomavicius,et al.  C-TREND: Temporal Cluster Graphs for Identifying and Visualizing Trends in Multiattribute Transactional Data , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8]  Rob Law,et al.  Identifying changes and trends in Hong Kong outbound tourism , 2011 .

[9]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[10]  Tamara Pilishvili,et al.  Changing epidemiology of invasive pneumococcal disease among older adults in the era of pediatric pneumococcal conjugate vaccine. , 2005, JAMA.

[11]  Rajeev Motwani,et al.  Beyond Market Baskets: Generalizing Association Rules to Dependence Rules , 1998, Data Mining and Knowledge Discovery.

[12]  W. Baine,et al.  The Agency for Healthcare Research and Quality , 2006, Italian Journal of Public Health.

[13]  Jiong Yang,et al.  TAR: temporal association rules on evolving numerical attributes , 2001, Proceedings 17th International Conference on Data Engineering.

[14]  Viola Vaccarino,et al.  Association of age and sex with myocardial infarction symptom presentation and in-hospital mortality. , 2012, JAMA.

[15]  Bharati M. Ramageri DATA MINING TECHNIQUES AND APPLICATIONS , 2011 .

[16]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[17]  David C. Yen,et al.  Data mining techniques for customer relationship management , 2002 .

[18]  Sushil Jajodia,et al.  Discovering calendar-based temporal association rules , 2001, Proceedings Eighth International Symposium on Temporal Representation and Reasoning. TIME 2001.

[19]  Edward Omiecinski,et al.  Alternative Interest Measures for Mining Associations in Databases , 2003, IEEE Trans. Knowl. Data Eng..

[20]  Matthew Crosby,et al.  Association for the Advancement of Artificial Intelligence , 2014 .

[21]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[22]  K. Hornik,et al.  party : A Laboratory for Recursive Partytioning , 2009 .

[23]  Andreas Holzinger,et al.  Optimizing long-term treatment of rheumatoid arthritis with systematic documentation , 2011, 2011 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth) and Workshops.

[24]  Dave Dongelmans,et al.  PRIM versus CART in subgroup discovery: When patience is harmful , 2010, J. Biomed. Informatics.

[25]  Andreas Holzinger,et al.  On Knowledge Discovery and Interactive Intelligent Visualization of Biomedical Data - Challenges in Human-Computer Interaction & Biomedical Informatics , 2012, DATA.

[26]  Niels Peek,et al.  Intelligent Data Analysis for Knowl edge Discovery, Patient Monitoring and Quality Assessment , 2012, Methods of Information in Medicine.

[27]  K. Hornik,et al.  Generalized M‐fluctuation tests for parameter instability , 2007 .

[28]  D. Andrews Tests for Parameter Instability and Structural Change with Unknown Change Point , 1993 .

[29]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[30]  W. J. Boscardin,et al.  Age and Sex Variation in Prevalence of Chronic Medical Conditions in Older Residents of U.S. Nursing Homes , 2012, Journal of the American Geriatrics Society.

[31]  Kurt Hornik,et al.  Introduction to arules – A computational environment for mining association rules and frequent item sets , 2009 .

[32]  H Du,et al.  Data Mining Techniques and Applications , 2010 .