Mining consequence events in temporal health data

It is useful, sometimes crucial in medicine domain, to discover a temporal association or causal relationship among events. Such a mining problem is often challenging because 'consequence events' may not reliably occur after each trigger event of interest. This makes it difficult to apply existing temporal data mining techniques directly to real world problems. In this paper, we formalise the problem of mining consequence events of newly-introduced interventions. We combine the Before-After-Control-Impact (BACI) design with frequent pattern mining techniques to define an interestingness measure called consequency. We then propose a Multiple Occurrence of Target events Mining (MOTM) algorithm. MOTM is applied to the real world problem of monitoring the consequence effects of newly-marketed medicines in linked administrative health databases. The results for the case of the cholesterol lowering drug atorvastatin highlight the consequence events with lowest negative consequency values, which suggest replacement of existing therapies with the new one. The consequence events with highest consequency values are likely to be associated with adverse reactions of atorvastatin or treatments of cardiovascular (or associated) conditions. Sensitivity examination of MOTM on another drug further illustrates its effectiveness.

[1]  Jack T. Harvey,et al.  Data Mining of the Australian Adverse Drug Reactions Database , 2002 .

[2]  Maria E. Orlowska,et al.  Finding Event-Oriented Patterns in Long Temporal Sequences , 2003, PAKDD.

[3]  Ronald HB Meyboom,et al.  Rosuvastatin and the statin wars—the way to peace , 2004, The Lancet.

[4]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[5]  Deborah Layton,et al.  Identification and evaluation of a possible signal of exacerbation of colitis during rofecoxib treatment, using Prescription‐Event Monitoring data , 2004, Journal of clinical pharmacy and therapeutics.

[6]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[7]  Haym Hirsh,et al.  Learning to Predict Rare Events in Event Sequences , 1998, KDD.

[8]  Tao Li,et al.  Mining temporal patterns without predefined time windows , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[9]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[10]  P. S. Sastry,et al.  Discovering frequent episodes and learning hidden Markov models: a formal connection , 2005, IEEE Transactions on Knowledge and Data Engineering.

[11]  A. Moore,et al.  Wsare: What’s strange about recent events? , 2003, Journal of Urban Health.

[12]  Geoffrey I. Webb Efficient search for association rules , 2000, KDD '00.

[13]  Brian L. Strom,et al.  Drug Epidemiology and Post-Marketing Surveillance , 1992, NATO ASI Series.

[14]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[15]  Diane J. Cook,et al.  An event set approach to sequence discovery in medical data , 2000, Intell. Data Anal..

[16]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[17]  P. Corey,et al.  Incidence of Adverse Drug Reactions in Hospitalized Patients , 2012 .

[18]  E. Roughead The nature and extent of drug-related hospitalisations in Australia. , 1999, Journal of quality in clinical practice.

[19]  L. L. Eberhardt,et al.  Quantitative ecology and impact assessment , 1976 .

[20]  Jie Chen,et al.  Mining Unexpected Associations for Signalling Potential Adverse Drug Reactions from Administrative Health Databases , 2006, PAKDD.

[21]  Maria E. Orlowska,et al.  Finding Negative Event-Oriented Patterns in Long Temporal Sequences , 2004, PAKDD.

[22]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[23]  Ming-Syan Chen,et al.  Progressive Partition Miner: An Efficient Algorithm for Mining General Temporal Association Rules , 2003, IEEE Trans. Knowl. Data Eng..

[24]  Lehana Thabane,et al.  Application of data mining techniques in pharmacovigilance. , 2003, British journal of clinical pharmacology.

[25]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[26]  Mohammed J. Zaki,et al.  PlanMine: Predicting Plan Failures Using Sequence Mining , 1998, Artificial Intelligence Review.

[27]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[28]  Sushil Jajodia,et al.  Discovering calendar-based temporal association rules , 2003 .

[29]  Jie Chen,et al.  Mining Unexpected Temporal Associations: Applications in Detecting Adverse Drug Reactions , 2008, IEEE Transactions on Information Technology in Biomedicine.

[30]  N. Laird,et al.  Incidence of Adverse Drug Events and Potential Adverse Drug Events: Implications for Prevention , 1995 .

[31]  Jie Chen,et al.  Temporal Sequence Associations for Rare Events , 2004, PAKDD.

[32]  M. Hauben,et al.  Quantitative Methods in Pharmacovigilance , 2003, Drug safety.