Association rules and data mining in hospital infection control and public health surveillance.

OBJECTIVES The authors consider the problem of identifying new, unexpected, and interesting patterns in hospital infection control and public health surveillance data and present a new data analysis process and system based on association rules to address this problem. DESIGN The authors first illustrate the need for automated pattern discovery and data mining in hospital infection control and public health surveillance. Next, they define association rules, explain how those rules can be used in surveillance, and present a novel process and system--the Data Mining Surveillance System (DMSS)--that utilize association rules to identify new and interesting patterns in surveillance data. RESULTS Experimental results were obtained using DMSS to analyze Pseudomonas aeruginosa infection control data collected over one year (1996) at University of Alabama at Birmingham Hospital. Experiments using one-, three-, and six-month time partitions yielded 34, 57, and 28 statistically significant events, respectively. Although not all statistically significant events are clinically significant, a subset of events generated in each analysis indicated potentially significant shifts in the occurrence of infection or antimicrobial resistance patterns of P. aeruginosa. CONCLUSION The new process and system are efficient and effective in identifying new, unexpected, and interesting patterns in surveillance data. The clinical relevance and utility of this process await the results of prospective studies currently in progress.

[1]  P. O'Hanley,et al.  Infectious disease management of adult leukemic patients undergoing chemotherapy: 1982 to 1986 experience at Stanford University Hospital. , 1989, The American journal of medicine.

[2]  R N Jones,et al.  Antibiotic resistance. Epidemiology and therapeutics. , 1992, Diagnostic microbiology and infectious disease.

[3]  J. Mcgowan,et al.  Do Intensive Hospital Antibiotic Control Programs Prevent the Spread of Antibiotic Resistance? , 1994, Infection Control & Hospital Epidemiology.

[4]  Daryl Pregibon,et al.  A Statistical Perspective on Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[5]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[6]  David Wai-Lok Cheung,et al.  Efficient Mining of Association Rules in Distributed Databases , 1996, IEEE Trans. Knowl. Data Eng..

[7]  Heikki Mannila,et al.  Multiple Uses of Frequent Sets and Condensed Representations (Extended Abstract) , 1996, KDD.

[8]  J. Freeman,et al.  Quantitative Epidemiology , 1996, Infection Control & Hospital Epidemiology.

[9]  I. Tager,et al.  Application of exponential smoothing for nosocomial infection surveillance. , 1996, American journal of epidemiology.

[10]  Rakesh Agrawal,et al.  Parallel Mining of Association Rules , 1996, IEEE Trans. Knowl. Data Eng..

[11]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[12]  Nick Andrews,et al.  A Statistical Algorithm for the Early Detection of Outbreaks of Infectious Disease , 1996 .

[13]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[14]  L. Hutwagner,et al.  Using laboratory-based surveillance data for prevention: an algorithm for detecting Salmonella outbreaks. , 1997, Emerging infectious diseases.

[15]  R. Gaynes,et al.  Surveillance of Nosocomial Infections: A Fundamental Ingredient for Quality , 1997, Infection Control & Hospital Epidemiology.

[16]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[17]  Mary Jane Ferraro,et al.  Methods for dilution antimicrobial susceptibility tests for bacteria that grow aerobically : approved standard , 2000 .