Query-constraint-based mining of association rules for exploratory analysis of clinical datasets in the National Sleep Research Resource

Background: Association Rule Mining (ARM) has been widely used by biomedical researchers to perform exploratory data analysis and uncover potential relationships among variables in biomedical datasets. However, performing ARM on high-dimensional biomedical datasets yields a large number of rules, many of which may be uninteresting. For imbalanced datasets in particular, performing ARM directly produces uninteresting rules dominated by variables that capture general characteristics.

Methods: We introduce a query-constraint-based ARM (QARM) approach for exploratory analysis of multiple, diverse clinical datasets in the National Sleep Research Resource (NSRR). QARM enables rule mining on the subset of data items that satisfy a query constraint. We first perform a series of data-preprocessing steps: variable selection, merging of semantically similar variables, combination of multiple-visit data, and data transformation. We then use the Top-k Non-Redundant (TNR) ARM algorithm to generate association rules. Finally, we remove general and subsumed rules so that only unique and non-redundant rules remain for a particular query constraint.

Results: Applying QARM to five datasets from NSRR yielded a total of 2517 association rules with a minimum confidence of 60% (using the top 100 rules for each query constraint). The results show that merging similar variables can avoid uninteresting rules, and that removing general and subsumed rules results in a more concise and interesting set of rules.

Conclusions: QARM shows the potential to support exploratory analysis of large biomedical datasets. It also proves useful for reducing the number of uninteresting association rules generated from imbalanced datasets.
A preliminary literature-based analysis showed that some association rules have supporting evidence in the biomedical literature, while others without such evidence may serve as candidate hypotheses for further investigation. Together with literature-based evidence, the association rules mined over the NSRR clinical datasets may be used to support clinical decisions for sleep-related problems.
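
The core idea of mining rules only on records that satisfy a query constraint can be illustrated with a small sketch. This is not the authors' implementation and it is not the TNR algorithm: it is a brute-force stand-in limited to single-item rules, and the function name `mine_rules`, the record encoding as Python sets, and the ranking by support are all assumptions made for illustration. Only the 60% confidence threshold and the top-100 cutoff come from the abstract.

```python
from itertools import combinations


def mine_rules(records, query, min_conf=0.6, top_k=100):
    """Mine single-item rules A -> B on the records that satisfy `query`.

    records: list of sets of items (hypothetical encoding, one set per subject)
    query:   set of items that every retained record must contain
    Returns a list of (antecedent, consequent, support, confidence) tuples.
    """
    # Keep only records satisfying the query constraint, and drop the query
    # items themselves so they cannot dominate the resulting rules.
    subset = [r - query for r in records if query <= r]
    n = len(subset)
    if n == 0:
        return []

    items = sorted(set().union(*subset))

    def support(itemset):
        # Fraction of retained records that contain every item in `itemset`.
        return sum(1 for r in subset if itemset <= r) / n

    rules = []
    for a, b in combinations(items, 2):
        s_both = support({a, b})
        if s_both == 0:
            continue
        # Try both directions of the pair as a candidate rule.
        for ante, cons in ((a, b), (b, a)):
            conf = s_both / support({ante})
            if conf >= min_conf:
                rules.append((ante, cons, s_both, conf))

    # Assumed ranking: highest support first, then highest confidence.
    rules.sort(key=lambda r: (-r[2], -r[3], r[0], r[1]))
    return rules[:top_k]
```

For example, calling `mine_rules(records, {"hypertension"})` restricts mining to records containing the hypothetical item `"hypertension"`. The removal of general and subsumed rules described in the Methods would be a separate post-processing step and is not shown here.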
