Data Mining Approach to Estimate the Duration of Drug Therapy from Longitudinal Electronic Medical Records

Electronic Medical Records (EMRs) from primary/ ambulatory care systems present a new and promising source of information for conducting clinical and translational research. To address the methodological and computational challenges in order to extract reliable medication information from raw data which is often complex, incomplete and erroneous. To assess whether the use of specific chaining fields of medication information may additionally improve the data quality. Guided by a range of challenges associated with missing and internally inconsistent data, we introduce two methods for the robust extraction of patient-level medication data. First method relies on chaining fields to estimate duration of treatment (“chaining”), while second disregards chaining fields and relies on the chronology of records (“continuous”). Centricity EMR database was used to estimate treatment duration with both methods for two widely prescribed drugs among type 2 diabetes patients: insulin and glucagon-like peptide-1 receptor agonists. At individual patient level the “chaining” approach could identify the treatment alterations longitudinally and produced more robust estimates of treatment duration for individual drugs, while the “continuous” method was unable to capture that dynamics. At population level, both methods produced similar estimates of average treatment duration, however, notable differences were observed at individual-patient level. The proposed algorithms explicitly identify and handle longitudinal erroneous or missing entries and estimate treatment duration with specific drug(s) of interest, which makes them a valuable tool for future EMR based clinical and pharmaco-epidemiological studies. To improve accuracy of real-world based studies, implementing chaining fields of medication information is recommended.

[1]  K. Khunti,et al.  Association of smoking and concomitant metformin use with cardiovascular events and mortality in people newly diagnosed with type 2 diabetes , 2016, Journal of diabetes.

[2]  M. Toumi,et al.  Comparison Of Comorbidity Measures To Predict Economic Outcomes In A Large Uk Primary Care Database. , 2015, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[3]  From Checklists to Tools: Lowering the Barrier to Better Research Reporting , 2015, PLoS medicine.

[4]  J. West,et al.  Calculating Total Health Service Utilisation and Costs from Routinely Collected Electronic Health Records Using the Example of Patients with Irritable Bowel Syndrome Before and After Their First Gastroenterology Appointment , 2015, PharmacoEconomics.

[5]  David Moher,et al.  The REporting of Studies Conducted Using Observational Routinely-Collected Health Data (RECORD) Statement: Methods for Arriving at Consensus and Developing Reporting Guidelines , 2015, PloS one.

[6]  K. Khunti,et al.  Delay in treatment intensification increases the risks of cardiovascular events in patients with type 2 diabetes , 2015, Cardiovascular Diabetology.

[7]  Remle Newton-Dame,et al.  The state of population health surveillance using electronic health records: a narrative review. , 2015, Population health management.

[8]  Mike Rayner,et al.  The epidemiology of cardiovascular disease in the UK 2014 , 2015, Heart.

[9]  J. Denny,et al.  Extracting research-quality phenotypes from electronic health records to support precision medicine , 2015, Genome Medicine.

[10]  Michael Klompas,et al.  Uses of electronic health records for public health surveillance to advance public health. , 2015, Annual review of public health.

[11]  K. Klein,et al.  The association of the treatment with glucagon-like peptide-1 receptor agonist exenatide or insulin with cardiovascular outcomes in patients with type 2 diabetes: a retrospective observational study , 2015, Cardiovascular Diabetology.

[12]  K. Khunti,et al.  Hypoglycemia and Risk of Cardiovascular Disease and All-Cause Mortality in Insulin-Treated People With Type 1 and Type 2 Diabetes: A Cohort Study , 2014, Diabetes Care.

[13]  Emmanouel Garoufallou,et al.  The Effectiveness of Big Data in Health Care: A Systematic Review , 2014, MTSR.

[14]  Use of electronic medical records for clinical research in the management of type 2 diabetes. , 2014, Research in social & administrative pharmacy : RSAP.

[15]  M. Etminan Reporting guidelines for pharmacoepidemiological studies are urgently needed , 2014, BMJ : British Medical Journal.

[16]  S. Gudbjörnsdottir,et al.  Albuminuria and renal function as predictors of cardiovascular events and mortality in a general population of patients with type 2 diabetes: A nationwide observational study from the Swedish National Diabetes Register , 2013, Diabetes & vascular disease research.

[17]  R. Hansen,et al.  Real-world utilization patterns and outcomes of colesevelam hcl in the ge electronic medical record , 2013, BMC Endocrine Disorders.

[18]  K Furu,et al.  The Nordic prescription databases as a resource for pharmacoepidemiological research—a literature review , 2013, Pharmacoepidemiology and drug safety.

[19]  J. Meyers,et al.  Real-world comparative outcomes of US type 2 diabetes patients initiating analog basal insulin therapy , 2013, Current medical research and opinion.

[20]  Hua Xu,et al.  Comparative analysis of pharmacovigilance methods in the detection of adverse drug reactions using electronic medical records , 2013, J. Am. Medical Informatics Assoc..

[21]  Carrie McAdam-Marx,et al.  Application of electronic medical record data for health outcomes research: a review of recent literature , 2013, Expert review of pharmacoeconomics & outcomes research.

[22]  Miriam Sturkenboom,et al.  Postmarketing Safety Surveillance , 2013, Drug Safety.

[23]  Preciosa M. ColomaGianluca Trifiro Where does Signal Detection Using Electronic Healthcare Records Fit into the Big Picture , 2013 .

[24]  J. A. Sánchez,et al.  Epidemiology of cardiovascular disease , 2013 .

[25]  Joshua C. Denny,et al.  Chapter 13: Mining Electronic Health Records in the Genomics Era , 2012, PLoS Comput. Biol..

[26]  Phil Aponte,et al.  The effectiveness of implementing an electronic health record on diabetes care and outcomes. , 2012, Health services research.

[27]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[28]  A. Martins,et al.  Overview of Pharmacoepidemiological Databases in the Assessment of Medicines Under Real-Life Conditions , 2012 .

[29]  P. Home,et al.  A comparison of duration of first prescribed insulin therapy in uncontrolled type 2 diabetes. , 2011, Diabetes research and clinical practice.

[30]  Lisa A. Grabenbauer,et al.  Electronic Health Record Adoption – Maybe It’s not about the Money , 2011, Applied Clinical Informatics.

[31]  I. Petersen,et al.  Prevalence of long-term oral glucocorticoid prescriptions in the UK over the past 20 years. , 2011, Rheumatology.

[32]  O. Baser,et al.  A real-world study of patients with type 2 diabetes initiating basal insulins via disposable pens , 2011, Advances in therapy.

[33]  Heather N. Watson,et al.  Use of electronic medical records (EMR) for oncology outcomes research: assessing the comparability of EMR information to patient registry and health claims data , 2011, Clinical epidemiology.

[34]  Lixia Yao,et al.  Electronic health records: Implications for drug discovery. , 2011, Drug discovery today.

[35]  N. Menachemi,et al.  Benefits and drawbacks of electronic health record systems , 2011 .

[36]  Prakash M. Nadkarni,et al.  Drug safety surveillance using de-identified EMR and claims data: issues and challenges , 2010, J. Am. Medical Informatics Assoc..

[37]  Pierre Zweigenbaum,et al.  Extracting medical information from narrative patient records: the case of medication-related information , 2010, J. Am. Medical Informatics Assoc..

[38]  Albert G Crawford,et al.  Comparison of GE Centricity Electronic Medical Record database and National Ambulatory Medical Care Survey findings on the prevalence of major conditions in the United States. , 2010, Population health management.

[39]  C. Mullins,et al.  Perspectives on electronic medical records adoption: electronic medical records (EMR) in outcomes research. , 2010, Patient related outcome measures.

[40]  B. Dean,et al.  Review: Use of Electronic Medical Records for Health Outcomes Research , 2009, Medical care research and review : MCRR.

[41]  A. Majeed,et al.  Identifying undiagnosed diabetes: cross-sectional survey of 3.6 million patients' electronic records. , 2008, The British journal of general practice : the journal of the Royal College of General Practitioners.

[42]  Jessica S. Ancker,et al.  Redesigning electronic health record systems to support public health , 2007, J. Biomed. Informatics.

[43]  Chengqi Zhang,et al.  Data preparation for data mining , 2003, Appl. Artif. Intell..

[44]  M. Dixon,et al.  - 1-Preparing Clean Views of Data for Data Mining , 2000 .