Identification of Dyslipidemic Patients Attending Primary Care Clinics Using Electronic Medical Record (EMR) Data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Database

The objective of this study was to define the optimal algorithm to identify patients with dyslipidemia using electronic medical records (EMRs). EMRs of patients attending primary care clinics in St. John’s, Newfoundland and Labrador (NL), Canada during 2009–2010, were studied to determine the best algorithm for identification of dyslipidemia. Six algorithms containing three components, dyslipidemia ICD coding, lipid lowering medication use, and abnormal laboratory lipid levels, were tested against a gold standard, defined as the existence of any of the three criteria. Linear discriminate analysis, and bootstrapping were performed following sensitivity/specificity testing and receiver’s operating curve analysis. Two validating datasets, NL records of 2011–2014, and Canada-wide records of 2010–2012, were used to replicate the results. Relative to the gold standard, combining laboratory data together with lipid lowering medication consumption yielded the highest sensitivity (99.6%), NPV (98.1%), Kappa agreement (0.98), and area under the curve (AUC, 0.998). The linear discriminant analysis for this combination resulted in an error rate of 0.15 and an Eigenvalue of 1.99, and the bootstrapping led to AUC: 0.998, 95% confidence interval: 0.997–0.999, Kappa: 0.99. This algorithm in the first validating dataset yielded a sensitivity of 97%, Negative Predictive Value (NPV) = 83%, Kappa = 0.88, and AUC = 0.98. These figures for the second validating data set were 98%, 93%, 0.95, and 0.99, respectively. Combining laboratory data with lipid lowering medication consumption within the EMR is the best algorithm for detecting dyslipidemia. These results can generate standardized information systems for dyslipidemia and other chronic disease investigations using EMRs.

[1]  J V Tu,et al.  Myocardial infarction and the validation of physician billing and hospitalization data using electronic medical records. , 2010, Chronic diseases in Canada.

[2]  W. Trick,et al.  Electronic Algorithmic Prediction of Central Vascular Catheter Use , 2010, Infection Control & Hospital Epidemiology.

[3]  T. Williamson,et al.  Does the Prevalence of Dyslipidemias Differ between Newfoundland and the Rest of Canada? Findings from the Electronic Medical Records of the Canadian Primary Care Sentinel Surveillance Network , 2015, Front. Cardiovasc. Med..

[4]  Bruce E. Bray,et al.  A bootstrapping algorithm to improve cohort identification using structured data , 2011, J. Biomed. Informatics.

[5]  Karen Tu,et al.  Using data from electronic medical records: theory versus practice. , 2008, Healthcare quarterly.

[6]  T. Williamson,et al.  Low density lipoprotein cholesterol control status among Canadians at risk for cardiovascular disease: findings from the Canadian Primary Care Sentinel Surveillance Network Database , 2015, Lipids in Health and Disease.

[7]  Donald R. Miller,et al.  Who has diabetes? Best estimates of diabetes prevalence in the Department of Veterans Affairs based on computerized patient data. , 2004, Diabetes care.

[8]  N. Chaiyakunapruk,et al.  Validation of electronic medical database in patients with atrial fibrillation in community hospitals. , 2011, Journal of the Medical Association of Thailand = Chotmaihet thangphaet.

[9]  K. Collins,et al.  Using Electronic Medical Record to Identify Patients With Dyslipidemia in Primary Care Settings: International Classification of Disease Code Matters From One Region to a National Database , 2017, Biomedical informatics insights.

[10]  L. Joseph,et al.  Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. , 1995, American journal of epidemiology.

[11]  T. Williamson,et al.  Single and mixed dyslipidaemia in Canadian primary care settings: findings from the Canadian primary care sentinel surveillance network database , 2015, BMJ Open.

[12]  L. Radican,et al.  Differences in baseline characteristics between patients prescribed sitagliptin versus exenatide based on a US electronic medical record database , 2010, Advances in therapy.

[13]  M. Lougheed,et al.  Asthma Electronic Medical Records in Primary Care: An Integrative Review , 2010, The Journal of asthma : official journal of the Association for the Care of Asthma.

[14]  M. Martinell,et al.  Prevalence of lipid abnormalities before and after introduction of lipid modifying therapy among Swedish patients with dyslipidemia (PRIMULA) , 2010, BMC public health.

[15]  John F. Hurdle,et al.  Measuring diagnoses: ICD code accuracy. , 2005, Health services research.

[16]  Scott T. Weiss,et al.  Prediction of chronic obstructive pulmonary disease (COPD) in asthma patients using electronic medical records. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[17]  Robert Dufour,et al.  Canadian Cardiovascular Society / Canadian guidelines for the diagnosis and treatment of dyslipidemia and prevention of cardiovascular disease in the adult – 2009 recommendations , 2009 .

[18]  D. Manuel,et al.  Burden of cardiovascular disease in Canada. , 2003, The Canadian journal of cardiology.

[19]  J. Biskupiak,et al.  Prevalence of High‐Risk Cardiovascular Conditions and the Status of Hypertension Management Among Hypertensive Adults 65 Years and Older in the United States: Analysis of a Primary Care Electronic Medical Records Database , 2010, Journal of clinical hypertension.

[20]  Sheila A Ryan,et al.  The use of aggregate data for measuring practice improvement. , 2002, Seminars for nurse managers.

[21]  Karim Keshavjee,et al.  Building a Pan-Canadian Primary Care Sentinel Surveillance Network: Initial Development and Moving Forward , 2009, The Journal of the American Board of Family Medicine.

[22]  S. Asai,et al.  Effect of candesartan monotherapy on lipid metabolism in patients with hypertension: a retrospective longitudinal survey using data from electronic medical records , 2010, Cardiovascular diabetology.

[23]  Jessica Chubak,et al.  Tradeoffs between accuracy measures for electronic health care data algorithms. , 2012, Journal of clinical epidemiology.

[24]  Karen Tu,et al.  Diabetics can be identified in an electronic medical record using laboratory tests and prescriptions. , 2011, Journal of clinical epidemiology.

[25]  G. Cooper,et al.  The use of screening colonoscopy for patients cared for by the Department of Veterans Affairs. , 2006, Archives of internal medicine.

[26]  Hye Jin Kam,et al.  A novel algorithm for detection of adverse drug reaction signals using a hospital electronic medical record database , 2011, Pharmacoepidemiology and drug safety.

[27]  Daniel J. Mollura,et al.  Electronic Medical Record (EMR) Utilization for Public Health Surveillance , 2008, AMIA.

[28]  D. Protti Comparison of information technology in general practice in 10 countries. , 2007, Healthcare Quarterly.

[29]  J. Couto,et al.  Prevalence of obesity, type II diabetes mellitus, hyperlipidemia, and hypertension in the United States: findings from the GE Centricity Electronic Medical Record database. , 2010, Population health management.

[30]  J. Tu,et al.  Regional variations in cardiovascular mortality in Canada. , 2003, The Canadian journal of cardiology.

[31]  C. McAdam-Marx,et al.  Results of a retrospective, observational pilot study using electronic medical records to assess the prevalence and characteristics of patients with resistant hypertension in an ambulatory care setting. , 2009, Clinical therapeutics.

[32]  B. Trock,et al.  Validity of administrative coding in identifying patients with upper urinary tract calculi. , 2010, The Journal of urology.

[33]  B. Gage,et al.  Accuracy of ICD-9-CM Codes for Identifying Cardiovascular and Stroke Risk Factors , 2005, Medical care.