The Quality of Routinely Collected Data: Using the "Principal Diagnosis" in Emergency Department Databases as an Example

Objectives: This paper aims to estimate the reliability of using "principal diagnosis" to identify people with diabetes mellitus (DM), cardiovascular diseases (CVD), and asthma or chronic obstructive pulmonary disease (COPD) in Firstnet, the  emergency department (ED)  module of the NSW Health Electronic Medical Record (eMR). Methods: A list of patients who attended a community hospital ED in 2009 with a specific "principal diagnosis" of DM, CVD, or asthma/COPD, or inferred based on possible keywords, was generated from Firstnet. This Firstnet list was compared with a list extracted from the underlying eMR database tables, using similar specific and possible coded terms. The concordance for an episode of care and for the overall was calculated. Patients on the Firstnet list who were admitted had their discharge summaries audited to confirm the principal diagnosis. The proportion of admitted patients correctly identified as having one of the chronic diseases was calculated. Results: The Firstnet list contained 2,559 patients with a principal diagnosis of DM, CVD, or asthma/COPD. The concordance (episode) of the Firstnet list with the eMR list were: 87% of CVD cases, 69% of DM and 38% of asthma/COPD cases. The audit of the discharge summaries of the Firstnet patients who were admitted confirmed the diagnosis of DM, asthma/COPD, and CVD for 79%, 66%, and 56% of the patients respectively. Discussion: An empirical method to examine the accuracy of the prinicipal diagnosis in Firstnet is described. The incomplete concordance of diagnoses of the selected chronic diseases generated via different modules of the same information system raises doubts about the reliability of data and information quality collected, stored and used by the eMR. Further research is required to understand the determinants of data quality and develop tools to automate data quality assessment and management. This is particularly important with the increasing use of eMR in routine clinical practice and use of routinely collected clinical data for clinical and research purposes.

[1]  L. Melton,et al.  Determining perioperative complications associated with vaginal hysterectomy: code classification versus chart review. , 2009, Journal of the American College of Surgeons.

[2]  R. Feachem,et al.  Getting more for their dollar: a comparison of the NHS with California's Kaiser Permanente. , 2002, BMJ.

[3]  Chunliu Zhan,et al.  Identification of Hospital-Acquired Catheter-Associated Urinary Tract Infections From Medicare Claims: Sensitivity and Positive Predictive Value , 2009, Medical care.

[4]  J. Allison,et al.  Administrative codes combined with medical records based criteria accurately identified bacterial infections among rheumatoid arthritis patients. , 2009, Journal of clinical epidemiology.

[5]  Hude Quan,et al.  Validation of a Case Definition to Define Hypertension Using Administrative Data , 2009, Hypertension.

[6]  R. Baker,et al.  Primary healthcare teams’ views on using mortality data to review clinical policies , 2007, Quality and Safety in Health Care.

[7]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[8]  C. Weel,et al.  The use of routinely collected computer data for research in primary care: opportunities and challenges. , 2006, Family practice.

[9]  Jonathan M Morris,et al.  The prevalence of maternal medical conditions during pregnancy and a validation of their reporting in hospital discharge data , 2008, The Australian & New Zealand journal of obstetrics & gynaecology.

[10]  M L Moro,et al.  Can hospital discharge diagnoses be used for surveillance of surgical-site infections? , 2004, The Journal of hospital infection.

[11]  Simon de Lusignan,et al.  Routinely-collected general practice data are complex, but with systematic processing can be used for quality improvement and research. , 2006, Informatics in primary care.