Administrative database research infrequently used validated diagnostic or procedural codes.

OBJECTIVE Administrative database research (ADR) frequently uses codes to identify diagnoses or procedures. The association of these codes with the condition it represents must be measured to gauge misclassification in the study. Measure the proportion of ADR studies using diagnostic or procedural codes that measured or referenced code accuracy. STUDY DESIGN AND SETTING Random sample of 150 MEDLINE-cited ADR studies stratified by year of publication. The proportion of ADR studies using codes to define patient cohorts, exposures, or outcomes that measured or referenced code accuracy and Bayesian estimates for probability of disease given code operating characteristics were measured. RESULTS One hundred fifteen ADR studies (76.7% [95% confidence interval (CI), 69.3-82.8]) used codes. Of these studies, only 14 (12.1% [7.3-19.5]) measured or referenced the association of the code with the entity it supposedly represented. This proportion did not vary by year of publication but was significantly higher in journals with greater impact factors. Of five studies reporting code sensitivity and specificity, the estimated probability of code-related condition in code-positive patients was less than 50% in two. CONCLUSION In ADR, diagnostic and procedural codes are commonly used but infrequently validated. People with a code frequently do not have the condition it represents.

[1]  J. Sunyer,et al.  Nondifferential disease misclassification may bias incidence risk ratios away from the null. , 2006, Journal of clinical epidemiology.

[2]  S. Jain,et al.  Assessing the Accuracy of Administrative Data in Health Information Systems , 2004, Medical care.

[3]  Elena Losina,et al.  Rates and Outcomes of Primary and Revision Total Hip Replacement in the United States Medicare Population , 2003, The Journal of bone and joint surgery. American volume.

[4]  Long-Term Medical and Social Consequences of Preterm Birth , 2009 .

[5]  N. Krause,et al.  Socioeconomic status, occupation, and risk of hospitalisation due to coxarthrosis in Denmark 1981–99 , 2003 .

[6]  H Brenner,et al.  Variation of sensitivity, specificity, likelihood ratios and predictive values with disease prevalence. , 1997, Statistics in medicine.

[7]  Emmanuel Lesaffre,et al.  Estimating Disease Prevalence in a Bayesian Framework Using Probabilistic Constraints , 2006, Epidemiology.

[8]  H Checkoway,et al.  Bias due to misclassification in the estimation of relative risk. , 1977, American journal of epidemiology.

[9]  J. Avorn,et al.  Improvements in long-term mortality after myocardial infarction and increased use of cardiovascular drugs after discharge: a 10-year trend analysis. , 2008, Journal of the American College of Cardiology.

[10]  Sharon-Lise T Normand,et al.  Pay for performance in commercial HMOs. , 2006, The New England journal of medicine.

[11]  B. Graubard,et al.  Statistical validation of intermediate endpoints for chronic diseases. , 1992, Statistics in medicine.

[12]  B. Gladen,et al.  Estimating prevalence from the results of a screening test. , 1978, American journal of epidemiology.

[13]  V De Gruttola,et al.  Estimating the proportion of treatment effect explained by a surrogate marker. , 1997, Statistics in medicine.

[14]  P. Chyou Patterns of bias due to differential misclassification by case–control status in a case–control study , 2007, European Journal of Epidemiology.

[15]  P. Austin,et al.  Utilisation of coronary angiography after acute myocardial infarction in Ontario over time: have referral patterns changed? , 2002, Heart.

[16]  Anne Gadomski,et al.  Impact of a Medicaid Primary Care Provider and Preventive Care on Pediatric Hospitalization , 1998, Pediatrics.

[17]  L. Lesko,et al.  Use of biomarkers and surrogate endpoints in drug development and regulatory decision making: criteria, validation, strategies. , 2001, Annual review of pharmacology and toxicology.

[18]  P. Gideon,et al.  Nonaspirin NSAIDs, Cyclooxygenase 2 Inhibitors, and the Risk for Stroke , 2008, Stroke.

[19]  J. Peters,et al.  The Epidemiology and Cost of Inpatient Care for Peripheral Vascular Disease, Infection, Neuropathy, and Ulceration in Diabetes , 1998, Diabetes Care.

[20]  K. Jamrozik,et al.  Falling incidence of amputations for peripheral occlusive arterial disease in Western Australia between 1980 and 1992. , 1997, European journal of vascular and endovascular surgery : the official journal of the European Society for Vascular Surgery.

[21]  A. Forster,et al.  Derivation and validation of a MEDLINE search strategy for research studies that use administrative data. , 2010, Health Services Research.

[22]  J. Skranes,et al.  Cerebral palsy in Norway: prevalence, subtypes and severity. , 2008, European journal of paediatric neurology : EJPN : official journal of the European Paediatric Neurology Society.

[23]  M. Render,et al.  Methods for Estimating and Comparing VA Outpatient Drug Benefits With the Private Sector , 2003, Medical care.

[24]  A. Walker,et al.  A systematic review of discharge coding accuracy. , 2001, Journal of public health medicine.

[25]  HyunSook Ryu Song,et al.  Climbing up the pay-for-performance learning curve: where are the early adopters now? , 2007, Health affairs.

[26]  Lawrence Joseph,et al.  Estimating the prevalence of a rare disease: adjusted maximum likelihood , 1998 .