Measuring prevalence and incidence of chronic conditions in claims and electronic health record databases

Background Health care databases are natural sources for estimating prevalence and incidence of chronic conditions, but substantial variation in estimates limits their interpretability and utility. We evaluated the effects of design choices when estimating prevalence and incidence in claims and electronic health record databases. Methods Prevalence and incidence for five chronic diseases at increasing levels of expected frequencies, from cystic fibrosis to COPD, were estimated in the Clinical Practice Research Datalink (CPRD) and MarketScan databases from 2011 to 2014. Estimates were compared using different definitions of lookback time and contributed person-time. Results Variation in lookback time substantially affected estimates. In 2014, for CPRD, use of an all-time vs a 1-year lookback window resulted in 4.3–8.3 times higher prevalence (depending on disease), reducing incidence by 1.9–3.3 times. All-time lookback resulted in strong temporal trends. COPD prevalence between 2011 and 2014 in MarketScan increased by 25% with an all-time lookback but stayed relatively constant with a 1-year lookback. Varying observability did not substantially affect estimates. Conclusion This framework draws attention to the underrecognized potential for widely varying incidence and prevalence estimates, with implications for care planning and drug development. Though prevalence and incidence are seemingly straightforward concepts, careful consideration of methodology is required to obtain meaningful estimates from health care databases.

[1]  D. Margolis,et al.  Validity of The Health Improvement Network (THIN) for the study of psoriasis , 2011, The British journal of dermatology.

[2]  Kyungwon Oh,et al.  Data Resource Profile: The Korea National Health and Nutrition Examination Survey (KNHANES) , 2014, International journal of epidemiology.

[3]  R. Carnahan,et al.  A systematic review of validated methods for identifying lymphoma using administrative data , 2012, Pharmacoepidemiology and drug safety.

[4]  S. Kimmel,et al.  Examples of Automated Databases , 2013 .

[5]  T. To,et al.  Trends in chronic obstructive pulmonary disease prevalence, incidence, and mortality in ontario, Canada, 1996 to 2007: a population-based study. , 2010, Archives of internal medicine.

[6]  S. Suissa Immeasurable time bias in observational studies of drug effects on mortality. , 2008, American journal of epidemiology.

[7]  J. Rassen,et al.  Effects of expanding the look‐back period to all available data in the assessment of covariates , 2017, Pharmacoepidemiology and drug safety.

[8]  A. Armstrong,et al.  Psoriasis prevalence among adults in the United States. , 2014, Journal of the American Academy of Dermatology.

[9]  P. Lakatos,et al.  The burden of inflammatory bowel disease in Europe. , 2013, Journal of Crohn's & colitis.

[10]  R. Tamblyn,et al.  The use of prescription claims databases in pharmacoepidemiological research: the accuracy and comprehensiveness of the prescription claims database in Québec. , 1995, Journal of clinical epidemiology.

[11]  Shawn N Murphy,et al.  Out-of-system Care and Recording of Patient Characteristics Critical for Comparative Effectiveness Research , 2017, Epidemiology.

[12]  P. Farrell,et al.  The prevalence of cystic fibrosis in the European Union. , 2008, Journal of cystic fibrosis : official journal of the European Cystic Fibrosis Society.

[13]  Sebastian Schneeweiss,et al.  When and How Can Real World Data Analyses Substitute for Randomized Controlled Trials? , 2017, Clinical pharmacology and therapeutics.

[14]  E. Loftus,et al.  Incidence and Prevalence of Crohn’s Disease and Ulcerative Colitis in Olmsted County, Minnesota From 1970 Through 2010 , 2017, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[15]  J. Farley,et al.  Revisiting the washout period in the incident user study design: why 6-12 months may not be sufficient. , 2015, Journal of comparative effectiveness research.

[16]  Robin M. Murray,et al.  Incidence of Schizophrenia and Other Psychoses in England, 1950–2009: A Systematic Review and Meta-Analyses , 2012, PloS one.

[17]  Ben Birkman,et al.  The Cystic Fibrosis Foundation , 2017 .

[18]  M. Graffar [Modern epidemiology]. , 1971, Bruxelles medical.

[19]  E. Susser,et al.  Incidence and cumulative risk of treated schizophrenia in the prenatal determinants of schizophrenia study. , 2000, Schizophrenia bulletin.

[20]  Aziz Sheikh,et al.  Global and regional estimates of COPD prevalence: Systematic review and meta–analysis , 2015, Journal of global health.

[21]  P. J. Kelly,et al.  Inflammatory bowel disease: epidemiology and management in an English general practice population , 2000, Alimentary pharmacology & therapeutics.

[22]  Ken Kleinman,et al.  The prevalence and geographic distribution of Crohn's disease and ulcerative colitis in the United States. , 2007, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[23]  J. Avorn,et al.  A review of uses of health care utilization databases for epidemiologic research on therapeutics. , 2005, Journal of clinical epidemiology.

[24]  R. Carnahan,et al.  Mini‐Sentinel's systematic reviews of validated methods for identifying health outcomes using administrative data: summary of findings and suggestions for future research , 2012, Pharmacoepidemiology and drug safety.

[25]  S. Schneeweiss,et al.  Considerations for the analysis of longitudinal electronic health records linked to claims data to study the effectiveness and safety of drugs , 2016, Clinical pharmacology and therapeutics.

[26]  L. Lix,et al.  Influence of Using Different Databases and ‘Look Back’ Intervals to Define Comorbidity Profiles for Patients with Newly Diagnosed Hypertension: Implications for Health Services Researchers , 2016, PloS one.

[27]  J. Avorn,et al.  Supplementing claims data with outpatient laboratory test results to improve confounding adjustment in effectiveness studies of lipid-lowering treatments , 2012, BMC Medical Research Methodology.

[28]  K. Rascati,et al.  Estimating the direct and indirect costs for community‐dwelling patients with schizophrenia , 2013 .

[29]  E. Faerstein,et al.  A DICTIONARY OF EPIDEMIOLOGY , 2016 .

[30]  S. Gabriel,et al.  Trends in incidence of adult-onset psoriasis over three decades: a population-based study. , 2009, Journal of the American Academy of Dermatology.

[31]  K. Bhaskaran,et al.  Data Resource Profile: Clinical Practice Research Datalink (CPRD) , 2015, International journal of epidemiology.

[32]  K. Fox,et al.  Treatment and referral patterns for psoriasis in United Kingdom primary care: a retrospective cohort study , 2013, BMC Dermatology.