Chronic disease prevalence from Italian administrative databases in the VALORE project: a validation through comparison of population estimates with general practice databases and national survey

BackgroundAdministrative databases are widely available and have been extensively used to provide estimates of chronic disease prevalence for the purpose of surveillance of both geographical and temporal trends. There are, however, other sources of data available, such as medical records from primary care and national surveys. In this paper we compare disease prevalence estimates obtained from these three different data sources.MethodsData from general practitioners (GP) and administrative transactions for health services were collected from five Italian regions (Veneto, Emilia Romagna, Tuscany, Marche and Sicily) belonging to all the three macroareas of the country (North, Center, South). Crude prevalence estimates were calculated by data source and region for diabetes, ischaemic heart disease, heart failure and chronic obstructive pulmonary disease (COPD). For diabetes and COPD, prevalence estimates were also obtained from a national health survey. When necessary, estimates were adjusted for completeness of data ascertainment.ResultsCrude prevalence estimates of diabetes in administrative databases (range: from 4.8% to 7.1%) were lower than corresponding GP (6.2%-8.5%) and survey-based estimates (5.1%-7.5%). Geographical trends were similar in the three sources and estimates based on treatment were the same, while estimates adjusted for completeness of ascertainment (6.1%-8.8%) were slightly higher. For ischaemic heart disease administrative and GP data sources were fairly consistent, with prevalence ranging from 3.7% to 4.7% and from 3.3% to 4.9%, respectively. In the case of heart failure administrative estimates were consistently higher than GPs’ estimates in all five regions, the highest difference being 1.4% vs 1.1%. For COPD the estimates from administrative data, ranging from 3.1% to 5.2%, fell into the confidence interval of the Survey estimates in four regions, but failed to detect the higher prevalence in the most Southern region (4.0% in administrative data vs 6.8% in survey data). The prevalence estimates for COPD from GP data were consistently higher than the corresponding estimates from the other two sources.ConclusionThis study supports the use of data from Italian administrative databases to estimate geographic differences in population prevalence of ischaemic heart disease, treated diabetes, diabetes mellitus and heart failure. The algorithm for COPD used in this study requires further refinement.

[1]  L. Lix,et al.  Population-based data sources for chronic disease surveillance. , 2008, Chronic diseases in Canada.

[2]  H. Ellekjær,et al.  Identification of incident stroke in Norway: hospital discharge data compared with a population-based stroke register. , 1999, Stroke.

[3]  G. Mazzaglia,et al.  Computerized general practice databases provide quick and cost-effective information on the prevalence of angina pectoris. , 2005, Italian heart journal : official journal of the Italian Federation of Cardiology.

[4]  Vicki Freedman,et al.  Specificity and sensitivity of claims-based algorithms for identifying members of Medicare+Choice health plans that have chronic medical conditions. , 2004, Health services research.

[5]  R. Westerling,et al.  Measures of prevalence: which healthcare registers are applicable? , 2001, Scandinavian journal of public health.

[6]  Miguel A Hernán,et al.  With great data comes great responsibility: publishing comparative effectiveness research in epidemiology. , 2011, Epidemiology.

[7]  D. Louis,et al.  Using pharmacy data to identify those with chronic conditions in Emilia Romagna, Italy , 2005, Journal of health services research & policy.

[8]  M. Vigotti,et al.  [Objectives, tools and methods for an epidemiological use of electronic health archives in various areas of Italy]. , 2008, Epidemiologia e prevenzione.

[9]  G. Bruno,et al.  Socio-economic differences in the prevalence of diabetes in Italy: the population-based Turin study. , 2008, Nutrition, metabolism, and cardiovascular diseases : NMCD.

[10]  Jerry H. Gurwitz,et al.  A systematic review of validated methods for identifying heart failure using administrative data , 2012, Pharmacoepidemiology and drug safety.

[11]  Y. Lacasse,et al.  The validity of diagnosing chronic obstructive pulmonary disease from a large administrative database. , 2005, Canadian respiratory journal.

[12]  G. Mazzaglia,et al.  Prevalence estimates for chronic diseases in Italy: exploring the differences between self-report and primary care databases. , 2003, Journal of public health medicine.

[13]  Walter Ricciardi,et al.  Italy: health system review. , 2014, Health systems in transition.

[14]  Sophie Couffignal,et al.  An algorithm to identify patients with treated type 2 diabetes using medico-administrative data , 2011, BMC Medical Informatics Decis. Mak..

[15]  Annunziata Faustini,et al.  The Reliability of Hospital and Pharmaceutical Data to Assess Prevalent Cases of Chronic Obstructive Pulmonary Disease , 2012, COPD.

[16]  Capture-recapture and multiple-record systems estimation I: History and theoretical development. International Working Group for Disease Monitoring and Forecasting. , 1995, American journal of epidemiology.

[17]  G. Privitera,et al.  epidemiologia e prevenzione , 2014 .

[18]  Jerry H. Gurwitz,et al.  Mini-Sentinel Systematic Evaluation of Health Outcome of Interest Definitions for Studies Using Administrative and Claims Data: Heart Failure , 2012 .

[19]  Gabriella Guasticchi,et al.  Can we use the pharmacy data to estimate the prevalence of chronic conditions? a comparison of multiple data sources , 2011, BMC public health.

[20]  G. Tognoni,et al.  Prevalence of chronic obstructive pulmonary disease and pattern of comorbidities in a general population , 2007, International journal of chronic obstructive pulmonary disease.

[21]  P. O’Connor,et al.  Are Claims Data Accurate Enough to Identify Patients for Performance Measures or Quality Improvement? The Case of Diabetes, Heart Disease, and Depression , 2006, American journal of medical quality : the official journal of the American College of Medical Quality.

[22]  G. Bortolan,et al.  Prevalence of chronic diseases in older Italians: Comparing self-reported and clinical diagnoses , 1997 .

[23]  Amardeep Thind,et al.  Investigating concordance in diabetes diagnosis between primary care charts (electronic medical records) and health administrative data: a retrospective cohort study , 2010, BMC health services research.

[24]  J. Leikauf,et al.  Comparisons of Self‐Reported and Chart‐Identified Chronic Diseases in Inner‐City Seniors , 2009, Journal of the American Geriatrics Society.

[25]  Ronald E. LaPorte,et al.  Capture-recapture and multiple-record systems estimation I: History and theoretical development ( Review ) , 1995 .

[26]  L. Lix,et al.  Stroke surveillance in Manitoba, Canada: estimates from administrative databases. , 2008, Chronic diseases in Canada.

[27]  Stephen M. Anderson,et al.  The validity of using ICD-9 codes and pharmacy records to identify patients with chronic obstructive pulmonary disease , 2011, BMC health services research.

[28]  Ronald E. LaPorte,et al.  Capture-recapture and multiple-record systems estimation II: Applications in human diseases. International Working Group for Disease Monitoring and Forecasting. , 1995, American journal of epidemiology.

[29]  M. Brownell,et al.  Administrative record linkage as a tool for public health research. , 2011, Annual review of public health.