A selection-bias free method to estimate the prevalence of hypertension from an administrative primary health care database in the Girona Health Region, Spain

The prevalence of common illnesses could be estimated using general practice databases, providing certain advantages when compared to other alternative sources of information, in particular being relatively more cost-effective. The main limitation is that it is a threat of selection bias. Some individuals have a higher probability of having used primary health care, implying that the potential result, 'contact registration', is overrepresented in the sample observed. The selection bias would provide inconsistent estimators of prevalence. The objective of this study is to propose a bias-free selection method to estimate prevalence using an administrative primary health care database. It proposes re-weighting the estimations of prevalence obtained from the database according to the probability of their being present in the same. These probabilities will be appropriately estimated from a health survey using a treatment effects model with a discrete response, i.e. a hurdle model. As an application, it was estimated the prevalence of hypertension in the population covered by public primary health care providers in the Girona Health Region, Spain, in 2005. Using this bias-free selection method the prevalence of hypertension has been estimated that 15.5% of individuals aged 15 and above (14.1% among men and 16.9% among women) suffer from hypertension. Likewise, the prevalence is estimated at 31.1% (30.3% men and 32.0% women) in individuals aged 45 and over; 48.3% (44.1% men and 51.9% women) among those aged 65 and over; and 13.1% (11.8% men and 13.9% women) in the general population. The proposed method provides estimators of the prevalence of hypertension very close to those obtained directly from the 2006 Catalan Health Survey. It is concluded that the proposed method could be used to estimate the prevalence of hypertension in an approximately unbiased form. Given the use of the administrative primary health care database corresponding to all users of primary health care in the 23 public managed Health Areas of the Girona Health Region during 2005, the proposed method will be more cost-effective and will provide much more population information than health questionnaires.

[1]  G. Coenders,et al.  Use of primary health care services according to the different degrees of obesity in the Girona Health Region, Spain. , 2006, Health economics.

[2]  J. S. Silva,et al.  Two-part multiple spell models for health care demand , 2001 .

[3]  R. Hoogenveen,et al.  Disease prevalence estimations based on contact registrations in general practice , 2002, Statistics in medicine.

[4]  M. Saez Condicionantes en la utilización de los servicios de atención primaria. Evidencias empíricas e inconsistencias metodológicas , 2003 .

[5]  R. Westerling,et al.  Measures of prevalence: which healthcare registers are applicable? , 2001, Scandinavian journal of public health.

[6]  R. M. Schwartz,et al.  Administrative data for quality improvement. , 1999, Pediatrics.

[7]  F. Schellevis,et al.  Prevalence estimates of asthma or COPD from a health interview survey and from general practitioner registration: what's the difference? , 2006, European journal of public health.

[8]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[9]  Andrew M. Jones,et al.  Explaining income-related inequalities in doctor utilisation in Europe. , 2004, Health economics.

[10]  J. Heckman Sample selection bias as a specification error , 1979 .

[11]  Partha Deb,et al.  The structure of demand for health care: latent class versus two-part models. , 2002, Journal of health economics.

[12]  M. Sáez [Factors conditioning primary care services utilization. Empirical evidence and methodological inconsistencies]. , 2003, Gaceta sanitaria.

[13]  Frank A. Sloan,et al.  Valuing health care , 1996 .

[14]  E. Doorslaer,et al.  EQUITY IN THE USE OF PHYSICIAN VISITS IN OECD COUNTRIES: HAS EQUAL TREATMENT FOR EQUAL NEED BEEN ACHIEVED? , 2003 .

[15]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[16]  H T Sorensen,et al.  A framework for evaluation of secondary data sources for epidemiological research. , 1996, International journal of epidemiology.

[17]  R. Senior,et al.  Community screening for left ventricular hypertrophy in patients with hypertension using hand-held echocardiography. , 2004, Journal of the American Society of Echocardiography : official publication of the American Society of Echocardiography.

[18]  R. Lawrenson,et al.  Clinical information for research; the use of general practice databases. , 1999, Journal of public health medicine.

[19]  Y. Lacasse,et al.  The validity of diagnosing chronic obstructive pulmonary disease from a large administrative database. , 2005, Canadian respiratory journal.

[20]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[21]  Robert A. Moffitt,et al.  An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics , 1998 .

[22]  J. Nyman,et al.  The value of health insurance: the access motive. , 1999, Journal of health economics.

[23]  Willard G. Manning,et al.  Valuing health care: Statistical issues in cost–effectiveness analyses , 1995 .

[24]  Paul D. Williams,et al.  A user's guide to the General Health Questionnaire , 1988 .

[25]  Mireia Jofre-Bonet,et al.  Public health care and private insurance demand: The waiting time as a link , 2000, Health care management science.

[26]  G. Mazzaglia,et al.  Prevalence estimates for chronic diseases in Italy: exploring the differences between self-report and primary care databases. , 2003, Journal of public health medicine.

[27]  M. Engelgau,et al.  Review of the performance of methods to identify diabetes cases among vital statistics, administrative, and survey data. , 2004, Annals of epidemiology.

[28]  A. Domingo-Salvany,et al.  [Proposal for a social class measure. Working Group of the Spanish Society of Epidemiology and the Spanish Society of Family and Community Medicine]. , 2000, Atencion primaria.

[29]  L. Ramsay,et al.  Coronary and cardiovascular risk estimation in uncomplicated mild hypertension. A comparison of risk assessment methods , 2002, Journal of hypertension.

[30]  Kenneth H. Pollock,et al.  Modeling capture, recapture, and removal statistics for estimation of demographic parameters for fish and wildlife populations : Past, present, and future , 1991 .

[31]  Winfried Pohlmeier,et al.  An Econometric Model of the Two-Part Decisionmaking Process in the Demand for Health Care , 1995 .

[32]  R E LaPorte,et al.  Ascertainment corrected rates: applications of capture-recapture methods. , 1993, International journal of epidemiology.

[33]  R. Cooper,et al.  Patterns of comorbidity and mortality risk in blacks and whites. , 1996, Annals of epidemiology.

[34]  J. Coresh,et al.  Prevalence of chronic kidney disease in the United States. , 2007, JAMA.

[35]  S. Jiménez-Mart́ın,et al.  Latent class versus two-part models in the demand for physician services across the European Union. , 2002, Health economics.

[36]  Nathaniel Schenker,et al.  Combining Estimates from Complementary Surveys: A Case Study Using Prevalence Estimates from National Health Surveys of Households and Nursing Homes , 2002, Public health reports.

[37]  R. Regal,et al.  Effect of variation in probability of ascertainment by sources ("variable catchability") upon "capture-recapture" estimates of prevalence. , 1993, American journal of epidemiology.

[38]  L L Roos,et al.  Estimating the burden of disease. Comparing administrative data and self-reports. , 1997, Medical care.

[39]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[40]  Elizabeth R. DeLong,et al.  Statistical issues in cost-effectiveness analysis , 1999 .

[41]  E. Doorslaer,et al.  Income-Related Inequality in the Use of Medical Care in 21 OECD Countries , 2004 .

[42]  L. Iezzoni Assessing Quality Using Administrative Data , 1997, Annals of Internal Medicine.

[43]  J. Carstensen,et al.  Estimating disease prevalence using a population-based administrative healthcare database , 2007, Scandinavian journal of public health.

[44]  G. Mazzaglia,et al.  Computerized general practice databases provide quick and cost-effective information on the prevalence of angina pectoris. , 2005, Italian heart journal : official journal of the Italian Federation of Cardiology.

[45]  Janet E Hux,et al.  Diabetes in Ontario: determination of prevalence and incidence using a validated administrative data algorithm. , 2002, Diabetes care.

[46]  B. Vaïsse,et al.  [Estimation of the number of patients treated for hypertension, diabetes or hyperlipidemia in France: FLAHS study 2002]. , 2003, Archives des maladies du coeur et des vaisseaux.

[47]  Andrew M. Jones,et al.  The impact of supplementary private health insurance on the use of specialists in European countries , 2006 .