Estimating net transition probabilities from cross‐sectional data with application to risk factors in chronic disease modeling

A problem occurring in chronic disease modeling is the estimation of transition probabilities of moving from one state of a categorical risk factor to another. Transitions could be obtained from a cohort study, but often such data may not be available. However, under the assumption that transitions remain stable over time, age specific cross-sectional prevalence data could be used instead. Problems that then arise are parameter identifiability and the fact that age dependent cross-sectional data are often noisy or are given in age intervals. In this paper we propose a method to estimate so-called net annual transition probabilities from cross-sectional data, including their uncertainties. Net transitions only describe the net inflow or outflow into a certain risk factor state at a certain age. Our approach consists of two steps: first, smooth the data using multinomial P-splines, second, from these data estimate net transition probabilities. This second step can be formulated as a transportation problem, which is solved using the simplex algorithm from linear programming theory. A sensible specification of the cost matrix is crucial to get meaningful results. Uncertainties are assessed by parametric bootstrapping. We illustrate our method using data on body mass index. We conclude that this method provides a flexible way of estimating net transitions and that the use of net transitions has implications for model dynamics, for example when modeling interventions.

[1]  M. Roberts,et al.  The use of multistate life-table models for improving population health. , 2001, IMA journal of mathematics applied in medicine and biology.

[2]  P. van Baal,et al.  Estimating health-adjusted life expectancy conditional on risk factors: results for smoking and obesity , 2006, Population health metrics.

[3]  Ben Pelzer,et al.  Bayesian estimation of transition probabilities from repeated cross sections , 2002 .

[4]  P. J. van der Maas,et al.  The health care costs of smoking. , 1997, The New England journal of medicine.

[5]  Rudolf T Hoogenveen,et al.  Chronic disease projections in heterogeneous ageing populations: approximating multi-state models of joint distributions by modelling marginal distributions. , 2010, Mathematical medicine and biology : a journal of the IMA.

[6]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[7]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[8]  C. Wild,et al.  Vector Generalized Additive Models , 1996 .

[9]  Rudolf T Hoogenveen,et al.  Lifetime Medical Costs of Obesity: Prevention No Cure for Increasing Health Expenditure , 2008, PLoS medicine.

[10]  Philip Hans Franses,et al.  Inferring Transition Probabilities from Repeated Cross Sections , 2002, Political Analysis.

[11]  Rudolf T Hoogenveen,et al.  Dynamic effects of smoking cessation on disease incidence, mortality and quality of life: The role of time since cessation , 2008, Cost effectiveness and resource allocation : C/E.

[12]  L. Gunning-Schepers,et al.  The health benefits of prevention: a simulation approach. , 1988, Health policy.