A data-augmentation method for infectious disease incidence data from close contact groups

A broad range of studies of preventive measures in infectious diseases gives rise to incidence data from close contact groups. Parameters of common interest in such studies include transmission probabilities and efficacies of preventive or therapeutic interventions. We estimate these parameters using discrete-time likelihood models. We augment the data with unobserved pairwise transmission outcomes and fit the model using the EM algorithm. A linear model derived from the likelihood based on the augmented data and fitted with the iteratively re-weighted least squares method is also discussed. Using simulations, we demonstrate the comparable accuracy and lower sensitivity to initial estimates of the proposed methods with data augmentation relative to the likelihood model based solely on the observed data. Two randomized household-based trials of zanamivir, an influenza antiviral agent, are analyzed using the proposed methods.

[1]  A. Nizam,et al.  Containing pandemic influenza with antiviral agents. , 2004, American journal of epidemiology.

[2]  I. Longini,et al.  Household and community transmission parameters from final distributions of infections in households. , 1982, Biometrics.

[3]  M. Halloran,et al.  Antiviral effects on influenza viral transmission and pathogenicity: observations from household-based trials. , 2006, American journal of epidemiology.

[4]  M. Elizabeth Halloran,et al.  A Frailty Mixture Model for Estimating Vaccine Efficacy , 1996 .

[5]  M E Halloran,et al.  Efficiency of Estimating Vaccine Efficacy for Susceptibility and Infectiousness: Randomization by Individual Versus Household , 1999, Biometrics.

[6]  C. Viboud,et al.  A Bayesian MCMC approach to study transmission of influenza: application to household longitudinal data , 2004, Statistics in medicine.

[7]  A. Monto,et al.  Inhaled zanamivir for the prevention of influenza in families. Zanamivir Family Study Group. , 2000, The New England journal of medicine.

[8]  A. M. Hasofer,et al.  Estimation in Epidemics with Incomplete Observations , 1997 .

[9]  A. Monto,et al.  Zanamivir prophylaxis: an effective strategy for the prevention of influenza types A and B within households. , 2002, The Journal of infectious diseases.

[10]  C. Macken,et al.  Mitigation strategies for pandemic influenza in the United States. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[11]  A. Nizam,et al.  Containing Pandemic Influenza at the Source , 2005, Science.

[12]  M Elizabeth Halloran,et al.  Design and evaluation of prophylactic interventions using infectious disease incidence data from close contact groups , 2006, Journal of the Royal Statistical Society. Series C, Applied statistics.

[13]  D. Balding,et al.  Analyses of infectious disease data from household outbreaks by Markov chain Monte Carlo methods , 2000 .

[14]  I. Longini,et al.  A generalized stochastic model for the analysis of infectious disease final size data. , 1991, Biometrics.

[15]  A. Langworthy,et al.  An influenza simulation model for immunization studies. , 1976, American journal of epidemiology.

[16]  R Brookmeyer,et al.  Analysis of infectious disease data from partner studies with unknown source of infection. , 1993, Biometrics.

[17]  G. Roberts,et al.  Bayesian inference for partially observed stochastic epidemics , 1999 .

[18]  E. Lawlor,et al.  A likelihood-based method of identifying contaminated lots of blood product. , 1999, International journal of epidemiology.

[19]  F. Hayden,et al.  Perspectives on antiviral use during pandemic influenza. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[20]  M E Halloran,et al.  Study designs for evaluating different efficacy and effectiveness aspects of vaccines. , 1997, American journal of epidemiology.

[21]  Haitao Chu,et al.  Estimating Vaccine Efficacy From Secondary Attack Rates , 2003 .

[22]  Xiao-Li Meng,et al.  The Art of Data Augmentation , 2001 .

[23]  Tom Britton,et al.  Estimating Vaccine Effects on Transmission of Infection from Household Outbreak Data , 2003, Biometrics.

[24]  R. Paap What are the advantages of MCMC based inference in latent variable models? , 2002 .

[25]  I M Longini,et al.  A discrete-time model for the statistical analysis of infectious disease incidence data. , 1992, Biometrics.

[26]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[27]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .