Uses of the EM algorithm in the analysis of data on HIV/AIDS and other infectious diseases

The analysis of data on infectious diseases is a natural setting for applications of the EM algorithm, because the infection process is only partially observable. Difficulties in determining the expectation at the E step have been side-stepped by adopting pragmatic models which reflect only part of the mechanism that generates the data. In the HIV/AIDS context the EM algorithm has helped in the reconstruction of the unobserved HIV infection curve, the so-called backprojection problem, as well as in the estimation of the distribution for the incubation period until AIDS, in estimating the infectivity of HIV in partnerships and in estimating parameters describing the decline in the immune system. There is a need for smooth estimates of functions in these applications, suggesting the use of the EMS algorithm or use of the EM algorithm to maximize a penalized likelihood. For data on other infectious diseases the application of the EM algorithm has so far been restricted to analyses of data on the size of outbreaks in a sample of households.

[1]  N. Becker,et al.  A general chain binomial model for infectious diseases. , 1981, Biometrics.

[2]  J. Sun,et al.  Empirical estimation of a distribution function with truncated and doubly interval-censored data and its application to AIDS studies. , 1995, Biometrics.

[3]  S W Lagakos,et al.  Analyzing doubly censored data with covariates, with application to AIDS. , 1993, Biometrics.

[4]  P. Bacchetti,et al.  Nonparametric estimation of the incubation period of AIDS based on a prevalent cohort with unknown infection times. , 1991, Biometrics.

[5]  I. Marschner,et al.  Simultaneous back-projection of AIDS incidence data for two or more groups. , 1994, Statistics in medicine.

[6]  N. Becker,et al.  Dependent HIV incidences in back-projection of AIDS incidence data. , 1994, Statistics in medicine.

[7]  Xiao-Li Meng,et al.  Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm , 1991 .

[8]  T R Holford,et al.  Change points in the series of T4 counts prior to AIDS. , 1995, Biometrics.

[9]  H. Frydman Semiparametric estimation in a three-state duration-dependent Markov model from interval-censored observations with application to AIDS data. , 1995, Biometrics.

[10]  J. D. Wilson,et al.  A smoothed EM approach to indirect estimation problems, with particular reference to stereology and emission tomography , 1990 .

[11]  S W Lagakos,et al.  Analysis of doubly-censored survival data, with application to AIDS. , 1989, Biometrics.

[12]  R Brookmeyer,et al.  Analysis of infectious disease data from partner studies with unknown source of infection. , 1993, Biometrics.

[13]  P. Bacchetti,et al.  A competing risks analysis of presenting AIDS diagnoses trends. , 1996, Biometrics.

[14]  I. Marschner,et al.  Using time of first positive HIV test and other auxiliary data in back-projection of AIDS incidence. , 1994, Statistics in medicine.

[15]  J B Carlin,et al.  A method of non-parametric back-projection and its application to AIDS data. , 1991, Statistics in medicine.

[16]  N. Becker,et al.  Use of Empirical Transformations in Nonparametric Back-projection of Aids Incidence Data , 1992 .

[17]  Ian C. Marschner,et al.  A method for estimating the age-specific relative risk of HIV infection from AIDS incidence data , 1993 .

[18]  I. Longini,et al.  Household and community transmission parameters from final distributions of infections in households. , 1982, Biometrics.

[19]  F. O’Sullivan A Statistical Perspective on Ill-posed Inverse Problems , 1986 .

[20]  K Dietz,et al.  Back-projection of German AIDS data using information on dates of tests. , 1994, Statistics in medicine.

[21]  Ian C. Marschner,et al.  An improved ems algorithm for back-projection of aids incidence data , 1994 .

[22]  Mark R. Segal,et al.  Variances for Maximum Penalized Likelihood Estimates Obtained via the EM Algorithm , 1994 .

[23]  J. Liao,et al.  An empirical Bayes approach to smoothing in backcalculation of HIV infection rates. , 1995, Biometrics.

[24]  Mark R. Segal,et al.  Backcalculation of HIV Infection Rates , 1993 .

[25]  N G Becker Parametric inference for epidemic models. , 1993, Mathematical biosciences.

[26]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[27]  Mitchell H. Gail,et al.  A Method for Obtaining Short-Term Projections and Lower Bounds on the Size of the AIDS Epidemic , 1988 .

[28]  B. Turnbull The Empirical Distribution Function with Arbitrarily Grouped, Censored, and Truncated Data , 1976 .

[29]  X M Tu,et al.  Regression analysis of censored and truncated data: estimating reporting-delay distributions and AIDS incidence from surveillance data. , 1994, Biometrics.