Maximum likelihood estimation in semiparametric selection bias models with application to AIDS vaccine trials

SUMMARY The following problem is treated: given s possibly selection biased samples from an unknown distribution function, and assuming that the sampling rule weight functions for each of the samples are mathematically specified up to a common unknown finite-dimensional parameter, how can we use the data to estimate the unknown parameters? We propose a simple maximum partial likelihood method for deriving the semiparametric maximum likelihood estimator. A discussion of assumptions under which the selection bias model is identifiable and uniquely estimable is presented. We motivate the need for the methodology by discussing the generalised logistic regression model (Gilbert, Self & Ashby, 1998), a semiparametric selection bias model which is useful for assessing from vaccine trial data how the efficacy of an HIV vaccine varies with characteristics of the exposing virus. We show through simulations and an example that the maximum likelihood estimator in the generalised logistic regression model has satisfactory finite-sample properties.