Estimating a survival curve with unlinked entry and failure times

In monitoring a clinical trial or other observational study with a survival endpoint, sometimes the numbers of patients entering and dying at each time point are presented, but the connections between them are kept confidential. Hence, the exact time to failure or censoring for each individual is missing. We refer to such a study monitoring table with missing pairing information between the entry and death times as a 'broken' survival data set. In this paper we study the problem of estimating the survival distribution from a broken survival data set. We have developed two methods, likelihood-based estimation and self-consistency estimation, to estimate the survival curve parametrically and empirically, respectively. We use simulations to study the properties of these methods, and illustrate them with data from the STELLAR-3 trial.

[1]  Jianguo Sun Self-Consistency Estimation of Distributions Based on Truncated and Doubly Censored Survival Data with Applications to AIDS Cohort Studies , 1997, Lifetime data analysis.

[2]  Prem K. Goel,et al.  Estimation of the Correlation Coefficient from a Broken Random Sample , 1980 .

[3]  W. Deming,et al.  On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known , 1940 .

[4]  N. S. D'Andrea Du Bois,et al.  A Solution to the Problem of Linking Multivariate Documents , 1969 .

[5]  S. Lagakos,et al.  Estimation of the infection time and latency distribution of AIDS with doubly censored data. , 1994, Biometrics.

[6]  M. Socinski,et al.  Paclitaxel poliglumex (PPX)/carboplatin vs paclitaxel/carboplatin for the treatment of PS2 patients with chemotherapy-naïve advanced non-small cell lung cancer (NSCLC): A phase III study , 2005 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  S. Kullback,et al.  Contingency tables with given marginals. , 1968, Biometrika.

[9]  Prem K. Goel,et al.  On Re-Pairing Observations in a Broken Random Sample , 1975 .

[10]  J M Lachin,et al.  Evaluation of sample size and power for analyses of survival with allowance for nonuniform patient entry, losses to follow-up, noncompliance, and stratification. , 1986, Biometrics.

[11]  Philip J Smith,et al.  Methods for capture–recapture analysis when cases lack personal identifiers , 2005, Statistics in medicine.

[12]  F. F. Stephan An Iterative Method of Adjusting Sample Frequency Tables When Expected Marginal Totals are Known , 1942 .

[13]  B. Turnbull The Empirical Distribution Function with Arbitrarily Grouped, Censored, and Truncated Data , 1976 .