Hazard function estimators: a simulation study.

Kernel-based methods for the smooth, non-parametric estimation of the hazard function have received considerable attention in the statistical literature. Although the mathematical properties of the kernel-based hazard estimators have been carefully studied, their statistical properties have not. We reviewed various kernel-based methods for hazard function estimation from right-censored data and compared the statistical properties of these estimators through computer simulations. Our simulations covered seven distributions, three levels of random censoring, four types of bandwidth functions, two sample sizes and three types of boundary correction. We conducted a total of 504 simulation experiments with 500 independent samples each. Our results confirmed the advantages of two recent innovations in kernel estimation - boundary correction and locally optimal bandwidths. The median relative improvement (decrease) in mean square error over fixed-bandwidth estimators without boundary correction was 3 per cent for fixed-bandwidth estimators with left boundary correction, 52 per cent locally optimal bandwidths without boundary correction, and 66 per cent for locally optimal bandwidths with left boundary correction. The locally optimal bandwidth estimators with left boundary correction also outperformed three previously published and publicly available algorithms, with median relative improvements in mean square error of 31 per cent, 77 per cent and 80 per cent.

[1]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[2]  H. Dette,et al.  Nearest neighbour kernel estimation of the hazard function from censored data , 1992 .

[3]  Martin A. Tanner,et al.  The Estimation of the Hazard Function from Randomly Censored Data by the Kernel Method , 1983 .

[4]  P. Cheng A nearest neighbor hazard tare estimator for randomly censored data , 1987 .

[5]  M. R. Leadbetter,et al.  Hazard Analysis , 2018, System Safety Engineering and Risk Assessment.

[6]  A Histogram Estimator of the Hazard Rate with Censored Data , 1985 .

[7]  H. Ramlau-Hansen Smoothing Counting Process Intensities by Means of Kernel Functions , 1983 .

[8]  Wing Hung Wong,et al.  Data-Based Nonparametric Estimation of the Hazard Function with Applications to Model Diagnostics and Exploratory Analysis , 1984 .

[9]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[10]  J. Simonoff Smoothing Methods in Statistics , 1998 .

[11]  A. Bowman,et al.  Applied smoothing techniques for data analysis : the kernel approach with S-plus illustrations , 1999 .

[12]  M. Tanner Algorithm AS 202: Data-Based Non-Parametric Hazard Estimation , 1984 .

[13]  A Note on Data-Adaptive Kernel Estimation of the Hazard and Density Function in the Random Censorship Situation , 1985 .

[14]  James Stephen Marron,et al.  Bootstrap selection of the smoothing parameter in nonparametric hazard rate estimation , 1996 .

[15]  Winfried Stute,et al.  Kernel density and hazard function estimation in the presence of censoring , 1988 .

[16]  A. Izenman Recent Developments in Nonparametric Density Estimation , 1991 .

[17]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[18]  Hans-Georg Müller,et al.  Nonparametric analysis of changes in hazard rates for censored survival data: An alternative to change-point models , 1990 .

[19]  Jane-Ling Wang,et al.  Density and hazard rate estimation for censored data via strong representation of the Kaplan-Meier estimator , 1985 .

[20]  N. Kiefer Economic Duration Data and Hazard Functions , 1988 .

[21]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[22]  H. Müller,et al.  Hazard rate estimation under random censoring with varying kernels and bandwidths. , 1994, Biometrics.

[23]  Nozer D. Singpurwalla,et al.  Estimation of the failure rate-a survey of nonparametric methods Part I: Non-Bayesian Methods , 1983 .

[24]  James Stephen Marron,et al.  Asymptotically Optimal Bandwidth Selection for Kernel Density Estimators from Randomly Right-Censored Samples. , 1987 .

[25]  A. Izenman Review Papers: Recent Developments in Nonparametric Density Estimation , 1991 .

[26]  Barry W. Brown,et al.  Algorithm 762: LLDRLF, log-likelihood and some derivatives for log-F models , 1996, TOMS.

[27]  Niels Keiding,et al.  Statistical Models Based on Counting Processes , 1993 .

[28]  O. Gefeller,et al.  A Review on Smoothing Methods for the Estimation of the Hazard Rate Based on Kernel Functions , 1992 .

[29]  Jane-Ling Wang,et al.  A comparison of hazard rate estimators for left truncated and right censored data , 1992 .

[30]  W. J. Padgett,et al.  Nonparametric density estimation from censored data , 1984 .

[31]  R. Gray Some diagnostic methods for Cox regression models through hazard smoothing. , 1990, Biometrics.

[32]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[33]  Wayne Nelson,et al.  Hazard Plotting for Incomplete Failure Data , 1969 .

[34]  Elisa T. Lee,et al.  Statistical Methods for Survival Data Analysis , 1994, IEEE Transactions on Reliability.

[35]  Martin A. Tanner,et al.  A Note on the Variable Kernel Estimator of the Hazard Function from Randomly Censored Data , 1983 .

[36]  Brian S. Yandell,et al.  Nonparametric Inference for Rates with Censored Survival Data , 1983 .