Density Estimation in the Presence of Heteroskedastic Measurement Error

We consider density estimation when the variable of interest is subject to heteroskedastic measurement error. The density is assumed to have a smooth but unknown functional form which we model with a penalized mixture of B-splines. We treat the situation where multiple mismeasured observations of each of the variables of interest are observed, and the measurement error is assumed to be additive and normal. The paper’s main contributions to this problem are to address the effects of heteroskedastic measurement error, to explain the biases caused by ignoring heteroskedasticity, and to present an approximate equivalent kernel for a spline based density estimator. The derivation of the equivalent kernel may be of independent interest. We model the variance as a smooth function of the unobserved variable of interest, and we use small-σ asymptotics to describe the biases incurred by assuming the measurement error is homoskedastic when it actually is heteroskedastic. We fit the model using Bayesian methods. An example from nutritional epidemiology and an example that uses simulated data are included.

[1]  J. Kiefer,et al.  CONSISTENCY OF THE MAXIMUM LIKELIHOOD ESTIMATOR IN THE PRESENCE OF INFINITELY MANY INCIDENTAL PARAMETERS , 1956 .

[2]  N. Laird Nonparametric Maximum Likelihood Estimation of a Mixing Distribution , 1978 .

[3]  John Rice,et al.  Deconvolution of Microfluorometric Histograms with B Splines , 1982 .

[4]  B. Lindsay The Geometry of Mixture Likelihoods: A General Theory , 1983 .

[5]  B. Silverman,et al.  Spline Smoothing: The Equivalent Variable Kernel Method , 1984 .

[6]  Leonard A. Stefanski,et al.  The effects of measurement error on parameter estimation , 1985 .

[7]  P. Hall,et al.  Optimal Rates of Convergence for Deconvolving a Density , 1988 .

[8]  R. DerSimonian Correction to Algorithm as 221: Maximum Likelihood Estimation of a Mixing Distribution , 1990 .

[9]  A. Carriquiry,et al.  A Semiparametric Transformation Approach to Estimating Usual Daily Intake Distributions , 1996 .

[10]  Marie Davidian,et al.  The Nonlinear Mixed Effects Model with a Smooth Random Effects Density , 1993 .

[11]  Wayne A. Fuller,et al.  Estimation in the Presence of Measurement Error , 1995 .

[12]  Ja-Yong Koo,et al.  B-spline deconvolution based on the EM algorithm , 1996 .

[13]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[14]  E Demidenko,et al.  Covariate measurement error and the estimation of random effect parameters in a mixed model for longitudinal data. , 1998, Statistics in medicine.

[15]  P Schlattmann,et al.  Recent developments in computer-assisted analysis of mixtures. , 1998, Biometrics.

[16]  M A Newton,et al.  An Estimation Method for the Semiparametric Mixed Effects Model , 1999, Biometrics.

[17]  M. Aitkin A General Maximum Likelihood Analysis of Variance Components in Generalized Linear Models , 1999, Biometrics.

[18]  Dankmar Böhning,et al.  Computer-Assisted Analysis of Mixtures and Applications , 2000, Technometrics.

[19]  M Davidian,et al.  Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data , 2001, Biometrics.

[20]  B. Lindsay,et al.  Alternative EM methods for nonparametric finite mixture models , 2001 .

[21]  D. Ruppert Selecting the Number of Knots for Penalized Splines , 2002 .

[22]  D. Ruppert,et al.  Local Polynomial Regression and SIMEX , 2003 .

[23]  Spline Estimators of the Density Function of a Variable Measured with Error , 2003 .

[24]  Raymond J. Carroll,et al.  Low order approximations in deconvolution and regression with errors in variables , 2004 .

[25]  E. Lesaffre,et al.  Smooth Random Effects Distribution in a Linear Mixed Model , 2004, Biometrics.

[26]  D. Hinkley Annals of Statistics , 2006 .

[27]  On The Asymptotics Of Penalised Splines , 2007 .

[28]  Yuedong Wang,et al.  Smoothing Spline Estimation of Variance Functions , 2007 .