On empirical likelihood for a semiparametric mixture model

Plant and animal studies of quantitative trait loci provide data which arise from mixtures of distributions with known mixing proportions. Previous approaches to estimation involve modelling the distributions parametrically. We propose a semiparametric alternative which assumes that the log ratio of the component densities satisfies a linear model, with the baseline density unspecified. It is demonstrated that a constrained empirical likelihood has an irregularity under the null hypothesis that the two densities are equal. A factorisation of the likelihood suggests a partial empirical likelihood which permits unconstrained estimation of the parameters, and which is shown to give consistent and asymptotically normal estimators, regardless of the null. The asymptotic null distribution of the log partial likelihood ratio is chi-squared. Theoretical calculations show that the procedure may be as efficient as the full empirical likelihood in the regular set-up. The usefulness of the robust methodology is illustrated with a rat study of breast cancer resistance genes. Copyright Biometrika Trust 2002, Oxford University Press.

[1]  D. Hosmer A Comparison of Iterative Maximum Likelihood Estimates of the Parameters of a Mixture of Two Normal Distributions Under Three Different Types of Sample , 1973 .

[2]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[3]  P. Hall On the Non‐Parametric Estimation of Mixture Proportions , 1981 .

[4]  Jing Qin,et al.  Empirical Likelihood in Biased Sample Problems , 1993 .

[5]  R. Doerge,et al.  Permutation tests for multiple loci affecting a quantitative character. , 1996, Genetics.

[6]  A. Owen Empirical likelihood ratio confidence intervals for a single functional , 1988 .

[7]  P. Sen,et al.  On the asymptotic performance of the log likelihood ratio statistic for the mixture model and related results , 1984 .

[8]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[9]  J. Anderson Multivariate logistic compounds , 1979 .

[10]  J. Lawless,et al.  Empirical Likelihood and General Estimating Equations , 1994 .

[11]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[12]  Patrice Loisel,et al.  Testing in normal mixture models when the proportions are known , 1992 .

[13]  S. Knapp,et al.  Mapping quantitative trait loci using molecular marker linkage maps , 1990, Theoretical and Applied Genetics.

[14]  Rebecca W. Doerge,et al.  Statistical issues in the search for genes affecting quantitative traits in experimental populations , 1997 .

[15]  D. Titterington,et al.  Estimation Problems with Data from a Mixture , 1978 .

[16]  Y. Qin Inferences for case-control and semiparametric two-sample density ratio models , 1998 .

[17]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[18]  Jing Qin,et al.  Empirical likelihood ratio based confidence intervals for mixture proportions , 1999 .

[19]  R. Durrett Probability: Theory and Examples , 1993 .

[20]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[21]  D.,et al.  Regression Models and Life-Tables , 2022 .

[22]  M. Newton,et al.  Genetic loci controlling breast cancer susceptibility in the Wistar-Kyoto rat. , 2001, Genetics.

[23]  L Kruglyak,et al.  A nonparametric approach for mapping quantitative trait loci. , 1995, Genetics.

[24]  J. Klotz,et al.  Statistical methods for the analysis of tumor multiplicity data. , 1981, Cancer research.