On Bootstrapping the Likelihood Ratio Test Statistic for the Number of Components in a Normal Mixture

An important but difficult problem in practice is assessing the number of components g in a mixture. An obvious way of proceeding is to use the likelihood ratio test statistic A{ to test for the smallest value of g consistent with the data. Unfortunately with mixture models, regularity conditions do not hold for -2 log A, to have it usual asymptotic null distribution of chi-squared. In this paper the role of the bootstrap is highlighted for the assessment of the null distribution of -2 log A{ for the test of a single normal density versus a mixture of two normal densities in the univariate case.

[1]  D. B. Duncan MULTIPLE RANGE AND MULTIPLE F TESTS , 1955 .

[2]  A. Hope A Simplified Monte Carlo Significance Test Procedure , 1968 .

[3]  N. E. Day Estimating the components of a mixture of normal distributions , 1969 .

[4]  J. Wolfe A Monte Carlo Study of the Sampling Distribution of the Likelihood Ratio for Mixtures of Multinormal Distributions , 1971 .

[5]  A. Scott,et al.  A Cluster Analysis Method for Grouping Means in the Analysis of Variance , 1974 .

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  D. Binder Bayesian cluster analysis , 1978 .

[8]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[9]  David A. Binder,et al.  Approximations to Bayesian clustering rules , 1981 .

[10]  Murray Aitkin,et al.  Statistical Modelling of Data on Teaching Styles , 1981 .

[11]  B. Everitt A Monte Carlo Investigation Of The Likelihood Ratio Test For The Number Of Components In A Mixture Of Normal Distributions. , 1981, Multivariate behavioral research.

[12]  B. Everitt,et al.  Finite Mixture Distributions , 1981 .

[13]  G. J. McLachlan,et al.  9 The classification and mixture maximum likelihood approaches to cluster analysis , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[14]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[15]  Geoffrey J. McLachlan,et al.  Estimation of Mixing Proportions: A Case Study , 1984 .

[16]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[17]  T. Caliński,et al.  Clustering means in ANOVA by simultanuous testing , 1985 .

[18]  Geoffrey J. McLachlan,et al.  Cluster analysis in a randomized complete block design , 1985 .

[19]  D. Rubin,et al.  Estimation and Hypothesis Testing in Finite Mixture Models , 1985 .

[20]  G. McLachlan,et al.  Likelihood Estimation with Normal Mixture Models , 1985 .

[21]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[22]  Robert Tibshirani,et al.  Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy , 1986 .

[23]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[24]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .