k-Boxplots for mixture data

This article introduces a new graphical tool for summarizing data that possess a mixture structure. The required summary statistics are computed using posterior probabilities of class membership, which can be obtained from a fitted mixture model. Real and simulated data are used to illustrate the usefulness of this tool for visualizing mixture data in comparison with the traditional boxplot.
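
As a rough illustration of how posterior probabilities of class membership can feed into boxplot-style summaries, the sketch below computes posterior-weighted quartiles for each mixture component. It is an assumption about one plausible construction, not the article's exact definition: the abstract does not spell out the estimator, and the names `weighted_quantile`, `component_summaries`, and the matrix `post` are hypothetical. The posterior matrix is assumed to come from a mixture model that has already been fitted.

```python
import numpy as np


def weighted_quantile(x, w, q):
    """Quantile q of x under non-negative weights w (inverse of the weighted ECDF).

    Assumption: a simple step-function definition without interpolation.
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    order = np.argsort(x)
    x_sorted, w_sorted = x[order], w[order]
    # Cumulative share of total weight at each sorted observation.
    cum = np.cumsum(w_sorted) / np.sum(w_sorted)
    # First sorted value whose cumulative weight share reaches q.
    idx = np.searchsorted(cum, q, side="left")
    return x_sorted[min(idx, len(x_sorted) - 1)]


def component_summaries(x, post):
    """Per-component weighted quartiles.

    post[i, k] is the posterior probability that observation i belongs to
    component k (hypothetical input, taken from a fitted mixture model).
    """
    post = np.asarray(post, dtype=float)
    levels = (("q1", 0.25), ("median", 0.50), ("q3", 0.75))
    return [
        {name: weighted_quantile(x, post[:, k], q) for name, q in levels}
        for k in range(post.shape[1])
    ]
```

Calling `component_summaries(x, post)` returns one dictionary of quartiles per component, from which one box per component could be drawn. When the posterior probabilities are all 0 or 1, the weighted quartiles reduce to ordinary step-function quartiles within each hard-assigned group.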
