Functional Boxplots

This article proposes an informative exploratory tool, the functional boxplot, for visualizing functional data, as well as its generalization, the enhanced functional boxplot. Based on the center outward ordering induced by band depth for functional data, the descriptive statistics of a functional boxplot are: the envelope of the 50% central region, the median curve, and the maximum non-outlying envelope. In addition, outliers can be detected in a functional boxplot by the 1.5 times the 50% central region empirical rule, analogous to the rule for classical boxplots. The construction of a functional boxplot is illustrated on a series of sea surface temperatures related to the El Niño phenomenon and its outlier detection performance is explored by simulations. As applications, the functional boxplot and enhanced functional boxplot are demonstrated on children growth data and spatio-temporal U.S. precipitation data for nine climatic regions, respectively. This article has supplementary material online.

[1]  Rob J Hyndman,et al.  Rainbow Plots, Bagplots, and Boxplots for Functional Data , 2010 .

[2]  J. Romo,et al.  On the Concept of Depth for Functional Data , 2009 .

[3]  M. Febrero,et al.  Outlier detection in functional data by depth measures, with application to identify abnormal NOx levels , 2008 .

[4]  Ricardo Fraiman,et al.  Robust estimation and classification for functional data via projection-based depth notions , 2007, Comput. Stat..

[5]  M. Greenwood Nonparametric Functional Data Analysis: Theory and Practice , 2007 .

[6]  Wenceslao González-Manteiga,et al.  A functional analysis of NOx levels: location and scale estimation and outlier detection , 2007, Comput. Stat..

[7]  Z. Q. John Lu,et al.  Nonparametric Functional Data Analysis: Theory And Practice , 2007, Technometrics.

[8]  Ricardo Fraiman,et al.  On the use of the bootstrap for estimating functions with functional data , 2006, Comput. Stat. Data Anal..

[9]  P. Thomas Fletcher,et al.  Principal geodesic analysis for the study of nonlinear statistics of shape , 2004, IEEE Transactions on Medical Imaging.

[10]  Paul Embrechts,et al.  Economic Applications of Quantile Regression , 2003 .

[11]  Teobaldo Dioses,et al.  El Niño 1982-1983 and 1997-1998: Effects on Peruvian Jack Mackerel and Peruvian Chub Mackerel , 2002 .

[12]  R. Fraiman,et al.  Trimmed means for functional data , 2001 .

[13]  Cun-Hui Zhang,et al.  The multivariate L1-median and associated data depth. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Jean Meloche,et al.  Multivariate L-estimation , 1999 .

[15]  Regina Y. Liu,et al.  Multivariate analysis by data depth: descriptive statistics, graphics and inference, (with discussion and a rejoinder by Liu and Singh) , 1999 .

[16]  P. Rousseeuw,et al.  The Bagplot: A Bivariate Boxplot , 1999 .

[17]  Regina Y. Liu On a Notion of Data Depth Based on Random Simplices , 1990 .

[18]  H. Oja Descriptive Statistics for Multivariate Distributions , 1983 .

[19]  J. Tukey Mathematics and the Picturing of Data , 1975 .

[20]  P. Mahalanobis On the generalized distance in statistics , 1936 .