Robust scale estimators for fuzzy data

Observations that lie far from the majority of the data or deviate from the general pattern often appear in datasets. Classical estimates such as the sample mean or the sample variance can be substantially affected by these observations (outliers); even a single outlier can have a huge distorting influence. For real-valued data, however, there exist robust estimates of location and scale (dispersion) that reduce the influence of these atypical values and give approximately the same results as the classical estimates applied to the typical data without outliers. In real life, the data to be analyzed and interpreted are not always precisely defined and cannot always be properly expressed on a numerical scale of measurement. Frequently, some of these imprecise data can be suitably described and modelled by means of a fuzzy rating scale of measurement. In this paper, several well-known scale (dispersion) estimators for the real-valued case are extended to random fuzzy numbers (i.e., random mechanisms generating fuzzy-valued data), and some of their properties as estimators of dispersion are examined. Furthermore, their robust behaviour is analyzed using two powerful tools, namely the finite sample breakdown point and sensitivity curves. Simulations, including empirical bias curves, complete the study.
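
As a rough illustration of the sensitivity-curve idea mentioned above, the following minimal sketch works in the real-valued case only. It compares the sample standard deviation with the median absolute deviation (MAD) when a single contaminating value is appended to a small sample. The function names `mad` and `sensitivity_curve`, the toy sample, and the choice of the unscaled MAD are assumptions made for this illustration; they are not taken from the paper, whose fuzzy-valued estimators are not reproduced here.

```python
import statistics


def mad(xs):
    """Unscaled median absolute deviation about the median (real-valued data)."""
    med = statistics.median(xs)
    return statistics.median([abs(x - med) for x in xs])


def sensitivity_curve(estimator, sample, x):
    """Standardized change in the estimate when the single point x is appended.

    SC(x) = (n + 1) * (T(x_1, ..., x_n, x) - T(x_1, ..., x_n)), with n = len(sample).
    """
    n_plus_one = len(sample) + 1
    return n_plus_one * (estimator(sample + [x]) - estimator(sample))


# Hypothetical "typical" sample without outliers.
sample = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]

for outlier in (1e3, 1e6):
    sc_sd = sensitivity_curve(statistics.stdev, sample, outlier)
    sc_mad = sensitivity_curve(mad, sample, outlier)
    print(f"x = {outlier:g}: SC(stdev) = {sc_sd:.1f}, SC(MAD) = {sc_mad:.1f}")
```

In this toy sample the sensitivity curve of the MAD stays the same as the contaminating value moves further away, while that of the standard deviation keeps growing. This is the kind of contrast that sensitivity curves are meant to reveal.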
