Least squares estimation of linear regression models for convex compact random sets

Simple and multiple linear regression models are considered between variables whose “values” are convex compact random sets in $${\mathbb{R}^p}$$ , (that is, hypercubes, spheres, and so on). We analyze such models within a set-arithmetic approach. Contrary to what happens for random variables, the least squares optimal solutions for the basic affine transformation model do not produce suitable estimates for the linear regression model. First, we derive least squares estimators for the simple linear regression model and examine them from a theoretical perspective. Moreover, the multiple linear regression model is dealt with and a stepwise algorithm is developed in order to find the estimates in this case. The particular problem of the linear regression with interval-valued data is also considered and illustrated by means of a real-life example.

[1]  Wolfgang Näther,et al.  On the variance of random fuzzy variables , 2002 .

[2]  Noel A. C. Cressie,et al.  Statistics for Spatial Data: Cressie/Statistics , 1993 .

[3]  L. Billard,et al.  From the Statistics of Data to the Statistics of Knowledge , 2003 .

[4]  M. Eisen,et al.  Probability and its applications , 1975 .

[5]  R. Aumann INTEGRALS OF SET-VALUED FUNCTIONS , 1965 .

[6]  Manuel Montenegro,et al.  Regression and correlation analyses of a linear relation between random intervals , 2001 .

[7]  G. Matheron Random Sets and Integral Geometry , 1976 .

[8]  Ana Colubi,et al.  Testing linear independence in linear models with interval-valued data , 2007, Comput. Stat. Data Anal..

[9]  Francisco de A. T. de Carvalho,et al.  Univariate and Multivariate Linear Regression Methods to Predict Interval-Valued Features , 2004, Australian Conference on Artificial Intelligence.

[10]  P. Kloeden,et al.  Metric spaces of fuzzy sets , 1990 .

[11]  M. Gil,et al.  Least squares fitting of an affine function and strength of association for interval-valued data , 2002 .

[12]  María Asunción Lubiano,et al.  The λ-mean squared dispersion associated with a fuzzy random variable , 2000, Fuzzy Sets Syst..

[13]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[14]  Hans-Hermann Bock,et al.  Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data , 2000 .

[15]  P. Kloeden,et al.  Metric Spaces of Fuzzy Sets: Theory and Applications , 1994 .

[16]  Francisco de A. T. de Carvalho,et al.  A New Method to Fit a Linear Regression Model for Interval-Valued Data , 2004, KI.

[17]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[18]  Carlo Bertoluzza,et al.  On a new class of distances between fuzzy numbers , 1995 .

[19]  Phil Diamond,et al.  Least squares fitting of compact set-valued data , 1990 .

[20]  D. Stoyan,et al.  Stochastic Geometry and Its Applications , 1989 .

[21]  D. Stoyan,et al.  Stochastic Geometry and Its Applications , 1989 .