Minimizing effects of scale distortion for spatially grouped census data using rough sets

Census data has been widely used for community evaluation based on demographic and socioeconomic variables. However, the analysis is typically tied to specific areal units, and the results often change when the size of the census configuration changes, leading to scale distortions. Various approaches, such as optimal zoning systems and multivariate statistical analysis, have been developed to address the scale problem, but their limitations have led to the use of non-statistical methods instead. This study combines a non-statistical method with descriptive statistical measures to develop a rough sets approach to constructing a census-based deprivation index (DI) and to determining its relationship to a recent immigrant population, using the 2001 Canadian census. Application of the approach in the Greater Vancouver Regional District shows that rough sets can stabilize relationships for spatially grouped census data by minimizing scale distortions. Scale sensitivity measures are also estimated to translate DI relationships across three census configurations. The rough sets approach is suitable for areal data analysis because it is resistant to nonlinearity and outliers and assumes no prior relationship between variables.
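
The abstract does not spell out how the rough sets approach is built, so the following is only a generic sketch of the core rough set idea (lower and upper approximations of a target set), not the study's actual DI construction. The areal units, the attribute names `income` and `unemployment`, their categories, and the `deprived` set are all hypothetical values invented for illustration.

```python
from collections import defaultdict


def approximations(rows, condition_attrs, target):
    """Return the rough set (lower, upper) approximations of `target`.

    rows: dict mapping a unit id to a dict of categorical attribute values.
    condition_attrs: attribute names that define indiscernibility.
    target: set of unit ids to approximate.
    """
    # Units with identical values on all condition attributes are
    # indiscernible and fall into the same equivalence class.
    classes = defaultdict(set)
    for unit_id, attrs in rows.items():
        key = tuple(attrs[a] for a in condition_attrs)
        classes[key].add(unit_id)

    lower, upper = set(), set()
    for members in classes.values():
        if members <= target:   # class lies entirely inside the target
            lower |= members
        if members & target:    # class overlaps the target
            upper |= members
    return lower, upper


# Hypothetical toy data: five areal units with two discretized attributes.
areas = {
    "A": {"income": "low", "unemployment": "high"},
    "B": {"income": "low", "unemployment": "high"},
    "C": {"income": "low", "unemployment": "low"},
    "D": {"income": "high", "unemployment": "low"},
    "E": {"income": "high", "unemployment": "low"},
}
deprived = {"A", "B", "D"}  # hypothetical target concept

lower, upper = approximations(areas, ["income", "unemployment"], deprived)
accuracy = len(lower) / len(upper)  # Pawlak's accuracy of approximation
```

In this toy case, units A and B can be classed as deprived with certainty, while D and E are indistinguishable on the chosen attributes and so fall only in the upper approximation. Because the approximations depend only on set membership and not on a fitted functional form, no prior relationship between the variables is assumed.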
