Inference methods in traditional statistics, machine learning and data mining assume that data is generated from an independent and identically distributed (iid) process. Spatial data exhibits behavior for which the iid assumption must be relaxed. For example, the standard approach in spatial regression is to assume the existence of a contiguity matrix which captures the spatial autoregressive properties of the data. However all spatial methods, till now, have assumed that the contiguity matrix is given apriori or can be estimated by using a spatial similarity function. In this paper we propose a convex optimization formulation to solve the spatial autoregressive regression (SAR) model in which both the contiguity matrix and the non-spatial regression parameters are unknown and inferred from the data. We solve the problem using the alternating direction method of multipliers (ADMM) which provides a solution which is both robust and efficient. While our approach is general we use data from housing markets of Boston and Sydney to both guide the analysis and validate our results. A novel side effect of our approach is the automatic discovery of spatial clusters which translate to submarkets in the housing data sets.
[1]
Trevor J. Barnes,et al.
Environment and Planning A 2001, volume 33, pages 2127^2147
,
2001
.
[2]
W. Marsden.
I and J
,
2012
.
[3]
A. Sayed,et al.
Foundations and Trends ® in Machine Learning > Vol 7 > Issue 4-5 Ordering Info About Us Alerts Contact Help Log in Adaptation , Learning , and Optimization over Networks
,
2011
.
[4]
R. Muth,et al.
Cities and Housing.
,
1970
.
[5]
Shashi Shekhar,et al.
Spatial Databases: A Tour
,
2003
.
[6]
Nicolai,et al.
Revue d ' économie industrielle Capabilities and the Theory of the Firm
,
2016
.
[7]
小林 信治,et al.
自然資源および環境資源への制御理論の適用-解説-上-(Journal of Environmental Economics and Management,4,1977)
,
1981
.
[8]
W. Alonso.
Location And Land Use
,
1964
.
[9]
L. Anselin.
Spatial Econometrics: Methods and Models
,
1988
.