Machine Learning and the Spatial Structure of House Prices and Housing Returns

Economists do not have reliable measures of current house values, let alone housing returns. This ignorance underlies the illiquidity of mortgage-backed securities, which in turn feeds back to deepen the sub-prime crisis. Using a massive new data tape of housing transactions in L.A., we demonstrate systematic patterns in the error associated with using the ubiquitous repeat sales methodology to understand house values. In all periods, the resulting indices under-predict sales prices of less expensive homes, and over-predict prices of more expensive homes. The recent period has produced errors that are not only unprecedentedly large in absolute value, but highly systematic: after a few years in which the indices under-predicted prices, they now significantly over-predict them. We introduce new machine learning techniques from computer science to correct for prediction errors that have geographic origins. The results are striking. Accounting for geography significantly reduces the extent of the prediction error, removes many of the systematic patterns, and results in far less deterioration in model performance in the recent period.

[1]  Bradford Case,et al.  Modeling Spatial and Temporal House Price Patterns: A Comparison of Four Models , 2004 .

[2]  S. Rosen Hedonic Prices and Implicit Markets: Product Differentiation in Pure Competition , 1974, Journal of Political Economy.

[3]  Ronald P. Barry,et al.  Spatiotemporal Autoregressive Models of Neighborhood Effects , 1998 .

[4]  R. Struyk,et al.  Segmentation in Urban Housing Markets. , 1976 .

[5]  Otis W. Gilley,et al.  Using the Spatial Configuration of the Data to Improve Estimation , 1997 .

[6]  William N. Goetzmann,et al.  Non-temporal Components of Residential Real Estate Appreciation , 1995 .

[7]  Aysegul Can The Measurement of Neighborhood Dynamics in Urban House Prices , 1990 .

[8]  Martin Hoesli,et al.  Defining Housing Submarkets , 1999 .

[9]  R. Shiller,et al.  The Efficiency of the Market for Single-Family Homes , 1988 .

[10]  Richard F. Muth,et al.  A Regression Method for Real Estate Price Index Construction , 1963 .

[11]  Allen C. Goodman,et al.  HOUSING SUBMARKETS WITHIN URBAN AREAS: DEFINITIONS AND EVIDENCE* , 1981 .

[12]  Aysegul Can Specification and estimation of hedonic housing price models , 1992 .

[13]  R. Kelley Pace,et al.  Nonparametric methods with applications to hedonic models , 1993 .

[14]  T. Thibodeau Marking Single–Family Property Values to Market , 2003 .

[15]  William N. Goetzmann,et al.  A Spatial Model of Housing Returns and Neighborhood Substitutability , 1997 .

[16]  J. Clapp A Semi Parametric Method for Estimating Local House Price Indices , 2003 .

[17]  P. Anglin,et al.  SEMIPARAMETRIC ESTIMATION OF A HEDONIC PRICE FUNCTION , 1996 .

[18]  R. Meese,et al.  Nonparametric Estimation of Dynamic Hedonic Price Models and the Construction of Residential Housing Price Indices , 1991 .

[19]  R. Halvorsen,et al.  Choice of functional form for hedonic price equations , 1981 .

[20]  R. Dubin Spatial autocorrelation and neighborhood quality , 1992 .

[21]  Yong Tu,et al.  Economic perspectives on the structure of local housing systems , 1996 .

[22]  S. Basu,et al.  Analysis of Spatial Autocorrelation in House Prices , 1998 .

[23]  R. Dubin,et al.  Predicting House Prices Using Multiple Listings Data , 1998 .