Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data

Housing price valuation is one of most important trading decisions.This study uses machine learning to develop housing price prediction models.This study analyzes the housing data of 5359 townhouses in Fairfax County, VA.The 10-fold cross-validation was applied to C4.5, RIPPER, Bayesian, and AdaBoost.RIPPER outperformed these other housing price prediction models in all tests. House sales are determined based on the Standard & Poor's Case-Shiller home price indices and the housing price index of the Office of Federal Housing Enterprise Oversight (OFHEO). These reflect the trends of the US housing market. In addition to these housing price indices, the development of a housing price prediction model can greatly assist in the prediction of future housing prices and the establishment of real estate policies. This study uses machine learning algorithms as a research methodology to develop a housing price prediction model. To improve the accuracy of housing price prediction, this paper analyzes the housing data of 5359 townhouses in Fairfax County, Virginia, gathered by the Multiple Listing Service (MLS) of the Metropolitan Regional Information Systems (MRIS). We develop a housing price prediction model based on machine learning algorithms such as C4.5, RIPPER, Naive Bayesian, and AdaBoost and compare their classification accuracy performance. We then propose an improved housing price prediction model to assist a house seller or a real estate agent make better informed decisions based on house price valuation. The experiments demonstrate that the RIPPER algorithm, based on accuracy, consistently outperforms the other models in the performance of housing price prediction.

[1]  Jim Berry,et al.  Hedonic modelling, housing submarkets and residential valuation , 1996 .

[2]  Osman Aytekin,et al.  The use of fuzzy logic in predicting house selling price , 2010, Expert Syst. Appl..

[3]  Simon Stevenson,et al.  New empirical evidence on heteroscedasticity in hedonic housing models , 2004 .

[4]  Mingcang Zhu,et al.  Housing price forecasting based on genetic algorithm and support vector machine , 2011, Expert Syst. Appl..

[5]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[6]  Pieter Hooimeijer,et al.  Capturing Housing Market Segmentation: An Alternative Approach based on Neural Network Modelling , 2002 .

[7]  Yihao Zhang,et al.  Real estate price forecasting based on SVM optimized by PSO , 2014 .

[8]  Ibrahim Halil Gerek,et al.  House selling price assessment using two different adaptive neuro-fuzzy techniques , 2014 .

[9]  Seow Eng Ong,et al.  Determinants of House Price: A Decision Tree Approach , 2006 .

[10]  Yan Kestens,et al.  Heterogeneity in hedonic modelling of house prices: looking at buyers’ household profiles , 2006, J. Geogr. Syst..

[11]  Piet M. A. Eichholtz,et al.  REIT Performance: Does Managerial Specialization Pay? , 1999 .

[12]  Hasan Selim,et al.  Determinants of house prices in Turkey: Hedonic regression versus artificial neural network , 2009, Expert Syst. Appl..

[13]  Ali Azadeh,et al.  A hybrid fuzzy regression-fuzzy cognitive map algorithm for forecasting and optimization of housing market fluctuations , 2012, Expert Syst. Appl..

[14]  Nancy Wallace,et al.  House Price Dynamics and Market Fundamentals: The Parisian Housing Market , 2003 .

[15]  Jian-Guo Liu,et al.  Application of Fuzzy Neural Network for Real Estate Prediction , 2006, ISNN.

[16]  Okmyung Bin A prediction comparison of housing sales prices by parametric versus semi-parametric regressions , 2004 .

[17]  Rainer Schulz,et al.  A State Space Model for Berlin House Prices: Estimation and Economic Interpretation , 2003 .