House Prices Prediction with Machine Learning Algorithms

Based on the data set compiled by D. D. Cock and the competition run by kaggle.com, we propose a house prices prediction algorithm in Ames, lowa by deliberating on data processing, feature engineering and combination forecasting. Our prediction ranks the 35th of the total 2221 results on the public leaderboard of Kaggle.com and the RMSE of predicted results after taking logarithm from all the test data is 0.12019, which shows good performance and small of over-fitting.

[1]  Conghui Zhu,et al.  Research on machine learning method-based combination forecasting model and its application , 2011, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[2]  Nattapol Aunsri,et al.  Drug-Drug Interactions prediction from enzyme action crossing through machine learning approaches , 2015, 2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON).

[3]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[4]  Miriam A. M. Capretz,et al.  Machine Learning With Big Data: Challenges and Approaches , 2017, IEEE Access.

[5]  Jeff Heaton,et al.  An empirical analysis of feature engineering for predictive modeling , 2016, SoutheastCon 2016.

[6]  Aparna U.R.,et al.  Feature selection and extraction in data mining , 2016, 2016 Online International Conference on Green Engineering and Technologies (IC-GET).

[7]  Nagwa Lotfy Badr,et al.  A study of spatial machine learning for business behavior prediction in location based social networks , 2016, 2016 11th International Conference on Computer Engineering & Systems (ICCES).

[8]  Huajie Zhang,et al.  An Empirical Study on Machine Learning Models for Wind Power Predictions , 2016, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA).

[9]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[10]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[11]  Dean R. De Cock,et al.  Ames, Iowa: Alternative to the Boston Housing Data as an End of Semester Regression Project , 2011 .