Using Geographic Information and Point of Interest to Estimate Missing Second-Hand Housing Price of Residential Area in Urban Space

The real estate market including second-hand real estate market plays an important role in Chinese economy. However, it is not easy to acquire price of the each pieces of residential area in a city, and for instance the data acquired from Internet in our paper can only cover 56% of residential area in Nanjing. To this end, our paper proposed an model to fill the missing price data by using price and locations of second-hand real estates and Point of Interest (POI) information which were acquired from Internet. Our experiment was conducted in Nanjing and Chongqing, and demonstrates that our model is able to perform better than traditional Geographic Information System (GIS) method, such as Kriging interpolation, and general machine learning model, such as K-Nearest Neighbour (KNN). Also, our proposed model can be more interpretable than traditional methods, and able to reveal how the POI information can influence the second-hand real estate price. Our proposed model can help domain experts, e.g. city planners and economists, to better research the second-hand real estate market in the future.

[1]  Vasilii D. Nosov China in the 2010s: The Economy of Overheating , 2017 .

[2]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[3]  Deng Xiao-zhu,et al.  The land prices and housing prices — Empirical research based on panel data of 11 provinces and municipalities in Eastern China , 2013, 2013 International Conference on Management Science and Engineering 20th Annual Conference Proceedings.

[4]  Hui Xiong,et al.  A Discovery System for Finding High-Value Homes , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[5]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[6]  Wu Wenji Spatial correlation analysis of landscape accessibility and residential housing price in Beijing , 2014 .

[7]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[8]  Wang Lijuan,et al.  Spatial Variation Analysis of the Housing Price in Multi-center City: A Case Study in Chongqing City, China , 2013, 2013 International Conference on Computational and Information Sciences.

[9]  Dongdong Chen,et al.  Geostatistical analysis of China inbound tourism spatial distribution structure , 2010, 2010 18th International Conference on Geoinformatics.

[10]  Kilian Q. Weinberger,et al.  Gradient boosted feature selection , 2014, KDD.

[11]  Wayan Firdaus Mahmudy,et al.  Data-driven fuzzy rule extraction for housing price prediction in Malang, East Java , 2017, 2017 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[12]  Guangli Liu,et al.  Research of second-hand real estate price forecasting based on data mining , 2017, 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC).

[13]  Shujun Ye,et al.  Study on the influencing factors of commodity housing price taking Hebei Province as a sample , 2017, 2017 4th International Conference on Industrial Economics System and Industrial Security Engineering (IEIS).

[14]  Wei Huang,et al.  A Real Estate Boom with Chinese Characteristics , 2016 .

[15]  Feng Pan,et al.  Feature selection for ranking using boosted trees , 2009, CIKM.

[16]  George C. Runger,et al.  Feature Selection with Ensembles, Artificial Variables, and Redundancy Elimination , 2009, J. Mach. Learn. Res..

[17]  Lipo Wang,et al.  Predicting public housing prices using delayed neural networks , 2016, 2016 IEEE Region 10 Conference (TENCON).

[18]  Katherine A. Kiel,et al.  Location, Location, Location: The 3L Approach to House Price Determination , 2008 .

[19]  Yi Bin,et al.  Research on Prediction Methods of Residential Real Estate Price Based on Improved BPNN , 2016 .

[20]  Hu Xiaolong,et al.  Applied research on real estate price prediction by the neural network , 2010, 2010 The 2nd Conference on Environmental Science and Information Application Technology.