Lifelong Property Price Prediction: A Case Study for the Toronto Real Estate Market

We present Luce, the first life-long predictive model for automated property valuation. Luce addresses two critical issues of property valuation: the lack of recent sold prices and the sparsity of house data. It is designed to operate on a limited volume of recent house transaction data. As a departure from prior work, Luce organizes the house data in a heterogeneous information network (HIN) where graph nodes are house entities and attributes that are important for house price valuation. We employ a Graph Convolutional Network (GCN) to extract the spatial information from the HIN for house-related data like geographical locations, and then use a Long Short Term Memory (LSTM) network to model the temporal dependencies for house transaction data over time. Unlike prior work, Luce can make effective use of the limited house transactions data in the past few months to update valuation information for all house entities within the HIN. By providing a complete and up-to-date house valuation dataset, Luce thus massively simplifies the downstream valuation task for the targeting properties. We demonstrate the benefit of Luce by applying it to large, real-life datasets obtained from the Toronto real estate market. Extensive experimental results show that Luce not only significantly outperforms prior property valuation methods but also often reaches and sometimes exceeds the valuation accuracy given by independent experts when using the actual realization price as the ground truth.

[1]  Philip S. Yu,et al.  Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks , 2021, ACM Trans. Inf. Syst..

[2]  Philip S. Yu,et al.  Lime: Low-Cost and Incremental Learning for Dynamic Heterogeneous Information Networks , 2021, IEEE Transactions on Computers.

[3]  Bowen Du,et al.  Dynamic graph convolutional network for long-term traffic flow prediction with reinforcement learning , 2021, Inf. Sci..

[4]  Jie Xu,et al.  Hawk: Rapid Android Malware Detection Through Heterogeneous Graph Attention Networks , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Jianmin Wang,et al.  Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting , 2021, NeurIPS.

[6]  Philip S. Yu,et al.  Streaming Social Event Detection and Evolution Discovery in Heterogeneous Information Networks , 2021, ACM Trans. Knowl. Discov. Data.

[7]  Reinforcement Learning Enhanced Heterogeneous Graph Neural Network , 2020, ArXiv.

[8]  Yu Liu,et al.  T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction , 2018, IEEE Transactions on Intelligent Transportation Systems.

[9]  Philip S. Yu,et al.  Pairwise Learning for Name Disambiguation in Large-Scale Heterogeneous Academic Networks , 2020, 2020 IEEE International Conference on Data Mining (ICDM).

[10]  Philip S. Yu,et al.  Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters , 2020, CIKM.

[11]  Philip S. Yu,et al.  Attentional Graph Convolutional Networks for Knowledge Concept Recommendation in MOOCs in a Heterogeneous View , 2020, SIGIR.

[12]  Md Zakirul Alam Bhuiyan,et al.  Deep Irregular Convolutional Residual LSTM for Urban Traffic Passenger Flows Prediction , 2020, IEEE Transactions on Intelligent Transportation Systems.

[13]  Philip S. Yu,et al.  Spatial temporal incidence dynamic graph neural networks for traffic flow forecasting , 2020, Inf. Sci..

[14]  Lin Liu,et al.  Dynamic network embedding via incremental skip-gram with negative sampling , 2019, Science China Information Sciences.

[15]  Karsten M. Borgwardt,et al.  Representation Learning for Dynamic Graphs: A Survey , 2019, J. Mach. Learn. Res..

[16]  Charles E. Leisersen,et al.  EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs , 2019, AAAI.

[17]  Alessandro Rozza,et al.  Dynamic Graph Convolutional Networks , 2017, Pattern Recognit..

[18]  Yu He,et al.  HeteSpaceyWalk: A Heterogeneous Spacey Random Walk for Heterogeneous Information Network Embedding , 2019, CIKM.

[19]  Nitesh V. Chawla,et al.  Heterogeneous Graph Neural Network , 2019, KDD.

[20]  Jieping Ye,et al.  Spatiotemporal Multi-Graph Convolution Network for Ride-Hailing Demand Forecasting , 2019, AAAI.

[21]  Philip S. Yu,et al.  Fine-grained Event Categorization with Heterogeneous Graph Convolutional Networks , 2019, IJCAI.

[22]  Jie Tang,et al.  Representation Learning for Attributed Multiplex Heterogeneous Network , 2019, KDD.

[23]  Yanfang Ye,et al.  Heterogeneous Graph Attention Network , 2019, WWW.

[24]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Soroosh Sorooshian,et al.  Short‐Term Precipitation Forecast Based on the PERSIANN System and LSTM Recurrent Neural Networks , 2018, Journal of Geophysical Research: Atmospheres.

[26]  Cyrus Shahabi,et al.  Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning , 2018, SIGSPATIAL/GIS.

[27]  Kenli Li,et al.  Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[28]  Yike Guo,et al.  Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction , 2018, ACM Multimedia.

[29]  Bai Wang,et al.  Attention Based Meta Path Fusion for Heterogeneous Information Network Embedding , 2018, PRICAI.

[30]  Qiang Yang,et al.  Bike flow prediction with multi-graph convolutional networks , 2018, SIGSPATIAL/GIS.

[31]  Kevin Heaslip,et al.  Inferring transportation modes from GPS trajectories using a convolutional neural network , 2018, ArXiv.

[32]  Chengqi Zhang,et al.  MetaGraph2Vec: Complex Semantic Path Augmented Heterogeneous Network Embedding , 2018, PAKDD.

[33]  Joost van de Weijer,et al.  Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[34]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[35]  Nicholas G. Polson,et al.  Deep learning for spatio‐temporal modeling: Dynamic traffic flows and high frequency trading , 2017, Applied Stochastic Models in Business and Industry.

[36]  Xavier Bresson,et al.  Structured Sequence Modeling with Graph Convolutional Recurrent Networks , 2016, ICONIP.

[37]  Wang-Chien Lee,et al.  HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning , 2017, CIKM.

[38]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[39]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[40]  Tinne Tuytelaars,et al.  Expert Gate: Lifelong Learning with a Network of Experts , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[42]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[43]  Wayan Firdaus Mahmudy,et al.  Modeling House Price Prediction using Regression Analysis and Particle Swarm Optimization Case Study : Malang, East Java, Indonesia , 2017 .

[44]  Ludovic Dos Santos,et al.  Multilabel Classification on Heterogeneous Graphs with Gaussian Embeddings , 2016, ECML/PKDD.

[45]  Estevam R. Hruschka,et al.  Lifelong Machine Learning and Computer Reading the Web , 2016, KDD.

[46]  Razvan Pascanu,et al.  Progressive Neural Networks , 2016, ArXiv.

[47]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[48]  Jae Kwon Bae,et al.  Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data , 2015, Expert Syst. Appl..

[49]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[50]  Xiaolong Liu,et al.  Spatial and Temporal Dependence in House Price Prediction , 2013 .

[51]  Andrew J. Patton,et al.  Daily House Price Indexes: Construction, Modeling, and Longer-Run Predictions , 2013 .

[52]  Philip S. Yu,et al.  Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach , 2012, Proc. VLDB Endow..

[53]  Liv Osland,et al.  An Application of Spatial Econometrics in Relation to Hedonic House Price Modelling , 2010 .

[54]  Hasan Selim,et al.  Determinants of house prices in Turkey: Hedonic regression versus artificial neural network , 2009, Expert Syst. Appl..

[55]  Eva Cantoni,et al.  Spatial Dependence, Housing Submarkets, and House Price Prediction , 2007 .

[56]  Jorge Miguel Chica Olmo,et al.  Prediction of Housing Location Price By a Multivariate Spatial Method: Cokriging , 2007 .

[57]  V. Limsombunchai House Price Prediction: Hedonic Price Model vs. Artificial Neural Network , 2004 .

[58]  Simon Stevenson,et al.  New empirical evidence on heteroscedasticity in hedonic housing models , 2004 .

[59]  Okmyung Bin A prediction comparison of housing sales prices by parametric versus semi-parametric regressions , 2004 .

[60]  S. Basu,et al.  Analysis of Spatial Autocorrelation in House Prices , 1998 .

[61]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[62]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[63]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .