Geographical address representation learning for address matching

Address matching is a crucial task in various location-based businesses like take-out services and express delivery, which aims at identifying addresses referring to the same location in address databases. It is a challenging one due to various possible ways to express the address of a location, especially in Chinese. Traditional address matching approaches relying on string similarities and learning matching rules to identify addresses referring to the same location, could hardly solve the cases with redundant, incomplete or unusual expression of addresses. In this paper, to learn the geographical semantic representations for address strings, we novelly propose to get rich contexts for addresses from the Web through Web search engines, which could strongly enrich the semantic meaning of addresses that could be learned. Apart from that, we propose a two-stage geographical address representation learning model for address matching. In the first stage, we propose to use an encode-decoder architecture to learn the semantic vector representation for each address string where an up-sampling and sub-sampling strategy is applied to solve the problem of address redundancy and incompleteness. The attention mechanism is also applied to the model to highlight important features of addresses in their semantic representations. And in the second stage, we construct a single large graph from the corpus, which contains address elements and addresses as nodes, and the edges between nodes are built by word co-occurrence information to learn embedding representations for all the nodes on the graph. Our empirical study conducted on two real-world address datasets demonstrates that our approach greatly improves both precision (up to 8%) and recall (up to 12%) of the state-of-the-art existing methods.

[1]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[2]  Yang Wang,et al.  大数据环境下基于贝叶斯推理的中文地名地址匹配方法 (Chinese Place-name Address Matching Method Based on Large Data Analysis and Bayesian Decision) , 2017, 计算机科学.

[3]  Sanja Fidler,et al.  Skip-Thought Vectors , 2015, NIPS.

[4]  Khalil Sima'an,et al.  Graph Convolutional Encoders for Syntax-aware Neural Machine Translation , 2017, EMNLP.

[5]  Xiao-Ming Wu,et al.  Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning , 2018, AAAI.

[6]  Wang Yong,et al.  The Standardization Method of Address Information for POIs from Internet Based on Positional Relation , 2016 .

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[10]  Shen Li,et al.  Revisiting Correlations between Intrinsic and Extrinsic Evaluations of Word Embeddings , 2018, CCL.

[11]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[12]  William J. Drummond,et al.  Address Matching: GIS Technology for Mapping Human Activity Patterns , 1995 .

[13]  L. Venkata Subramaniam,et al.  Transfer of Supervision for Improved Address Standardization , 2010, 2010 20th International Conference on Pattern Recognition.

[14]  Jürgen Schmidhuber,et al.  LSTM can Solve Hard Long Time Lag Problems , 1996, NIPS.

[15]  Xiaofeng Zhu,et al.  One-Step Multi-View Spectral Clustering , 2019, IEEE Transactions on Knowledge and Data Engineering.

[16]  Yu Bin,et al.  A Rule-Based Segmenting and Matching Method for Fuzzy Chinese Addresses , 2011 .

[17]  Xuelong Li,et al.  Graph PCA Hashing for Similarity Search , 2017, IEEE Transactions on Multimedia.

[18]  Zhong Su,et al.  Address standardization with latent semantic association , 2009, KDD.

[19]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[20]  Shichao Zhang,et al.  Spectral clustering via half-quadratic optimization , 2019, World Wide Web.

[21]  Shengrui Wang,et al.  Approximate Address Matching , 2010, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[22]  Joan Bruna,et al.  Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[23]  Shikhar Sharma,et al.  Automated Parsing of Geographical Addresses: A Multilayer Feedforward Neural Network Based Approach , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[24]  Shichao Zhang,et al.  Low-Rank Sparse Subspace for Spectral Clustering , 2019, IEEE Transactions on Knowledge and Data Engineering.

[25]  Qingyun Du,et al.  Using an Optimized Chinese Address Matching Method to Develop a Geocoding Service: A Case Study of Shenzhen, China , 2016, ISPRS Int. J. Geo Inf..

[26]  Eric P. Xing,et al.  Entity Hierarchy Embedding , 2015, ACL.

[27]  Li Jing Improvement on reverse directional maximum matching method based on hash structure for Chinese word segmentation , 2008 .

[28]  Du Qingyun,et al.  A New Method of Chinese Address Extraction Based on Address Tree Model , 2015 .

[29]  Chong Wang,et al.  Attention-based Graph Neural Network for Semi-supervised Learning , 2018, ArXiv.

[30]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.