Paper information - Methods for matching English language addresses

Addresses occupy a niche within the landscape of textual data because every word carries positional importance and because of the geographic scope they refer to. The task of matching addresses arises every day in fields such as mail redirection and entity resolution. Our work defines and formalizes a framework for generating matching and mismatching pairs of English-language addresses, and uses it to evaluate various methods for automatic address matching. These methods range widely, from distance-based approaches to deep learning models. By studying the Precision, Recall, and Accuracy of these approaches, we gain an understanding of the method best suited to this setting of the address matching task.
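
To make the evaluation setup in the abstract concrete, here is a minimal sketch of a distance-based matcher scored with Precision, Recall, and Accuracy. It is an illustration under stated assumptions, not the authors' implementation: the similarity measure (the `ratio` of Python's standard `difflib.SequenceMatcher`), the `normalize` helper, the 0.85 threshold, and the toy address pairs are all hypothetical choices made for this example.

```python
from difflib import SequenceMatcher


def normalize(address: str) -> str:
    """Lowercase, replace commas and periods with spaces, collapse whitespace."""
    cleaned = address.lower().replace(",", " ").replace(".", " ")
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] (assumed stand-in for a distance metric)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def is_match(a: str, b: str, threshold: float = 0.85) -> bool:
    """Predict a match when similarity reaches the (assumed) threshold."""
    return similarity(a, b) >= threshold


def evaluate(pairs, threshold: float = 0.85) -> dict:
    """Score predictions on (address_a, address_b, is_true_match) triples."""
    tp = fp = tn = fn = 0
    for a, b, label in pairs:
        predicted = is_match(a, b, threshold)
        if predicted and label:
            tp += 1
        elif predicted and not label:
            fp += 1
        elif not predicted and label:
            fn += 1
        else:
            tn += 1
    total = tp + fp + tn + fn
    return {
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
        "accuracy": (tp + tn) / total if total else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical labeled pairs, invented purely for illustration.
    toy_pairs = [
        ("12 Main St., Springfield", "12 main st springfield", True),
        ("12 Main Street, Springfield", "12 Main St, Springfield", True),
        ("12 Main St, Springfield", "98 Oak Avenue, Riverton", False),
    ]
    print(evaluate(toy_pairs))
```

A single global threshold is the simplest possible decision rule. The learned approaches the abstract mentions would replace `is_match` with a trained classifier, while `evaluate` would stay the same.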

D. Borrajo | Keshav Ramani

[1] Xiangrong She, et al. Deep Contrast Learning Approach for Address Semantic Matching, 2021, Applied Sciences.

[2] Qingyun Du, et al. A deep learning architecture for semantic address matching, 2019, Int. J. Geogr. Inf. Sci.

[3] Daniel Arribas-Bel, et al. Machine learning innovations in address matching: A practical comparison of word2vec and CRFs, 2019, Trans. GIS.

[4] Jianxiong Dong, et al. Enhance word representation for out-of-vocabulary on Ubuntu dialogue corpus, 2018, ArXiv.

[5] Patricia Murrieta-Flores, et al. Toponym matching through deep neural networks, 2018, Int. J. Geogr. Inf. Sci.

[6] Sepp Hochreiter, et al. Self-Normalizing Neural Networks, 2017, NIPS.

[7] Zhen-Hua Ling, et al. Enhanced LSTM for Natural Language Inference, 2016, ACL.

[8] Jeffrey Pennington, et al. GloVe: Global Vectors for Word Representation, 2014, EMNLP.

[9] J. Dean, et al. Efficient Estimation of Word Representations in Vector Space, 2013, ICLR.

[10] Tong Zhang. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, 2001, AI Mag.

[11] B. Boser, et al. A training algorithm for optimal margin classifiers, 1992, COLT '92.

[12] Michael McGill, et al. Introduction to Modern Information Retrieval, 1983.

[13] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.