Neural Chinese Address Parsing

This paper introduces a new task – Chinese address parsing – the task of mapping Chinese addresses into semantically meaningful chunks. While it is possible to model this problem using a conventional sequence labelling approach, our observation is that there exist complex dependencies between labels that cannot be readily captured by a simple linear-chain structure. We investigate neural structured prediction models with latent variables to capture such rich structural information within Chinese addresses. We create and publicly release a new dataset consisting of 15K Chinese addresses, and conduct extensive experiments on the dataset to investigate the model effectiveness and robustness. We release our code and data at http://statnlp.org/research/sp.

[1]  Weiguang Qu,et al.  Exploiting Chunk-level Features to Improve Phrase Chunking , 2012, EMNLP-CoNLL.

[2]  Jun Zhang,et al.  A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[3]  Maurizio Tesconi,et al.  GSP (Geo-Semantic-Parsing): Geoparsing and Geotagging with Machine Learning on Top of Linked Data , 2018, ESWC.

[4]  Noah A. Smith,et al.  Recurrent Neural Network Grammars , 2016, NAACL.

[5]  Kevin Duh,et al.  DyNet: The Dynamic Neural Network Toolkit , 2017, ArXiv.

[6]  Heng Li,et al.  Transductive HMM based Chinese text chunking , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[7]  Baobao Chang,et al.  Graph-based Dependency Parsing with Bidirectional LSTM , 2016, ACL.

[8]  Yue Zhang,et al.  Neural Word Segmentation with Rich Pretraining , 2017, ACL.

[9]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[10]  William W. Cohen,et al.  Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.

[11]  Tianshun Yao,et al.  Chinese Chunk Identification Using SVMs Plus Sigmoid , 2004, IJCNLP.

[12]  Huan Liu,et al.  Content-Aware Point of Interest Recommendation on Location-Based Social Networks , 2015, AAAI.

[13]  Yijia Liu,et al.  Exploring Segment Representations for Neural Segmentation Models , 2016, IJCAI.

[14]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[15]  Xu Sun,et al.  A Discriminative Latent Variable Chinese Segmenter with Hybrid Word/Character Information , 2009, HLT-NAACL.

[16]  Sabine Buchholz,et al.  Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[17]  Stephen Clark,et al.  Chinese Segmentation with a Word-Based Perceptron Algorithm , 2007, ACL.

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Chilin Shih,et al.  A Stochastic Finite-State Word-Segmentation Algorithm for Chinese , 1994, ACL.

[20]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[21]  Noah A. Smith,et al.  Segmental Recurrent Neural Networks , 2015, ICLR.

[22]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[23]  Yue Zhang,et al.  Chinese NER Using Lattice LSTM , 2018, ACL.

[24]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[25]  Hai Zhao,et al.  Effective Tag Set Selection in Chinese Word Segmentation via Conditional Random Field Modeling , 2006, PACLIC.

[26]  Dan Klein,et al.  Constituency Parsing with a Self-Attentive Encoder , 2018, ACL.

[27]  Noah A. Smith,et al.  Transition-Based Dependency Parsing with Stack Long Short-Term Memory , 2015, ACL.

[28]  Weitong Chen,et al.  Learning Graph-based POI Embedding for Location-based Recommendation , 2016, CIKM.

[29]  Bowen Zhou,et al.  Neural Models for Sequence Chunking , 2017, AAAI.

[30]  Nianwen Xu,et al.  Chinese Word Segmentation as Character Tagging , 2003, Int. J. Comput. Linguistics Chin. Lang. Process..

[31]  Yue Zhang,et al.  Transition-Based Neural Word Segmentation , 2016, ACL.

[32]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[33]  Dan Klein,et al.  A Minimal Span-Based Neural Constituency Parser , 2017, ACL.

[34]  Hitoshi Isahara,et al.  An Empirical Study of Chinese Chunking , 2006, ACL.

[35]  Thorsten Joachims,et al.  Learning structural SVMs with latent variables , 2009, ICML '09.

[36]  Andrew McCallum,et al.  Chinese Segmentation and New Word Detection using Conditional Random Fields , 2004, COLING.

[37]  Tianshun Yao,et al.  Applying Conditional Random Fields to Chinese Shallow Parsing , 2005, CICLing.

[38]  Shih-Hung Wu,et al.  Applying Maximum Entropy to Robust Chinese Shallow Parsing , 2005, ROCLING/IJCLCLP.