Learning Hidden Markov Models with Distributed State Representations for Domain Adaptation

Recently, a variety of representation learning approaches have been developed in the literature to induce latent, generalizable features across two domains. In this paper, we extend standard hidden Markov models (HMMs) to learn distributed state representations that improve cross-domain prediction performance. We reformulate the HMM by mapping each discrete hidden state to a distributed representation vector and employ an expectation-maximization algorithm to jointly learn the distributed state representations and the model parameters. We empirically investigate the proposed model on cross-domain part-of-speech tagging and noun-phrase chunking tasks. The experimental results demonstrate the effectiveness of the distributed HMMs in facilitating domain adaptation.
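The abstract does not spell out how the distributed state vectors parameterize the HMM, so the following is only a minimal sketch of one plausible instantiation, not the authors' exact formulation: state embeddings score transitions through a bilinear form and emissions through dot products with word vectors, the E-step is standard forward-backward, and the M-step takes a gradient step on the expected complete-data log-likelihood. All names and dimensions (`K`, `V`, `D`, the matrices `E`, `W`, `U`) are illustrative assumptions.

```python
# Sketch of an HMM whose transition/emission distributions are driven by
# learned state embeddings, trained with an EM-style loop. Assumptions:
# bilinear transition scores E W E^T, emission scores E U^T, gradient M-step.
import numpy as np

rng = np.random.default_rng(0)

K, V, D = 4, 20, 8                         # hidden states, vocab size, embedding dim
E = rng.normal(scale=0.1, size=(K, D))     # state embeddings (learned)
W = rng.normal(scale=0.1, size=(D, D))     # bilinear transition weights (learned)
U = rng.normal(scale=0.1, size=(V, D))     # output word embeddings (learned)
pi = np.full(K, 1.0 / K)                   # initial state distribution (fixed here)

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def trans():                               # T[i, j] = p(s_j | s_i)
    return softmax(E @ W @ E.T, axis=1)

def emit():                                # B[i, w] = p(w | s_i)
    return softmax(E @ U.T, axis=1)

def forward_backward(obs, T, B):
    """E-step for one sequence: state posteriors and expected transitions."""
    n = len(obs)
    alpha = np.zeros((n, K)); beta = np.zeros((n, K))
    alpha[0] = pi * B[:, obs[0]]; alpha[0] /= alpha[0].sum()
    for t in range(1, n):
        alpha[t] = (alpha[t - 1] @ T) * B[:, obs[t]]
        alpha[t] /= alpha[t].sum()         # scale to avoid underflow
    beta[-1] = 1.0
    for t in range(n - 2, -1, -1):
        beta[t] = T @ (B[:, obs[t + 1]] * beta[t + 1])
        beta[t] /= beta[t].sum()
    gamma = alpha * beta
    gamma /= gamma.sum(axis=1, keepdims=True)
    xi = np.zeros((K, K))                  # expected transition counts
    for t in range(n - 1):
        x = np.outer(alpha[t], B[:, obs[t + 1]] * beta[t + 1]) * T
        xi += x / x.sum()
    return gamma, xi

def em_step(seqs, lr=0.01):
    """One EM iteration: accumulate expected counts, then a gradient M-step."""
    global E, W, U
    T, B = trans(), emit()
    xi_tot = np.zeros((K, K)); ec = np.zeros((K, V))
    for obs in seqs:                       # E-step over all sequences
        gamma, xi = forward_backward(obs, T, B)
        xi_tot += xi
        for t, w in enumerate(obs):
            ec[:, w] += gamma[t]
    # Gradient of sum_ij counts_ij * log softmax(scores)_ij w.r.t. the scores:
    gT = xi_tot - xi_tot.sum(axis=1, keepdims=True) * T
    gB = ec - ec.sum(axis=1, keepdims=True) * B
    # Backpropagate through scores E W E^T and E U^T to the embeddings.
    E += lr * (gT @ E @ W.T + gT.T @ E @ W + gB @ U)
    W += lr * (E.T @ gT @ E)
    U += lr * (gB.T @ E)

# Toy usage: a few random observation sequences.
seqs = [rng.integers(0, V, size=12) for _ in range(5)]
for _ in range(20):
    em_step(seqs)
```

Because the transition and emission tables are tied through the shared embedding matrix `E`, states with similar behavior end up with nearby vectors, which is presumably the property that lets the representation generalize across domains.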
