论文信息 - Mixture-of-Parents Maximum Entropy Markov Models

Mixture-of-Parents Maximum Entropy Markov Models

We present the mixture-of-parents maximum entropy Markov model (MoP-MEMM), a class of directed graphical models extending MEMMs. The MoP-MEMM allows tractable incorporation of long-range dependencies between nodes by restricting the conditional distribution of each node to be a mixture of distributions given the parents. We show how to efficiently compute the exact marginal posterior node distributions, regardless of the range of the dependencies. This enables us to model non-sequential correlations present within text documents, as well as between interconnected documents, such as hyperlinked web pages. We apply the MoP-MEMM to a named entity recognition task and a web page classification task. In each, our model shows significant improvement over the basic MEMM, and is competitive with other long-range sequence models that use approximate inference.

Ben Taskar | Dan Klein | David S. Rosenberg

[1] Razvan C. Bunescu,et al. Collective Information Extraction with Relational Markov Networks , 2004, ACL.

[2] Ben Taskar,et al. Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[3] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[4] James R. Curran,et al. Language Independent NER using a Maximum Entropy Tagger , 2003, CoNLL.

[5] Rob Malouf,et al. Markov Models for Language-independent Named Entity Recognition , 2002, CoNLL.

[6] Avi Pfeffer,et al. Sufficiency, Separability and Temporal Probabilistic Models , 2001, UAI.

[7] D K Smith,et al. Numerical Optimization , 2001, J. Oper. Res. Soc..

[8] Tom M. Mitchell,et al. Discovering Test Set Regularities in Relational Domains , 2000, ICML.

[9] Piotr Indyk,et al. Enhanced hypertext categorization using hyperlinks , 1998, SIGMOD '98.

[10] Christopher D. Manning,et al. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[11] Ben Taskar,et al. Probabilistic Models of Text and Link Structure for Hypertext Classification , 2001 .