Zero-Anaphora Resolution in Chinese Using Maximum Entropy

In this paper, we propose a learning classifier based on maximum entropy (ME) for resolving zero-anaphora in Chinese text. Besides regular grammatical, lexical, positional and semantic features motivated by previous research on anaphora resolution, we develop two innovative Web-based features for extracting additional semantic information from the Web. The values of the two features can be obtained easily by querying the Web using some patterns. Our study shows that our machine learning approach is able to achieve an accuracy comparable to that of state-of-the-art systems. The Web as a knowledge source can be incorporated effectively into the ME learning framework and significantly improves the performance of our approach.

[1]  Tsutomu Hirao,et al.  Japanese Zero Pronoun Resolution based on Ranking Rules and Machine Learning , 2003, EMNLP.

[2]  Claire Gardent,et al.  Improving Machine Learning Approaches to Coreference Resolution , 2002, ACL.

[3]  Noah A. Smith,et al.  The Web as a Parallel Corpus , 2003, CL.

[4]  J. Darroch,et al.  Generalized Iterative Scaling for Log-Linear Models , 1972 .

[5]  Erhard W. Hinrichs,et al.  A data-driven approach to pronominal anaphora resolution for German , 2007 .

[6]  John Hale,et al.  A Statistical Approach to Anaphora Resolution , 1998, VLC@COLING/ACL.

[7]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Andreas Stolcke,et al.  Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures , 2003, NAACL.

[9]  Na-Rae Han,et al.  Detecting Errors in English Article Usage with a Maximum Entropy Classifier Trained on a Large, Diverse Corpus , 2004, LREC.

[10]  Miles Osborne,et al.  Estimation of Stochastic Attribute-Value Grammars using an Informative Sample , 2000, COLING.

[11]  Yi-Chun Chen,et al.  Zero Anaphora Resolution in Chinese with Shallow Parsing , 2007, J. Chin. Lang. Comput..

[12]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[13]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[14]  Zhou Chang-le Study on Meta-Anaphoric Resolution in Chinese Discourse Understanding , 2002 .

[15]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[16]  Kazuhiro Seki,et al.  A Probabilistic Method for Analyzing Japanese Anaphora Integrating Zero Pronoun Detection and Resolution , 2002, COLING.