Dialogue focus tracking for zero pronoun resolution

We take a novel approach to zero pronoun resolution in Chinese: our model explicitly tracks the flow of focus in a discourse. Our approach, which generalizes to deictic references, is not reliant on the presence of overt noun phrase antecedents to resolve to, and allows us to address the large percentage of “non-anaphoric” pronouns filtered out in other approaches. We furthermore train our model using readily available parallel Chinese/English corpora, allowing for training without hand-annotated data. Our results demonstrate improvements on two test sets, as well as the usefulness of linguistically motivated features.

[1]  Fang Kong,et al.  A Tree Kernel-Based Unified Framework for Chinese Zero Anaphora Resolution , 2010, EMNLP.

[2]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[3]  John Langford,et al.  Efficient programmable learning to search , 2014, ArXiv.

[4]  Chen Chen,et al.  Chinese Zero Pronoun Resolution: Some Recent Advances , 2013, EMNLP.

[5]  Soo-Ok Kweon Processing Null and Overt Pronoun Subject in Ambiguous Sentences in Korean , 2011 .

[6]  John Langford,et al.  Search-based structured prediction , 2009, Machine Learning.

[7]  Marine Carpuat,et al.  Improving Statistical Machine Translation Using Word Sense Disambiguation , 2007, EMNLP.

[8]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[9]  Richard Cameron,et al.  Pronominal and null subject variation in Spanish : constraints, dialects, and functional compensation , 1992 .

[10]  Chen Chen,et al.  Chinese Zero Pronoun Resolution: An Unsupervised Probabilistic Model Rivaling Supervised Resolvers , 2014, EMNLP.

[11]  Yoav Goldberg,et al.  Language-Independent Parsing with Empty Elements , 2011, ACL.

[12]  Maria Nella Carminati,et al.  The processing of Italian subject pronouns , 2002 .

[13]  David Gil,et al.  The World Atlas of Language Structures , 2005 .

[14]  Mitchell P. Marcus,et al.  OntoNotes: The 90% Solution , 2006, NAACL.

[15]  Hwee Tou Ng,et al.  Identification and Resolution of Chinese Zero Pronouns: A Machine Learning Approach , 2007, EMNLP.

[16]  Yi-Chun Chen,et al.  Zero Anaphora Resolution in Chinese with Shallow Parsing , 2007, J. Chin. Lang. Comput..

[17]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[18]  Michael C. Frank,et al.  Markers of Topical Discourse in Child-Directed Speech , 2014, Cogn. Sci..

[19]  Lyn Frazier,et al.  Null vs. overt pronouns and the Topic-Focus articulation in Spanish: 2704 , 2002 .