Probabilistic Coordination Disambiguation in a Fully-Lexicalized Japanese Parser

This paper describes a probabilistic model for coordination disambiguation integrated into syntactic and case structure analysis. Our model probabilistically assesses the parallelism of a candidate coordinate structure using syntactic/semantic similarities and cooccurrence statistics. We integrate these probabilities into the framework of fully-lexicalized parsing based on largescale case frames. This approach simultaneously addresses two tasks of coordination disambiguation: the detection of coordinate conjunctions and the scope disambiguation of coordinate structures. Experimental results on web sentences indicate the effectiveness of our approach.

[1]  Rajeev Agarwal,et al.  A Simple but Useful Approach to Conjunct Identification , 1992, ACL.

[2]  Daisuke Kawahara,et al.  A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis , 2006, HLT-NAACL.

[3]  Daisuke Kawahara,et al.  A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis (Special Issue : "Collection of Best Annual Papers" Organized for the 20th Anniversary of the Association for Natural Language Processing) , 2006 .

[4]  Manabu Sassano,et al.  Linear-Time Dependency Analysis for Japanese , 2004, COLING.

[5]  Miriam Goldberg,et al.  An Unsupervised Model for Statistically Determining Coordinate Phrase Attachment , 1999, ACL.

[6]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[7]  Makoto Nagao,et al.  Building a Japanese parsed corpus while improving the parsing system , 1997 .

[8]  Frank Keller,et al.  Integrating Syntactic Priming into an Incremental Probabilistic Parser, with an Application to Psycholinguistic Modeling , 2006, ACL.

[9]  Kikuo Maekawa Kotonoha , the Corpus Development Project of the National Institute for Japanese Language , 2006 .

[10]  Sadao Kurohashi Analyzing Coordinate Structures Including Punctuation in English , 1995, IWPT.

[11]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[12]  Yuji Matsumoto,et al.  Japanese Dependency Analysis using Cascaded Chunking , 2002, CoNLL.

[13]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[14]  A. Kilgarriff,et al.  Disambiguating coordinations using word distribution information , 2005 .

[15]  Daisuke Kawahara,et al.  Case Frame Compilation from the Web using High-Performance Computing , 2006, LREC.

[16]  Makoto Nagao,et al.  A Syntactic Analysis Method of Long Japanese Sentences Based on the Detection of Conjunctive Structures , 1994, CL.