The Hinoki syntactic and semantic treebank of Japanese

In this paper we describe the current state of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definitions, examples and news text, and uses an HPSG based Japanese grammar to encode both syntactic and semantic information. It is combined with an ontology based on the definition sentences to give a detailed sense level description of the most familiar 28,000 words of Japanese.

[1]  Makoto Nagao,et al.  Building A Japanese Parsed Corpus , 2003 .

[2]  Francis Bond,et al.  The Hinoki Sensebank — A Large-Scale Word Sense Tagged Corpus of Japanese — , 2006 .

[3]  Eric Nichols,et al.  Robust Ontology Acquisition from Machine-Readable Dictionaries , 2005, IJCAI.

[4]  Stephan Oepen,et al.  Stochastic HPSG Parse Disambiguation using the Redwoods Corpus , 2005 .

[5]  Wolfgang Wahlster,et al.  Verbmobil: Foundations of Speech-to-Speech Translation , 2000, Artificial Intelligence.

[6]  Eric Nichols,et al.  Acquiring an Ontology for a Fundamental Vocabulary , 2004, COLING.

[7]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[8]  Makoto Nagao,et al.  Building a Japanese parsed corpus while improving the parsing system , 1997 .

[9]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[10]  Stephan Oepen,et al.  Stochastic HPSG Parse Selection using the Redwoods Corpus , 2005 .

[11]  Mitchell P. Marcus,et al.  OntoNotes: The 90% Solution , 2006, NAACL.

[12]  Melanie Siegel,et al.  HPSG Analysis of Japanese , 2000 .

[13]  Martha Palmer,et al.  The English all-words task , 2004, SENSEVAL@ACL.

[14]  Timothy Baldwin,et al.  Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information , 2007, EMNLP.

[15]  Eric Nichols,et al.  The Hinoki Treebank A Treebank for Text Understanding , 2004, IJCNLP.

[16]  Stephan Oepen,et al.  High Precision Treebanking—Blazing Useful Trees Using POS Information , 2005, ACL.

[17]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[18]  Françoise BOLÍVAR,et al.  WORD SENSE DISAMBIGUATION , 2008 .

[19]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[20]  Christopher D. Manning,et al.  LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[21]  Mark Stevenson,et al.  Word sense disambiguation , 2002 .

[22]  Ulrich Callmeier,et al.  PET – a platform for experimentation with efficient HPSG processing techniques , 2000, Natural Language Engineering.

[23]  Kiyoaki Shirai Construction of a Word Sense Tagged Corpus for SENSEVAL-2 Japanese Dictionary Task , 2002, LREC.

[24]  Stephan Oepen,et al.  Exploiting Semantic Information for HPSG Parse Selection , 2007, ACL 2007.

[25]  Stephan Oepen,et al.  LinGO Redwoods , 2004 .

[26]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.