OntoNotes: The 90% Solution

We describe the OntoNotes methodology and its result, a large, multilingual, richly annotated corpus constructed at 90% interannotator agreement. An initial portion (300K words of English newswire and 250K words of Chinese newswire) will be made available to the community during 2007.
