Inducing Ontological Co-occurrence Vectors

In this paper, we present an unsupervised methodology for propagating lexical co-occurrence vectors into an ontology such as WordNet. We evaluate the framework on the task of automatically attaching new concepts to the ontology. Experimental results show 73.9% attachment accuracy in the first position and 81.3% accuracy in the top-5 positions. This framework could serve as a foundation for ontologizing lexical-semantic resources and assist in the development of other large-scale and internally consistent collections of semantic information.
