Automatic Extraction of Semantic Relationships for WordNet by Means of Pattern Learning from Wikipedia

This paper describes an automatic approach for identifying, in an on-line encyclopedia, lexical patterns that represent semantic relationships between concepts. These patterns can then be applied to extend existing ontologies or semantic networks with new relations. The experiments were performed with the Simple English Wikipedia and WordNet 1.7. A new algorithm was devised for automatically generalising the lexical patterns found in the encyclopedia entries. We found general patterns for the hyperonymy, hyponymy, holonymy and meronymy relations and, using them, extracted more than 1200 new relationships that did not originally appear in WordNet. The precision of these relationships ranges between 0.61 and 0.69, depending on the relation.
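The abstract does not spell out the generalisation algorithm, so the following is only a simplified illustration of the general idea of merging lexical patterns and applying the result to text. It assumes patterns are token lists of equal length; the placeholder names `<hook>` and `<target>`, the wildcard `*`, and the functions `generalise` and `match` are hypothetical and not taken from the paper.

```python
# Hypothetical placeholders: <hook> is the defined concept, <target> the related one.
HOOK, TARGET, WILDCARD = "<hook>", "<target>", "*"

def generalise(p1, p2):
    """Merge two equal-length token patterns; differing tokens become a wildcard.

    Simplifying assumption: patterns of different length are not merged.
    """
    if len(p1) != len(p2):
        return None
    return [a if a == b else WILDCARD for a, b in zip(p1, p2)]

def match(pattern, tokens):
    """Return (hook, target) if the tokens fit the pattern, else None."""
    if len(pattern) != len(tokens):
        return None
    found = {}
    for p, t in zip(pattern, tokens):
        if p in (HOOK, TARGET):
            found[p] = t          # capture the word filling the slot
        elif p != WILDCARD and p != t:
            return None           # a literal token did not match
    if HOOK in found and TARGET in found:
        return found[HOOK], found[TARGET]
    return None

# Two specific patterns that differ only in the article.
p1 = "<hook> is a <target>".split()
p2 = "<hook> is an <target>".split()
general = generalise(p1, p2)

# Applying the generalised pattern to a tokenised sentence yields a candidate pair.
pair = match(general, "sparrow is a bird".split())
```

Under these assumptions, `general` keeps the shared tokens and replaces the differing article with the wildcard, and `pair` is a candidate hyperonymy relation that would still need to be checked against WordNet.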
