The ontogram-approach to text processing and semantic relation spotting for indexing

This paper describes the OntoGram-approach to indexing texts by their conceptual content using ontologies along with syntactic grammars and lexico-syntactic information and semantic role assignment provided by lexical resources. The conceptual content of meaningful chunks of text is transformed into concept feature structures and mapped into concepts in a generative ontology. By this approach, synonymous but linguistically quite distinct expressions are mapped to the same concept in the ontology. This allows us to perform a content-based search which will retrieve relevant documents independently of the linguistic form of the query as well as the documents.

[1]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[2]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[3]  Dan I. Moldovan,et al.  On the semantics of noun compounds , 2005, Comput. Speech Lang..

[4]  Preslav Nakov,et al.  Classification of semantic relations between nominals , 2009, Lang. Resour. Evaluation.

[5]  Paola Velardi,et al.  Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites , 2004, CL.

[6]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[7]  Steffen Staab,et al.  Learning Ontologies for the Semantic Web , 2001 .

[8]  Tine Lassen,et al.  Uncovering Prepositional Senses , 2010 .

[9]  Jørgen Fischer Nilsson,et al.  ONTOGRABBING: Extracting Information from Texts Using Generative Ontologies , 2009, FQAS.

[10]  David Sánchez,et al.  Pattern-based automatic taxonomy learning from the Web , 2008, AI Commun..

[11]  Ion Muslea,et al.  Extraction Patterns for Information Extraction Tasks: A Survey , 1999 .

[12]  Timothy Baldwin,et al.  Automatic Interpretation of Noun Compounds Using WordNet Similarity , 2005, IJCNLP.

[13]  R. Girju,et al.  A knowledge-rich approach to identifying semantic relations between nominals , 2010, Inf. Process. Manag..

[14]  Tony Veale,et al.  Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds , 2006, ACL.

[15]  Martha Palmer,et al.  Verbnet: a broad-coverage, comprehensive verb lexicon , 2005 .

[16]  Irene Vogel,et al.  Cross-disciplinary issues in compounding , 2010 .

[17]  Jørgen Fischer Nilsson A logico-algebraic framework for ontologies , 2001 .

[18]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[19]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[20]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[21]  Frank van Harmelen,et al.  Extraction and use of linguistic patterns for modelling medical guidelines , 2007, Artif. Intell. Medicine.

[22]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[23]  Ellen Riloff,et al.  Exploiting Role-Identifying Nouns and Expressions for Information Extraction , 2007 .

[24]  Troels Andreasen,et al.  Grammatical specification of domain ontologies , 2004, Data Knowl. Eng..