Implementing a Semantic Lexicon

The human lexicon is emerging as the single most important aspect of modern NLP applications. Current research focus ranges from the organization and structure of lexicons to fleshing it out with lexical entries with their full semantics. We have implemented a model of the human lexicon that addresses the automatic acquisition of the lexicon as well as the representation of the semantic content of individual lexical entries. We have used Conceptual Structures as the representation scheme. The system analyzes paragraphs of English sentences, maps the extracted lexical and semantic knowledge to CS graphs, and reasons about them. The system augments its vocabulary by using a persistency mechanism that restores the previously defined lexical items in later sessions.

[1]  H. Alshawi,et al.  Analysing the dictionary definitions , 1989 .

[2]  Paul Procter,et al.  Longman Dictionary of Contemporary English , 1978 .

[3]  Ralph Grishman,et al.  Comlex Syntax: Building a Computational Lexicon , 1994, COLING.

[4]  Ralph Grishman,et al.  Developing multiple tagged corpora for lexical research , 1994 .

[5]  Ramanathan V. Guha,et al.  Cyc: toward programs with common sense , 1990, CACM.

[6]  Nicoletta Calzolari,et al.  Methods and Tools for Lexical Acquisition , 1990, EAIA.

[7]  Michael P. Barnett Computer programming in English , 1969 .

[8]  Mark Lauer How much is enough?: Data requirements for statistical NLP , 1995, ArXiv.

[9]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[10]  Lucy Vanderwende,et al.  Automatically Identifying Morphological Relations in = Machine-Readable Dictionaries , 1994, ArXiv.

[11]  Mark Lauer Conserving Fuel in Statistical Language Learning: Predicting Data Requirements , 1995, ArXiv.

[12]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[13]  Jaime G. Carbonell,et al.  A tutorial on natural-language processing , 1981, ACM '81.

[14]  Ralph Grishman,et al.  The Comlex Syntax Project: The First Year , 1994, HLT.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  William B. Dolan,et al.  Word Sense Ambiguation: Clustering Related Senses , 1994, COLING.

[17]  David R. Barstow A Knowledge-Based System for Automatic Program Construction , 1977, IJCAI.

[18]  Gilles Sérasset lnterlinguai Lexical Organisation for Multilingual Lexical Databases in NADIA , 1994, COLING.

[19]  Ivan A. Sag,et al.  Information-based syntax and semantics , 1987 .

[20]  Lucy Vanderwende Ambiguity in the Acquisition of Lexical Information , 1995, ArXiv.

[21]  Ralph Grishman,et al.  Creating a common syntactic dictionary of English , 1994 .

[22]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[23]  Antonio Sanfilippo LKB encoding of lexical knowledge , 1994 .

[24]  Ramanathan V. Guha,et al.  Enabling agents to work together , 1994, CACM.

[25]  Hsin-Hsi Chen,et al.  Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and its Automatic Evaluation , 1994, ACL.

[26]  George E. Heidorn,et al.  English as a very high level language for simulation programming , 1974, SIGPLAN Symposium on Very High Level Languages.

[27]  Daniel G. Bobrow,et al.  Natural Language Input for a Computer Problem Solving System , 1964 .

[28]  Alan W. Biermann,et al.  Approaches to Automatic Programming , 1976, Adv. Comput..

[29]  Walter Daelemans,et al.  Memory-based lexical acquisition and processing , 1993, EAMT.

[30]  C. Cordell Green The design of the PSI program synthesis system , 1976, ICSE '76.

[31]  Julian Kupiec,et al.  Augmenting a Hidden Markov Model for Phrase-Dependent Word Tagging , 1989, HLT.

[32]  George E. Heidorn Automatic Programming Through Natural Language Dialogue: A Survey , 1976, IBM J. Res. Dev..

[33]  James R. Slagle,et al.  A System that Translates Conceptual Structures into English , 1992, Workshop on Conceptual Graphs.

[34]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[35]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[36]  Martin Kay,et al.  Parsing in functional unification grammar , 1986 .

[37]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[38]  Ramanathan V. Guha,et al.  CYC: A Midterm Report , 1990, AI Mag..

[39]  Lucy Vanderwende,et al.  Automatically Deriving Structured Knowledge Bases From On-Line Dictionaries , 1993 .

[40]  Inderjeet Mani,et al.  Knowledge and natural language processing , 1990, CACM.

[41]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[42]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.