Abstract ID: A229This paper presents lexical attraction models of lan-guage, in which the only explicitly represented linguis-tic knowledge is the likelihood of pairwise relations be-tween words. This is in contrast with models that rep-resent linguistic knowledge in terms of a lexicon, whichassigns categories to each word, and a grammar, whichexpresses possible combinations in terms of these cat-egories. The word-based nature and the simplicity oflexical attraction models make them good candidatesfor experiments in language learning. I introduce anunsupervised learning algorithm that uses lexical at-traction and gives accuracy results comparable to su-pervised learning. Content Areas: Natural Language Processing #Techniques or Algorithms # statistical or corpus basedmethods, Machine Learning and Discovery # Tasks orProblems # unsupervised learning Introduction The information in a sentence is contained partly inits words and partly in the relationships between thewords. The main task of natural language processing isto identify the relationships between the words. Whendeciding whether two words are related we typically usetwo types of information. First, there are grammaticalconstraints, e.g. each determiner must modify a nounor transitive verbs must take objects. However, gram-matical constraints are typically not restrictive enoughto uniquely identify the correct relations. Second, thereare selectional restrictions, e.g. given the verb eat, cakewouldbe a morelikely objectthan train. Lexicalattrac-tion models encode selectional restrictions. Ordinarily,one would need both types of information to identifythe correct structure of a sentence. I will show how farwe can go using lexical attraction alone.The next section presents examples that demonstratewhen grammatical constraints are not sufficient andlexical attraction is. The third section formalizes themodel and expresses the meaning of my opening sen-tence in the language of information theory. The nextsection gives some basic results on dependency struc-tures. The section on unsupervised learning gives thelearning algorithm and evaluation results. The pa-per ends with a discussion of related work and futureprospects.
[1]
Daniel Dominic Sleator,et al.
Parsing English with a Link Grammar
,
1995,
IWPT.
[2]
Ann Bies,et al.
The Penn Treebank: Annotating Predicate Argument Structure
,
1994,
HLT.
[3]
Deniz Yuret,et al.
Discovery of linguistic relations using lexical attraction
,
1998,
ArXiv.
[4]
John Lafferty,et al.
A Model of Lexical Attraction and Repulsion
,
1997,
Annual Meeting of the Association for Computational Linguistics.
[5]
Igor Mel’čuk,et al.
Dependency Syntax: Theory and Practice
,
1987
.
[6]
Mark A. Paskin,et al.
Grammatical Bigrams
,
2001,
NIPS.
[7]
Judea Pearl,et al.
Probabilistic reasoning in intelligent systems - networks of plausible inference
,
1991,
Morgan Kaufmann series in representation and reasoning.
[8]
A. Hasman,et al.
Probabilistic reasoning in intelligent systems: Networks of plausible inference
,
1991
.
[9]
John Lafferty,et al.
Grammatical Trigrams: A Probabilistic Model of Link Grammar
,
1992
.
[10]
John D. Lafferty,et al.
Text Segmentation Using Exponential Models
,
1997,
EMNLP.
[11]
Frederick Jelinek,et al.
Exploiting Syntactic Structure for Language Modeling
,
1998,
ACL.
[12]
Ronald L. Graham,et al.
Concrete mathematics - a foundation for computer science (2. ed.)
,
1994
.
[13]
Haim Gaifman,et al.
Dependency Systems and Phrase-Structure Systems
,
1965,
Inf. Control..
[14]
Richard Hudson,et al.
English word grammar
,
1995
.
[15]
Sang Joon Kim,et al.
A Mathematical Theory of Communication
,
2006
.
[16]
Jason Eisner,et al.
Three New Probabilistic Models for Dependency Parsing: An Exploration
,
1996,
COLING.
[17]
Thomas M. Cover,et al.
Elements of Information Theory
,
2005
.
[18]
Frank Harary,et al.
Graph Theory
,
2016
.
[19]
菅山 謙正,et al.
Word Grammar 理論の研究
,
2005
.
[20]
Noam Chomsky,et al.
वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax
,
1965
.
[21]
Dan Klein,et al.
Corpus-Based Induction of Syntactic Structure: Models of Dependency and Constituency
,
2004,
ACL.