The three corpus-based statistical sense resolution methods studied here attempt to infer the correct sense of a polysemous word by using knowledge about patterns of word cooccurrences. The techniques were based on Bayesian decision theory, neural, networks, and content vectors as used in information retrieval. To understand these methods better, we posed a very specific problem: given a set of contexts, each containing the noun line in a known sense, construct a classifier that selects the correct sense of line for new contexts. To see how the degree of polysemy affects performance, results from three- and six-sense tasks are compared.The results demonstrate that each of the techniques is able to distinguish six senses of line with an accuracy greater than 70%. Furthermore, the response patterns of the classifiers are, for the most part, statistically indistinguishable from one another. Comparison of the two tasks suggests that the degree of difficulty involved in resolving individual senses is a greater performance factor than the degree of polysemy.
[1]
Geoffrey E. Hinton,et al.
Learning internal representations by error propagation
,
1986
.
[2]
George A. Miller,et al.
Introduction to WordNet: An On-line Lexical Database
,
1990
.
[3]
Gerard Salton,et al.
A vector space model for automatic indexing
,
1975,
CACM.
[4]
Yaacov Choueka,et al.
Disambiguation by short contexts
,
1985,
Comput. Humanit..
[5]
F. Mosteller,et al.
Inference and Disputed Authorship: The Federalist
,
1966
.
[6]
G. Zipf.
The meaning-frequency relationship of words.
,
1945,
The Journal of general psychology.
[7]
David Yarowsky,et al.
Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora
,
2010,
COLING.
[8]
Ellen M. Voorhees,et al.
Learning context to disambiguate word senses
,
1992
.
[9]
David Yarowsky,et al.
A method for disambiguating word senses in a large corpus
,
1992,
Comput. Humanit..