论文信息 - Predicting Strong Associations on the Basis of Corpus Data

Predicting Strong Associations on the Basis of Corpus Data

Current approaches to the prediction of associations rely on just one type of information, generally taking the form of either word space models or collocation measures. At the moment, it is an open question how these approaches compare to one another. In this paper, we will investigate the performance of these two types of models and that of a new approach based on compounding. The best single predictor is the log-likelihood ratio, followed closely by the document-based word space model. We will show, however, that an ensemble method that combines these two best approaches with the compounding algorithm achieves an increase in performance of almost 30% over the current state of the art.

Yves Peirsman | Dirk Geeraerts

[1] Peter W. Foltz,et al. Latent semantic analysis for text-based research , 1996 .

[2] J. Aitchison. Words in the Mind: An Introduction to the Mental Lexicon , 1987 .

[3] Hinrich Schütze,et al. Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[4] J. Aitchison. Words in the mind , 1994 .

[5] Sabine Schulte im Walde,et al. Identifying Semantic Relations and Functional Properties of Human Verb Associations , 2005, HLT/EMNLP.

[6] J. Bullinaria,et al. Extracting semantic representations from word co-occurrence statistics: A computational study , 2007, Behavior research methods.

[7] Theodore Alexandrov,et al. Does Latent Semantic Analysis Reflect Human Associations ? , 2008 .

[8] Magnus Sahlgren,et al. The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .

[9] W. Lowe,et al. The Direct Route: Mediated Priming in Semantic Space , 2000 .

[10] T. Landauer,et al. A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[11] Gertjan van Noord,et al. At Last Parsing Is Now Operational , 2006, JEPTALNRECITAL.