Corpora and collocations

3 Cooccurrence and frequency counts 11 12 3.1 Surface cooccurrence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 13 3.2 Textual cooccurrence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 14 3.3 Syntactic cooccurrence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 15 3.4 Comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 16

[1]  Stefan Evert,et al.  Methods for the Qualitative Evaluation of Lexical Association Measures , 2001, ACL.

[2]  Frank Keller,et al.  Determinants of Adjective-Noun Plausibility , 1999, EACL.

[3]  J. Silva,et al.  A Local Maxima method and a Fair Dispersion Normalization for extracting multi-word units from corpora , 2009 .

[4]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[5]  José Gabriel Pereira Lopes,et al.  Language Independent Automatic Acquisition of Rigid Multiword Units from Unrestricted Text Corpora , 1999 .

[6]  Stefan Evert,et al.  Experiments on Candidate Data for Collocation Extraction , 2003, EACL.

[7]  Stefan Evert,et al.  How Random is a Corpus? The Library Metaphor , 2006 .

[8]  Magnus Sahlgren,et al.  The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .

[9]  J. Firth,et al.  Selected papers of J. R. Firth, 1952-59 , 1968 .

[10]  Walter L. Smith Probability and Statistics , 1959, Nature.

[11]  Michael Stubbs,et al.  COLLOCATIONS AND SEMANTIC PROFILES: ON THE CAUSE OF THE TROUBLE WITH QUANTITATIVE STUDIES , 1995 .

[12]  L. Dekang,et al.  Extracting collocations from text corpora , 1998 .

[13]  Stefan Evert,et al.  Using small random samples for the manual evaluation of statistical association measures , 2005, Comput. Speech Lang..

[14]  Vasileios Hatzivassiloglou,et al.  Translating Collocations for Bilingual Lexicons: A Statistical Approach , 1996, CL.

[15]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[16]  Charles L. A. Clarke,et al.  Fast Computation of Lexical Affinity Models , 2004, COLING.

[17]  F. Yates,et al.  Tests of Significance for 2 × 2 Contingency Tables , 1984 .

[18]  Franz Josef Hausmann Was sind eigentlich Kollokationen , 2004 .

[19]  Timothy Baldwin,et al.  Multiword Expressions: A Pain in the Neck for NLP , 2002, CICLing.

[20]  Pavel Pecina,et al.  Combining Association Measures for Collocation Extraction , 2006, ACL.

[21]  Michael Schiehlen Annotation Strategies for Probabilistic Parsing in German , 2004, COLING.

[22]  Daniel Jurafsky,et al.  Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem? , 2001, EMNLP.

[23]  Ted Pedersen,et al.  Fishing for Exactness , 1996, ArXiv.

[24]  Mark Johnson,et al.  Unsupervised learning of multi-word verbs , 2001 .

[25]  F. Grossmann,et al.  Les collocations: analyse et traitement. , 2003 .

[26]  Richard Poole,et al.  Oxford collocations dictionary for students of English , 2009 .

[27]  John Sinclair,et al.  Corpus, Concordance, Collocation , 1991 .

[28]  Darren Pearce,et al.  Synonymy in collocation extraction , 2001 .

[29]  Darren Pearce A Comparative Evaluation of Collocation Extraction Techniques , 2002, LREC.

[30]  Gaël Dias,et al.  Using Masks, Suffix Array-based Data Structures and Multidimensional Arrays to Compute Positional Ngram Statistics from Corpora , 2003, ACL 2003.

[31]  Pavel Pecina An Extensive Empirical Study of Collocation Extraction Methods , 2005, ACL.

[32]  J. Sinclair Collins Cobuild English dictionary , 1995 .

[33]  Suzanne Stevenson,et al.  Statistical Measures of the Semi-Productivity of Light Verb Constructions , 2004 .

[34]  Adam Kilgarriff,et al.  The Sketch Engine , 2004 .