Identifying dictionary-relevant formulaic sequences in written and spoken corpora

[1]  Héctor Martínez Alonso,et al.  Multiword Expressions: Between Lexicography and NLP , 2018, International Journal of Lexicography.

[2]  Kaja Dobrovoljc,et al.  Multi-word discourse markers and their corpus-driven identification: The case of MWDM extraction from the reference corpus of spoken Slovene , 2017 .

[3]  C. Westbury,et al.  Processing Advantages of Lexical Bundles: Evidence from Self-Paced Reading and Sentence Recall Tasks. , 2011 .

[4]  Norbert Schmitt,et al.  A Phrasal Expressions List , 2012 .

[5]  Tony McEnery,et al.  Collocations in Corpus‐Based Language Learning Research: Identifying, Comparing, and Interpreting the Evidence , 2017 .

[6]  D. Verdonik,et al.  A speech corpus as a source of lexical information , 2016 .

[7]  Stefan Th. Gries,et al.  50-something years of work on collocations: What is or should be next … , 2013 .

[8]  Gries Stefan Th. Some Current Quantitative Problems in Corpus Linguistics and a Sketch of Some Solutions , 2015 .

[9]  Douglas Biber,et al.  A corpus-driven approach to formulaic language in English: multi-word patterns in speech and writing , 2009 .

[10]  Paul Baker,et al.  Investigating Criterial Discourse Features across Second Language Development: Lexical Bundles in Rated Learner Essays, CEFR B1, B2 and C1 , 2014 .

[11]  D. Siepmann Dictionaries and Spoken Language: A Corpus-Based Review of French Dictionaries , 2015 .

[12]  N. Ellis,et al.  An Academic Formulas List: New Methods in Phraseology Research , 2010 .

[13]  Carlos Ramisch,et al.  Unsupervised Compositionality Prediction of Nominal Compounds , 2019, CL.

[14]  D. Biber,et al.  If you look at …: Lexical Bundles in University Teaching and Textbooks , 2004 .

[15]  Pavel Pecina,et al.  Lexical association measures and collocation extraction , 2009, Lang. Resour. Evaluation.

[16]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[17]  Yves Bestgen Evaluating the frequency threshold for selecting lexical bundles by means of an extension of the Fisher's exact test , 2018 .

[18]  B. Erman,et al.  The idiom principle and the open choice principle , 2000 .

[19]  Simon Krek,et al.  Discovering Automated Lexicography: The Case of the Slovene Lexical Database , 2016 .

[20]  Carlos Ramisch,et al.  Multiword Expressions Acquisition , 2015, Theory and Applications of Natural Language Processing.

[21]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[22]  Simon Krek,et al.  Compilation, transcription and usage of a reference speech corpus: the case of the Slovene corpus GOS , 2013, Language Resources and Evaluation.

[23]  Rufus H. Gouws,et al.  A Lexicographical Perspective on the Classification of Multiword Combinations , 2014 .

[24]  Sylviane Granger,et al.  From General to Learners’ Bilingual Dictionaries: Towards a More Effective Fulfilment of Advanced Learners’ Phraseological Needs , 2016 .

[25]  Yves Bestgen,et al.  Comparing Lexical Bundles across Corpora of Different Sizes: The Zipfian Problem , 2019, J. Quant. Linguistics.

[26]  Kathy Conklin,et al.  Formulaic Sequences: Are They Processed More Quickly than Nonformulaic Language by Native and Nonnative Speakers? , 2008 .