Contextualized Usage-Based Material Selection

In this paper, we combine several NLP-functionalities to organize examples drawn from corpora. The application’s primary target audience are language learners. Currently, authentic linguistic examples for a given keyword search are often organized alphabetically according to context. From this, it is not always clear which contextual regularities actually exist on a syntactic, collocational and semantic level. Showing information at different levels of abstraction will help with the discovery of linguistic regularities and thus improve linguistic understanding. Practically this translates in a system that groups retrieved results on syntactic grounds, after which the examples are further organized at the hand of semantic similarity within certain phrasal slots. Visualization algorithms are then used to show focused information in phrasal slots, laying bare semantic restrictions within the construction.

[1]  Hidetoshi Shimodaira,et al.  Pvclust: an R package for assessing the uncertainty in hierarchical clustering , 2006, Bioinform..

[2]  Joan L. Bybee,et al.  From Usage to Grammar: The Mind's Response to Repetition , 2007 .

[3]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[4]  Carol A. Chapelle,et al.  Multimedia CALL: Lessons to be Learned from Research on Instructed SLA , 1998 .

[5]  R. Sternberg Most vocabulary is learned from context. , 1987 .

[6]  William Croft,et al.  Radical Construction Grammar: Syntactic Theory in Typological Perspective , 2001 .

[7]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[8]  Anna Papst,et al.  Learning Vocabulary In Another Language , 2016 .

[9]  Nathan Halko,et al.  Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions , 2009, SIAM Rev..

[10]  A. Goldberg Constructions at Work: The Nature of Generalization in Language , 2006 .

[11]  Stefan Th. Gries,et al.  Collostructions: Investigating the interaction of words and constructions , 2003 .

[12]  Walt Detmar Meurers,et al.  Linguistically Aware Information Retrieval: Providing Input Enrichment for Second Language Learners , 2016, BEA@NAACL-HLT.

[13]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[14]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[15]  M. Tomasello Constructing a Language: A Usage-Based Theory of Language Acquisition , 2003 .

[16]  Vladimir M. Sloutsky,et al.  Learning Words from Context , 2008 .