An automatic collocation writing assistant for Taiwanese EFL learners: A case of corpus-based NLP technology

Previous work in the literature reveals that EFL learners were deficient in collocations that are a hallmark of near native fluency in learner's writing. Among different types of collocations, the verb-noun (V-N) one was found to be particularly difficult to master, and learners' first language was also found to heavily influence their collocation production. In this paper, we develop an online collocation aid for EFL writers in Taiwan, aiming at detecting and correcting of learners' miscollocations attributable to L1 interference. Relevant correct collocation as feedback messages is suggested according to the translation equivalents between learner's L1 and L2. The system utilizes natural language processing (NLP) techniques to segment sentences in order to extract V-N collocations in given texts, and to derive a list of candidate English verbs that share the same Chinese translations via consulting electronic bilingual dictionaries. After combining nouns with these derived candidate verbs as V-N pairs, the system makes use of a reference corpus to exclude the inappropriate V-N pairs and single out the proper collocations. The system can effectively pinpoint the miscollocations and provide the learner with adequate collocations that the learner intends to write but misuses. It is hoped that this online assistant can facilitate EFL learner-writers' collocation use and help them transfer this essential knowledge to their future writing.