SMILLE for Portuguese: Annotation and Analysis of Grammatical Structures in a Pedagogical Context

In Second Language Acquisition (SLA), exposing learners to authentic material is an important learning step, but raw text can pose problems, because learners may overlook the information they should be focusing on. In this paper, we present SMILLE for Portuguese, a system for detecting pedagogically relevant grammatical structures in raw texts. SMILLE’s rules for recognizing grammatical structures were evaluated on random sentences from three different genres, achieving an overall precision of 84%. The automatic recognition of pedagogically relevant grammatical structures can help teachers and course coordinators make better-informed choices of texts for language courses, while also allowing for the analysis of grammar profiles for SLA. As a case study, we used SMILLE to analyze pedagogical material used in a Portuguese as a Foreign Language course and to observe how the predominance of grammatical content in the texts relates to the stated grammatical focus of each language level.
