Automatic Identification of Discourse Moves in Scientific Article Introductions

This paper reports on the first stage of building an educational tool for international graduate students to improve their academic writing skills. Taking a text-categorization approach, we experimented with several models to automatically classify sentences in research article introductions into one of three rhetorical moves. The paper begins by situating the project within the larger framework of intelligent computer-assisted language learning. It then presents the details of the study with very encouraging results. The paper then concludes by commenting on how the system may be improved and how the project is intended to be pursued and evaluated.

[1]  Martin Chodorow,et al.  Automated Essay Scoring for Nonnative English Speakers , 1999 .

[2]  Lawrence M. Rudner,et al.  Automated Essay Scoring Using Bayes' Theorem , 2002 .

[3]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[4]  Mark Warschauer,et al.  Automated writing evaluation: defining the classroom research agenda , 2006 .

[5]  Carol A. Chapelle,et al.  Challenges in Evaluation of Innovation: Observations from Technology Research , 2007 .

[6]  Michael H. Long Process and Product in ESL Program Evaluation. , 1984 .

[7]  Jill Burstein,et al.  Automated Essay Scoring : A Cross-disciplinary Perspective , 2003 .

[8]  Laurence Anthony,et al.  Mover: a machine learning tool to assist in the reading and writing of technical papers , 2003 .

[9]  Tom Mitchell,et al.  Towards robust computerised marking of free-text responses , 2002 .

[10]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[11]  Na-Rae Han,et al.  Detecting errors in English article usage by non-native speakers , 2006, Natural Language Engineering.

[12]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[13]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[14]  Na-Rae Han,et al.  Detection of Grammatical Errors Involving Prepositions , 2007, ACL 2007.

[15]  Jill Burstein,et al.  The E-rater® scoring engine: Automated essay scoring with natural language processing. , 2003 .

[16]  Michael A. Shepherd,et al.  Support vector machines for text categorization , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[17]  John M. Swales,et al.  Genre Analysis: English in Academic and Research Settings , 1993 .

[18]  J. Swales Research Genres: Explorations and Applications , 2004 .

[19]  Robert J. Mislevy,et al.  Automated scoring of complex tasks in computer-based testing , 2006 .

[20]  Ingo Mierswa,et al.  YALE: rapid prototyping for complex data mining tasks , 2006, KDD '06.

[21]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[22]  Daniel Marcu,et al.  Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays , 2003, IEEE Intell. Syst..

[23]  John M. Levis,et al.  A project-based approach to teaching research writing to nonnative writers , 2003 .

[24]  S. Kushner,et al.  Tackling the needs of foreign academic writers: a case study , 1997 .

[25]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.