论文信息 - IDENTIFYING SPEECH ACTS IN E-MAILS: TOWARD AUTOMATED SCORING OF THE TOEIC® E-MAIL TASK - 字舞流文

IDENTIFYING SPEECH ACTS IN E-MAILS: TOWARD AUTOMATED SCORING OF THE TOEIC® E-MAIL TASK

This study proposes an approach to automatically score the TOEIC® Writing e-mail task. We focus on one component of the scoring rubric, which notes whether the test-takers have used particular speech acts such as requests, orders, or commitments. We developed a computational model for automated speech act identification and tested it on a corpus of TOEIC responses, achieving up to 79.28% accuracy. This model represents a positive first step toward the development of a more comprehensive scoring model. We also created a corpus of speech act-annotated native English workplace e-mails. Comparisons between these and the TOEIC data allow us to assess whether English learners are approximating native models and whether differences between native and non-native data can have negative consequences in the global workplace.

Rachele De Felice | Paul Deane | P. Deane | R. D. Felice | R. Felice

[1] William B. Stiles,et al. Describing talk : a taxonomy of verbal response modes , 1992 .

[2] Stephen G. Pulman,et al. Information Extraction and Machine Learning: Auto-Marking Short Free Text Responses to Science Questions , 2005, AIED.

[3] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .

[4] Mitchell P. Marcus,et al. Maximum entropy models for natural language ambiguity resolution , 1998 .

[5] Gabriele Kasper,et al. PRAGMATICS AND SLA , 1999, Annual Review of Applied Linguistics.

[6] Jack C. Richards,et al. SPEECH ACTS AND SECOND LANGUAGE LEARNING , 1980 .

[7] Donald E. Powers,et al. AUTOMATED SCORING OF SHORT‐ANSWER OPEN‐ENDED GRE® SUBJECT TEST ITEMS , 2008 .

[8] Martin Chodorow,et al. C-rater: Automated Scoring of Short-Answer Questions , 2003, Comput. Humanit..

[9] Z. Dörnyei,et al. Do Language Learners Recognize Pragmatic Violations? Pragmatic Versus Grammatical Awareness in Instructed L2 Learning. , 1998 .

[10] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL 2006.

[11] James R. Curran,et al. Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models , 2007, Computational Linguistics.

[12] Dale A. Koike. Pragmatic Competence and Adult L2 Acquisition: Speech Acts in Interlanguage , 1989 .

[13] Jack C. Richards,et al. Longman Dictionary of Language Teaching and Applied Linguistics , 1992 .

[14] Kasia M. Jaszczolt. Semantics and Pragmatics: Meaning in Language and Discourse , 2002 .

[15] Jacob Cohen,et al. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[16] Tom M. Mitchell,et al. Learning to Classify Email into “Speech Acts” , 2004, EMNLP.

[17] Nancy Ide. American National Corpus (ANC) , 2002 .

[18] Zoltán Dörnyei,et al. On the Teachability of Communication Strategies , 1995 .

[19] Johanna D. Moore,et al. Automatic annotation of context and speech acts for dialogue corpora , 2009, Natural Language Engineering.

[20] Ron Artstein,et al. Survey Article: Inter-Coder Agreement for Computational Linguistics , 2008, CL.

[21] Sigrun Biesenbach-Lucas,et al. Communication topics and strategies in e-mail consultation: Comparison between american and international university students , 2005 .

[22] J. Ardila. Meaning in Language. An Introduction to Semantics and Pragmatics , 2011 .

[23] G. Kasper. Four perspectives on L2 pragmatic development , 2001 .

[24] John Blitzer,et al. Intelligent email: reply and attachment prediction , 2008, IUI '08.

[25] G. Kasper. Pragmatic Comprehension in Learner-Native Speaker Discourse. , 1984 .

[26] M. Swain,et al. THEORETICAL BASES OF COMMUNICATIVE APPROACHES TO SECOND LANGUAGE TEACHING AND TESTING , 1980 .

[27] Jan Noyes,et al. Toward a stochastic speech act model of email behavior , 2008, CEAS.

[28] Jianfeng Gao,et al. Using Contextual Speller Techniques and Language Modeling for ESL Error Correction , 2008, IJCNLP.

[29] Jenny A. Thomas. Cross-Cultural Pragmatic Failure , 1983 .

[30] J. Sadock. Speech acts , 2007 .

[31] Cécile Paris,et al. The nature of requests and commitments in email messages , 2008, AAAI 2008.

[32] John A. Carroll,et al. Applied morphological processing of English , 2001, Natural Language Engineering.

[33] Johan Bos,et al. Linguistically Motivated Large-Scale NLP with C&C and Boxer , 2007, ACL.

[34] Rachele De Felice,et al. A Classifier-Based Approach to Preposition and Determiner Error Correction in L2 English , 2008, COLING.

[35] Jade Goldstein-Stewart,et al. Using Speech Acts to Categorize Email and Identify Email Genres , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[36] James R. Curran,et al. Language Independent NER using a Maximum Entropy Tagger , 2003, CoNLL.

[37] Penelope Brown,et al. Politeness: Some Universals in Language Usage , 1989 .

[38] Andrew D. Cohen,et al. Teaching and assessing L2 pragmatics: What can we expect from learners? , 2008, Language Teaching.

[39] A. Jaffe. Stance: Sociolinguistic Perspectives , 2009 .

[40] Kyu-Hwan Moon. Speech act study: Differences between native and nonnative speaker complaint strategies , 2002 .

[41] William W. Cohen,et al. On the collective classification of email "speech acts" , 2005, SIGIR '05.

[42] Andrew Lampert,et al. Can Requests-for-Action and Commitments-to-Act be Reliably Identified in Email Messages ? , 2007 .

[43] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL.

[44] Jacob L. Mey,et al. Between culture and pragmatics: Scylla and Charybdis? The precarious condition of intercultural pragmatics , 2004 .

[45] William W. Cohen,et al. Improving “Email Speech Acts” Analysis via N-gram Selection , 2006, HLT-NAACL 2006.

[46] 木村和夫. Pragmatics , 1997, Language Teaching.

[47] Yorick Wilks,et al. Routing email automatically by purpose not topic , 1999, Natural Language Engineering.

[48] Csr Young,et al. How to Do Things With Words , 2009 .

[49] Jana Z. Sukkarieh,et al. Leveraging C-Rater's Automated Scoring Capability for Providing Instructional Feedback for Short Constructed Responses , 2008, Intelligent Tutoring Systems.

[50] Michael Gamon,et al. Task-Focused Summarization of Email , 2004 .

[51] Douglas Biber,et al. Variation across speech and writing: Methodology , 1988 .

[52] John Blitzer,et al. Intelligent Email: Aiding Users with AI , 2008, AAAI.

[53] Martin Chodorow,et al. CriterionSM Online Essay Evaluation: An Application for Automated Evaluation of Student Essays , 2003, IAAI.

[54] Cécile Paris,et al. Requests and Commitments in Email are More Complex Than You Think: Eight Reasons to be Cautious , 2008, ALTA.

[55] Jill Burstein,et al. AUTOMATED ESSAY SCORING WITH E‐RATER® V.2.0 , 2004 .

[56] John Sie Yuen Lee. Automatic correction of grammatical errors in non-native English text , 2009 .

[57] Martin Chodorow,et al. The Ups and Downs of Preposition Error Detection in ESL Writing , 2008, COLING.

[58] Cécile Paris,et al. Classifying Speech Acts using Verbal Response Modes , 2006, ALTA.

[59] J. Searle,et al. Expression and Meaning. , 1982 .

[60] Stephen G. Pulman,et al. Automatic Short Answer Marking , 2005, ACL 2005.

[61] Claudia Leacock,et al. Proceedings of the Fourth Workshop on Innovative Use of NLP for Building Educational Applications , 2008 .

[62] R. Kaplan. CULTURAL THOUGHT PATTERNS IN INTER‐CULTURAL EDUCATION , 1966 .