Tweet Acts: A Speech Act Classifier for Twitter

Speech acts are a way to conceptualize speech as action. This holds true for communication on any platform, including social media platforms such as Twitter. In this paper, we explored speech act recognition on Twitter by treating it as a multi-class classification problem. We created a taxonomy of six speech acts for Twitter and proposed a set of semantic and syntactic features. We trained and tested a logistic regression classifier using a data set of manually labelled tweets. Our method achieved a state-of-the-art performance with an average F1 score of more than $0.70$. We also explored classifiers with three different granularities (Twitter-wide, type-specific and topic-specific) in order to find the right balance between generalization and overfitting for our task.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Soroush Vosoughi,et al.  A Human-Machine Collaborative System for Identifying Rumors on Twitter , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[3]  Jing Jiang,et al.  An Empirical Comparison of Topics in Twitter and Traditional Media , 2011 .

[4]  A. Koller,et al.  Speech Acts: An Essay in the Philosophy of Language , 1969 .

[5]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[6]  Brendan T. O'Connor,et al.  Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters , 2013, NAACL.

[7]  Csr Young,et al.  How to Do Things With Words , 2009 .

[8]  J. Searle Expression and Meaning: A taxonomy of illocutionary acts , 1975 .

[9]  Noah A. Smith,et al.  A Dependency Parser for Tweets , 2014, EMNLP.

[10]  A. Wierzbicka English Speech Act Verbs: A Semantic Dictionary , 1987 .

[11]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[12]  J. Austin How to do things with words , 1962 .

[13]  David Crystal,et al.  Language and the Internet , 2001 .

[14]  Deb Kay A Semi-Automatic Method for Efficient Detection of Stories on Social Media , 2016 .

[15]  Soroush Vosoughi,et al.  Automatic detection and verification of rumors on Twitter , 2015 .

[16]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[17]  Soroush Vosoughi,et al.  A Semi-Automatic Method for Efficient Detection of Stories on Social Media , 2016, ICWSM.

[18]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.