Topic models meet discourse analysis: a quantitative tool for a qualitative approach

ABSTRACT Quantitative text analysis tools have become increasingly popular for operationalizing various types of discourse analysis. However, their application usually remains fairly simple and superficial and fails to exploit the full resources that the digital era holds for discourse analysis. This paper discusses the discourse-analytic potential of a more complex and advanced text analysis tool that is already frequently employed in other approaches to textual analysis, namely topic modelling. We argue that topic modelling promises advances in areas where discourse analysis has traditionally struggled, such as scaling, repetition, and systematization, and that these advances go beyond the contributions of simpler frequency and collocation counts. At the same time, we demonstrate that it does not violate the epistemological premises and methodological ethos of even the more radical theories of discourse. Finally, we present two small case studies to show how topic modelling, when used with appropriate parameters, can straightforwardly enhance our ability to systematically investigate and interpret discourses in large collections of text.

Abbreviations: CDA: Critical Discourse Analysis; LDA: Latent Dirichlet Allocation
