Studying Political Decision Making With Automatic Text Analysis

Analyzing political text can answer many pressing questions in political science, from understanding political ideology to mapping the effects of censorship in authoritarian states. This makes the study of political text and speech an important part of the political science methodological toolbox. The confluence of increasing availability of large digital text collections, plentiful computational power, and methodological innovations has lead to many researchers adopting techniques of automatic text analysis for coding and analyzing textual data. In what is sometimes termed the “text as data” approach, texts are converted to a numerical representation, and various statistical techniques such as dictionary analysis, automatic scaling, topic modeling, and machine learning are used to find patterns in and test hypotheses

[1]  John D. Lafferty,et al.  Correlated Topic Models , 2005, NIPS.

[2]  Claudio Cioffi-Revilla,et al.  Computational social science , 2010 .

[3]  Joshua A. Tucker,et al.  Liberal and Conservative Values: What We Can Learn From Congressional Tweets , 2018 .

[4]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.

[5]  Mona T. Diab,et al.  Second Generation AMIRA Tools for Arabic Processing : Fast and Robust Tokenization , POS tagging , and Base Phrase Chunking , 2009 .

[6]  Nicholas A. Valentino,et al.  The Changing Norms of Racial Political Rhetoric and the End of Racial Priming , 2017, The Journal of Politics.

[7]  M. Debus Party Competition and Government Formation in Multilevel Settings: Evidence from Germany 1 , 2008, Government and Opposition.

[8]  Meng Zhang,et al.  Neural Network Methods for Natural Language Processing , 2017, Computational Linguistics.

[9]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[10]  Michael Gamon,et al.  Customizing Sentiment Classifiers to New Domains: a Case Study , 2019 .

[11]  Veselin Stoyanov,et al.  Evaluation Measures for the SemEval-2016 Task 4 “Sentiment Analysis in Twitter” (Draft: Version 1.13) , 2016 .

[12]  Kenneth Benoit,et al.  Compared to What? A Comment on “A Robust Transformation Procedure for Interpreting Political Text” by Martin and Vanberg , 2008 .

[13]  Carina Jacobi,et al.  Quantitative analysis of large amounts of journalistic texts using topic modelling , 2016, Rethinking Research Methods in an Age of Digital Journalism.

[14]  E-Step Structural Topic Models for Open Ended Survey Responses , 2022 .

[15]  Sabine C. Carey,et al.  Canaries in a coal-mine? What the killings of journalists tell us about future repression , 2017, Journal of peace research.

[16]  Vito D'Orazio,et al.  Separating the Wheat from the Chaff: Applications of Automated Document Classification Using Support Vector Machines , 2014, Political Analysis.

[17]  H. Klüver,et al.  Coalition Governments and Party Competition: Political Communication Strategies of Coalition Parties* , 2015, Political Science Research and Methods.

[18]  M. Laver,et al.  Policy competition in the 2002 French legislative and presidential elections , 2006 .

[19]  James H. Martin,et al.  Speech and Language Processing, 2nd Edition , 2008 .

[20]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[21]  Damian Trilling,et al.  Taking Stock of the Toolkit , 2016, Rethinking Research Methods in an Age of Digital Journalism.

[22]  D. Boyd,et al.  CRITICAL QUESTIONS FOR BIG DATA , 2012 .

[23]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[24]  Dragomir R. Radev,et al.  How to Analyze Political Attention with Minimal Assumptions and Costs , 2010 .

[25]  Emily M. Bender Book Reviews: Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax by Emily M. Bender , 2013, CL.

[26]  Philip A. Schrodt,et al.  Comparing Methods for Generating Large Scale Political Event Data Sets , 2015 .

[27]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[28]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[29]  Wouter van Atteveldt,et al.  When Communication Meets Computation: Opportunities, Challenges, and Pitfalls in Computational Communication Science , 2018 .

[30]  M. Laver,et al.  Extracting Policy Positions from Political Texts Using Words as Data , 2003, American Political Science Review.

[31]  Arthur Spirling,et al.  Text Preprocessing For Unsupervised Learning: Why It Matters, When It Misleads, And What To Do About It , 2017, Political Analysis.

[32]  P. Wilde,et al.  The ParlSpeech data set: Annotated full-text vectors of 3.9 million plenary speeches in the key legislative chambers of seven European states , 2017 .

[33]  Benjamin E. Lauderdale,et al.  Crowd-sourced Text Analysis: Reproducible and Agile Production of Political Data , 2016, American Political Science Review.

[34]  Thomas L. Griffiths,et al.  Hierarchical Topic Models and the Nested Chinese Restaurant Process , 2003, NIPS.

[35]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Will Lowe,et al.  A textual Taylor rule: estimating central bank preferences combining topic and scaling methods , 2015, Political Science Research and Methods.

[37]  Stuart Soroka,et al.  Affective News: The Automated Coding of Sentiment in Political Texts , 2012 .

[38]  I. Budge,et al.  Mapping Policy Preferences: Estimates for Parties, Electors, and Governments 1945-1998 , 2001 .

[39]  D. Sculley,et al.  Online Active Learning Methods for Fast Label-Efficient Spam Filtering , 2007, CEAS.

[40]  Philip Resnik,et al.  Political Ideology Detection Using Recursive Neural Networks , 2014, ACL.

[41]  Hanna Wallach Computational Social Science: Toward a Collaborative Future , 2016 .

[42]  F. Baumgartner,et al.  Measuring the Media Agenda , 2014 .

[43]  Derek Greene,et al.  Exploring the Political Agenda of the European Parliament Using a Dynamic Topic Modeling Approach , 2016, Political Analysis.

[44]  Andrea Ceron,et al.  Brave rebels stay home , 2015 .

[45]  L. Hooghe,et al.  Measuring party positions in Europe , 2015 .

[46]  Stuart Soroka,et al.  Bad News or Mad News? Sentiment Scoring of Negativity, Fear, and Anger in News Content , 2015 .

[47]  Anita R. Gohdes Pulling the plug , 2015 .

[48]  A. Pentland,et al.  Life in the network: The coming age of computational social science: Science , 2009 .

[49]  Gijs Schumacher,et al.  EUSpeech: a New Dataset of EU Elite Speeches , 2016 .

[50]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[51]  Georgios Paltoglou,et al.  Signals of Public Opinion in Online Communication , 2015 .

[52]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[53]  Claire Cardie,et al.  Text Annotation for Political Science Research , 2008 .

[54]  Mika V. Mäntylä,et al.  The evolution of sentiment analysis - A review of research topics, venues, and top cited papers , 2016, Comput. Sci. Rev..

[55]  Sujin Choi,et al.  The “Deliberative Digital Divide:” Opinion Leadership and Integrative Complexity in the U.S. Political Blogosphere , 2014 .

[56]  Claes H. de Vreese,et al.  Using Supervised Machine Learning to Code Policy Issues , 2015 .

[57]  Joan Bruna,et al.  Intriguing properties of neural networks , 2013, ICLR.

[58]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[59]  Chunyu Kit,et al.  Tokenization as the Initial Phase in NLP , 1992, COLING.

[60]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[61]  Malisa Zobel,et al.  Positions and saliency of immigration in party manifestos: A novel dataset using crowd coding , 2018 .

[62]  Tom Louwerse Mapping Policy Preferences II: Estimates for Parties, Electors, and Governments in Eastern Europe, European Union and OECD 1990–2003 , 2009 .

[63]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[64]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[65]  W. Lowe,et al.  Understanding Wordscores , 2008, Political Analysis.

[66]  Lindsay S. Hahn,et al.  Extracting Latent Moral Information from Text Narratives: Relevance, Challenges, and Solutions , 2018 .

[67]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[68]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[69]  Philip J. Stone,et al.  The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information , 2007 .

[70]  Daniel Bischof,et al.  Simple politics for the people? Complexity in campaign messages and political knowledge , 2018 .

[71]  Christopher Zorn,et al.  Corpus-based dictionaries for sentiment analysis of specialized vocabularies , 2019, Political Science Research and Methods.

[72]  Philip A. Schrodt,et al.  Validity Assessment of a Machine-Coded Event Data Set for the Middle East, 1982-92 , 1994 .

[73]  Raphael H. Heiberger,et al.  Installing computational social science: Facing the challenges of new information and communication technologies in social science , 2016 .

[74]  Eric P. Xing,et al.  Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2014, ACL 2014.

[75]  Zoltán Fazekas,et al.  The Nuts and Bolts of Automated Text Analysis. Comparing Different Document Pre-Processing Techniques in Four Countries , 2016 .

[76]  N. Baerg,et al.  Explaining Instability in the Stability and Growth Pact , 2016 .

[77]  Dustin Hillard,et al.  Computer-Assisted Topic Classification for Mixed-Methods Social Science Research , 2008 .

[78]  John D. Wilkerson,et al.  Large-Scale Computerized Text Analysis in Political Science: Opportunities and Challenges , 2017 .

[79]  Nada Lavrac,et al.  Stream-based active learning for sentiment analysis in the financial domain , 2014, Inf. Sci..

[80]  Jonathan B. Slapin,et al.  Position Taking in European Parliament Speeches , 2010 .

[81]  Anita R. Gohdes Studying the Internet and Violent conflict , 2018 .

[82]  B. Vis,et al.  Living in the Past or Living in the Future? Analyzing Parties’ Platform Change In Between Elections,The Netherlands 1997–2014 , 2018 .

[83]  Jonathan Nagler,et al.  Methodological Challenges in Estimating Tone: Application to News Coverage of the U.S. Economy , 2016 .

[84]  Margaret E. Roberts Censored: Distraction and Diversion Inside China's Great Firewall , 2018 .

[85]  Tamir Sheafer,et al.  How Politicians’ Attitudes and Goals Moderate Political Agenda Setting by the Media , 2017 .

[86]  Philip A. Schrodt,et al.  Introduction to the Special Issue: The Statistical Analysis of Political Text , 2008, Political Analysis.

[87]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[88]  Hadley Wickham,et al.  R for Data Science: Import, Tidy, Transform, Visualize, and Model Data , 2014 .

[89]  Mirya R. Holman,et al.  Climate change communication from cities in the USA , 2018, Climatic Change.

[90]  G. Jacobson How Do Campaigns Matter , 2015 .

[91]  Загоровская Ольга Владимировна,et al.  Исследование влияния пола и психологических характеристик автора на количественные параметры его текста с использованием программы Linguistic Inquiry and Word Count , 2015 .

[92]  C. Reinemann,et al.  Populism and the media: cross-national findings and perspectives , 2016 .

[93]  Silke Adam,et al.  Applying LDA Topic Modeling in Communication Research: Toward a Valid and Reliable Methodology , 2018 .

[94]  Sven-Oliver Proksch,et al.  A Scaling Model for Estimating Time-Series Party Positions from Texts , 2007 .

[95]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.

[96]  Gregor Wiedemann,et al.  Proportional Classification Revisited: Automatic Content Analysis of Political Manifestos Using Active Learning , 2019 .

[97]  Constantine Boussalis,et al.  Text-mining the signals of climate change doubt , 2016 .

[98]  Armèn Hakhverdian Capturing Government Policy on the Left-Right Scale: Evidence from the United Kingdom, 1956-2006 , 2009 .

[99]  Zachary Greene,et al.  Leadership Competition and Disagreement at Party National Congresses , 2014, British Journal of Political Science.

[100]  Constantine Boussalis,et al.  Foreign Assistance and the Struggle Against HIV/AIDS in the Developing World , 2010 .

[101]  Kenneth Benoit,et al.  Text Analysis in R , 2017 .

[102]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[103]  Justin Grimmer,et al.  A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases , 2010, Political Analysis.

[104]  Kenneth Benoit,et al.  Validating Estimates of Latent Traits from Textual Data Using Human Judgment as a Benchmark , 2012, Political Analysis.

[105]  Anita R. Gohdes Pulling the Plug: Network Disruptions and Violence in Civil Conflict , 2014 .

[106]  Gary King,et al.  Automating Open Science for Big Data , 2015 .