Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress

We introduce the Hierarchical Ideal Point Topic Model, which provides a rich picture of policy issues, framing, and voting behavior using a joint model of votes, bill text, and the language that legislators use when debating bills. We use this model to look at the relationship between Tea Party Republicans and “establishment” Republicans in the U.S. House of Representatives during the 112th Congress. 1 Capturing Political Polarization Ideal-point models are one of the most widely used tools in contemporary political science research (Poole and Rosenthal, 2007). These models estimate political preferences for legislators, known as their ideal points, from binary data such as legislative votes. Popular formulations analyze legislators’ votes and place them on a one-dimensional scale, most often interpreted as an ideological spectrum from liberal to conservative. Moving beyond a single dimension is attractive, however, since people may lean differently based on policy issues; for example, the conservative movement in the U.S. includes fiscal conservatives who are relatively liberal on social issues, and vice versa. In multi-dimensional ideal point models, therefore, the ideal point of each legislator is no longer characterized by a single number, but by a multi-dimensional vector. With that move comes a new challenge, though: the additional dimensions are often difficult to interpret. To mitigate this problem, recent research has introduced methods that estimate multi-dimensional ideal points using both voting data and the texts of the bills being voted on, e.g., using topic models and associating each dimension of the ideal point space with a topic. The words most strongly associated with the topic can sometimes provide a readable description of its corresponding dimension. In this paper, we develop this idea further by introducing HIPTM, the Hierarchical Ideal Point Topic Model, to estimate multi-dimensional ideal points for legislators in the U.S. Congress. HIPTM differs from previous models in three ways. First, HIPTM uses not only votes and associated bill text, but also the language of the legislators themselves; this allows predictions of ideal points from politicians’ writing alone. Second, HIPTM improves the interpretability of ideal-point dimensions by incorporating data from the Congressional Bills Project (Adler and Wilkerson, 2015), in which bills are labeled with major topics from the Policy Agendas Project Topic Codebook.1 And third, HIPTM discovers a hierarchy of topics, allowing us to analyze both agenda issues and issue-specific frames that legislators use on the congressional floor, following Nguyen et al. (2013) in modeling framing as second-level agenda setting (McCombs, 2005). Using this new model, we focus on Republican legislators during the 112th U.S. Congress, from January 2011 until January 2013. This is a particularly interesting session of Congress for political scientists, because of the rise of the Tea Party, a decentralized political movement with populist, libertarian, and conservative elements. Although united with “establishment” Republicans against Democrats in the 2010 midterm elections, leading to massive Democratic defeats, the Tea Party was—and still is—wrestling with establishment Republicans for control of the Republican party. The Tea Party is a new and complex phenomenon for political scientists; as Carmines and D’Amico (2015) observe: “Conventional views of ideology as a single-dimensional, left-right spectrum experience great difficulty in understanding or explaining the Tea Party.” Our model identifies legislators who have low (or high) levels of “Tea Partiness” but are (or are not) members of the Tea Party Caucus, and providing insights into the nahttp://www.policyagendas.org/

[1]  Simon Jackman,et al.  Multidimensional Analysis of Roll Call Data via Bayesian Simulation: Identification, Estimation, Inference, and Model Checking , 2001, Political Analysis.

[2]  J. Fox Bayesian Item Response Modeling: Theory and Applications , 2010 .

[3]  Christopher F. Karpowitz,et al.  Tea Time in America? The Impact of the Tea Party Movement on the 2010 Midterm Elections , 2011, PS: Political Science & Politics.

[4]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[5]  Noah Kaplan,et al.  Practical Issues in Implementing and Understanding Bayesian Ideal Point Estimation , 2005, Political Analysis.

[6]  Black Tea, Green Tea, White Tea, and Coffee: Understanding the Variation in Attachment to the Tea Party Among Members of Congress , 2014 .

[7]  Philip Resnik,et al.  Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling , 2014, EMNLP.

[8]  Yupeng Gu,et al.  Topic-factorized ideal point estimation model for legislative voting network , 2014, KDD.

[9]  John Coggin,et al.  The Tea Party and the Remaking of Republican Conservatism , 2011, Perspectives on Politics.

[10]  E. Carmines,et al.  The New Look in Political Ideology Research , 2015 .

[11]  Hanna Wallach,et al.  Structured Topic Models for Language , 2008 .

[12]  K. T. Poole,et al.  A Spatial Model for Legislative Roll Call Analysis , 1985 .

[13]  Sean Gerrish,et al.  Predicting Legislative Roll Calls from Text , 2011, ICML.

[14]  Bryan T. Gervais,et al.  Reading the Tea Leaves: Understanding Tea Party Caucus Membership in the US House of Representatives , 2012, PS: Political Science & Politics.

[15]  Matthew T. Harrison,et al.  A simple example of Dirichlet process mixture inconsistency for the number of components , 2013, NIPS.

[16]  Andrew D. Martin,et al.  Dynamic Ideal Point Estimation via Markov Chain Monte Carlo for the U.S. Supreme Court, 1953–1999 , 2002, Political Analysis.

[17]  Sean Gerrish,et al.  How They Vote: Issue-Adjusted Models of Legislative Behavior , 2012, NIPS.

[18]  Robert M. Entman,et al.  Framing: Toward Clarification of a Fractured Paradigm , 1993 .

[19]  Thang Nguyen,et al.  Is Your Anchor Going Up or Down? Fast and Accurate Supervised Topic Models , 2015, NAACL.

[20]  Viet-An Nguyen,et al.  Lexical and Hierarchical Topic Regression , 2013, NIPS.

[21]  M. McCombs A Look at Agenda-setting: past, present and future , 2005 .

[22]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[23]  Joshua D. Clinton,et al.  The Statistical Analysis of Roll Call Data , 2004, American Political Science Review.

[24]  Justin Grimmer,et al.  A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases , 2010, Political Analysis.

[25]  Chong Wang,et al.  Nested Hierarchical Dirichlet Processes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Alexander J. Smola,et al.  Nested Chinese Restaurant Franchise Process: Applications to User Tracking and Document Modeling , 2013, ICML.

[27]  Andrew D. Martin,et al.  Dynamic Ideal Point Estimation via Markov Chain Monte Carlo for the U.S. Supreme Court, 1953–1999 , 2002, Political Analysis.

[28]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[29]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[30]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[31]  L B Lusted,et al.  Signal detectability and medical decision-making. , 1971, Science.

[32]  Noah A. Smith,et al.  The Utility of Text: The Case of Amicus Briefs and the Supreme Court , 2014, AAAI.

[33]  Tom S. Clark,et al.  Scaling Politically Meaningful Dimensions Using Texts and Votes , 2014 .

[34]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[35]  Philip J. Cowans Probabilistic Document Modelling , 2006 .

[36]  J. Heckman,et al.  Linear Probability Models of the Demand for Attributes with an Empirical Application to Estimating the Preferences of Legislators , 1996 .