An Automated Method of Topic-Coding Legislative Speech Over Time with Application to the 105th-108th U.S. Senate

In this paper, we describe a method for statistical learning from speech documents that we apply to the Congressional Record in order to gain new insight into the dynamics of the political agenda. Prior eorts to evaluate the attention of elected representatives across topic areas have been expensive manual coding exercises and are generally circumscribed along one or more features of detail: limited time periods, high levels of temporal aggregation, coarse topical categories, and so on. Conversely, the Congressional Record has scarcely been used for such analyses, largely because it contains too much information to absorb. We describe here a method for inferring, through the patterns of word choice in each speech and the dynamics of word choice patterns across time, (a) what the topics of speeches are, and (b) the probability that attention will be paid to any given topic or set of topics over time. We use the model to examine the agenda in the United States Senate from 1997-2004, a database of over 70 thousand documents containing over 70 million words. We estimate the model for 42 topics and provide evidence that we can reveal speech topics that are both distinctive and inter-related in substantively meaningful ways. We demonstrate further that the dynamics our model gives us leverage into important questions about the dynamics of the political agenda.

[1]  Theodore J. Lowi,et al.  American Business, Public Policy, Case-Studies, and Political Theory , 1964 .

[2]  [CALORIES IN RICE]. , 1964, The Journal of nutrition.

[3]  C. Sims Money, Income, and Causality , 1972 .

[4]  Aage R. Clausen How Congressmen Decide: A Policy Focus , 1973 .

[5]  David R. Mayhew Congress: The Electoral Connection , 1975 .

[6]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[7]  Anne Lohrli Chapman and Hall , 1985 .

[8]  W. Riker The art of political manipulation , 1987 .

[9]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[10]  M. West,et al.  Bayesian forecasting and dynamic models , 1989 .

[11]  K. T. Poole,et al.  Patterns of congressional voting , 1991 .

[12]  B. Jones,et al.  Agendas and instability in American politics , 1993 .

[13]  Philip A. Schrodt,et al.  MACHINE CODING OF EVENT DATA USING REGIONAL AND INTERNATIONAL SOURCES , 1994 .

[14]  Christopher Wlezien The Public as Thermostat: Dynamics of Preferences for Spending , 1995 .

[15]  B. Sinclair Legislators, Leaders, and Lawmaking: The U.S. House of Representatives in the Postreform Era , 1995 .

[16]  Richard L. Hall,et al.  Participation in Congress , 1996 .

[17]  Janet M. Box-Steffensmeier,et al.  The Strategic Timing of Position Taking in Congress: A Study of the North American Free Trade Agreement , 1997 .

[18]  P. Müller,et al.  Bayesian Forecasting of Multinomial Time Series through Conditionally Gaussian Dynamic Models , 1997 .

[19]  K. T. Poole,et al.  Congress: A Political-Economic History of Roll Call Voting , 1997 .

[20]  L. Rothenberg,et al.  Severing the Electoral Connection: Shirking in the Contemporary Congress , 2000 .

[21]  John D. Huber,et al.  Putting Parties in Their Place: Inferring Party Left-Right Ideological Positions from Party Manifestos Data , 2000 .

[22]  Gregory J. Wawro Legislative entrepreneurship in the U.S. House of Representatives , 2000 .

[23]  Matthew Potoski,et al.  The Dimensional Structure of Policy Outputs: Distributive Policy and Roll Call Voting , 2000 .

[24]  Robert Y. Shapiro,et al.  Politicians Don't Pander: Political Manipulation and the Loss of Democratic Responsiveness , 2000 .

[25]  M. Laver,et al.  Estimating policy positions from political texts , 2000 .

[26]  Hermann Schmitt,et al.  Dynamic Representation , 1995, American Political Science Review.

[27]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[28]  Simon Jackman,et al.  Multidimensional Analysis of Roll Call Data via Bayesian Simulation: Identification, Estimation, Inference, and Model Checking , 2001, Political Analysis.

[29]  I. Budge,et al.  Mapping Policy Preferences: Estimates for Parties, Electors, and Governments 1945-1998 , 2001 .

[30]  Barbara Ringer,et al.  The Role of Product Quality Information, Market State Information and Transaction Costs in Electronic Auctions , 2002 .

[31]  Andrew D. Martin,et al.  Dynamic Ideal Point Estimation via Markov Chain Monte Carlo for the U.S. Supreme Court, 1953–1999 , 2002, Political Analysis.

[32]  K. Hill,et al.  Symbolic Speeches in the U.S. Senate and Their Representational Implications , 2002, The Journal of Politics.

[33]  M. Laver,et al.  Extracting Policy Positions from Political Texts Using Words as Data , 2003, American Political Science Review.

[34]  Stenberg v. Carhart The Partial-Birth Abortion Ban Act of 2003. , 2003, Origins.

[35]  Adam Meirowitz,et al.  Integrating Voting Theory and Roll Call Analysis: A Framework , 2003, Political Analysis.

[36]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[37]  Bryan D. Jones,et al.  Representation and agenda-setting , 2004 .

[38]  Keith T. Poole,et al.  Spatial Models of Parliamentary Voting , 2005 .

[39]  Mathew D. McCubbins,et al.  Setting the Agenda: Author Index , 2005 .

[40]  N. Ringe Policy Preference Formation in Legislative Politics: Structures, Actors, and Focal Points , 2005 .

[41]  Bryan D. Jones,et al.  Comparative studies of policy agendas , 2006 .

[42]  Dustin Hillard,et al.  Automated classification of congressional legislation , 2006, DG.O.

[43]  The consequences of clout: Agenda control in U.S. legislatures , 2007 .

[44]  Aguilar Zuil,et al.  Compositional Time Series Analysis : A Review , 2007 .

[45]  Paul S. Martin Deliberative Choices: Debating Public Policy in Congress , 2007, Perspectives on Politics.

[46]  Lanny W. Martin,et al.  A Robust Transformation Procedure for Interpreting Political Text , 2007, Political Analysis.

[47]  Joost Berkhout,et al.  The Politics of Attention: How Government Prioritizes Problems , 2008 .