Tracking Amendments to Legislation and Other Political Texts with a Novel Minimum-Edit-Distance Algorithm: DocuToads

Political scientists often find themselves tracking amendments to political texts. As different actors weigh in, texts change as they are drafted and redrafted, reflecting political preferences and power. This study provides a novel solution to the prob- lem of detecting amendments to political text based upon minimum edit distances. We demonstrate the usefulness of two language-insensitive, transparent, and efficient minimum-edit-distance algorithms suited for the task. These algorithms are capable of providing an account of the types (insertions, deletions, substitutions, and trans- positions) and substantive amount of amendments made between version of texts. To illustrate the usefulness and efficiency of the approach we replicate two existing stud- ies from the field of legislative studies. Our results demonstrate that minimum edit distance methods can produce superior measures of text amendments to hand-coded efforts in a fraction of the time and resource costs.

[1]  John Nerbonne,et al.  Measuring Dialect Distance Phonetically , 1997, SIGMORPHON@EACL.

[2]  A. Kreppel,et al.  Legislative Procedures in the European Union: An Empirical Analysis , 2001, British Journal of Political Science.

[3]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[4]  Brett Kessler,et al.  Computational dialectology in Irish Gaelic , 1995, EACL.

[5]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[6]  J. Cross,et al.  Openness and censorship in the European Union: An interrupted time series analysis , 2015 .

[7]  Lanny W. Martin,et al.  Coalition Policymaking and Legislative Review , 2005, American Political Science Review.

[8]  Lanny W. Martin,et al.  Parliaments and Coalitions: The Role of Legislative Institutions in Multiparty Governance , 2011 .

[9]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[10]  J. Cross The seen and the unseen in legislative politics: explaining censorship in the Council of Ministers of the European Union , 2014 .

[11]  Jonathan B. Slapin,et al.  Position Taking in European Parliament Speeches , 2010 .

[12]  F. Franchino,et al.  Explaining negotiations in the conciliation committee , 2013 .

[13]  J. Cross Striking a pose: Transparency and position taking in the Council of the European Union , 2013 .

[14]  Justin Zobel,et al.  Methods for Identifying Versioned and Plagiarized Documents , 2003, J. Assoc. Inf. Sci. Technol..

[15]  A. Héritier,et al.  Interorganizational Negotiation and Intraorganizational Power in Shared Decision Making , 2004 .

[16]  Jonathan B. Slapin,et al.  Party System Dynamics in Post-war Japan: A quantitative content analysis of electoral pledges , 2011 .

[17]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[18]  M. O. Dayhoff,et al.  22 A Model of Evolutionary Change in Proteins , 1978 .

[19]  Bringing people with us: legislative writing as political rhetoric , 2012 .

[20]  Helen Wallace,et al.  The Council of Ministers of the European Union , 1997 .

[21]  Sven-Oliver Proksch,et al.  Look who’s talking: Parliamentary debate in the European Union , 2010 .

[22]  Robert A. Wagner,et al.  Order-n correction for regular languages , 1974, CACM.

[23]  Martin J. Medhurst,et al.  Presidential Speechwriting: From the New Deal to the Reagan Revolution and Beyond , 2004 .

[24]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[25]  Justin S. Vaughn,et al.  Conceptualizing and Measuring White House Staff Influence on Presidential Rhetoric , 2006 .

[26]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[27]  Sven-Oliver Proksch,et al.  A Scaling Model for Estimating Time-Series Party Positions from Texts , 2007 .

[28]  Lanny W. Martin,et al.  Policing the Bargain: Coalition Government and Parliamentary Scrutiny , 2004 .

[29]  Chak-Kuen Wong,et al.  Bounds for the String Editing Problem , 1976, JACM.

[30]  Brian Dille ThePrepared and Spontaneous Remarks of Presidents Reagan and Bush: A Validity Comparison for At-a-Distance Measurements , 2000 .

[31]  William F. Smyth,et al.  Computing Patterns in Strings , 2003 .