Do U.S. Regulators Listen to the Public? Testing the Regulatory Process with the RegRank Algorithm

We propose a tool called RegRank that can be used to measure and test whether government regulatory agencies adjust aspects of final rules in response to comments received from the public. The algorithm, which combines customized dictionaries with LDA topic models, is used to analyze the text of public rulemaking documents of the Commodity Futures Trading Commission (CFTC) - a federal regulatory agency in charge of implementing parts of the Dodd-Frank Wall Street Reform and Consumer Protection Act. A key finding based on the available data is that the government adjusts its final rules in the direction of public comments.

[1]  Darrell Laham,et al.  From paragraph to graph: Latent semantic analysis for information visualization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Kurt Hornik,et al.  Text Mining Infrastructure in R , 2008 .

[3]  Michael I. Jordan,et al.  Variational inference for Dirichlet process mixtures , 2006 .

[4]  Marissa Martino Golden,et al.  Interest Groups in the Rule-Making Process: Who Participates? Whose Voices Get Heard? , 1998 .

[5]  William F. West Administrative Rulemaking: An Old and Emerging Literature , 2005 .

[6]  Yee Whye Teh,et al.  A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation , 2006, NIPS.

[7]  Mariano-Florentino Cuéllar,et al.  Rethinking Regulatory Democracy , 2004 .

[8]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[9]  S. Shapiro Presidents and Process: A Comparison of the Regulatory Process Under the Clinton and Bush (43) Administrations , 2006 .

[10]  R. Flesch A new readability yardstick. , 1948, The Journal of applied psychology.

[11]  Max Welling,et al.  Fast collapsed gibbs sampling for latent dirichlet allocation , 2008, KDD.

[12]  Wendy E. Wagner,et al.  Rulemaking in the Shade: An Empirical Study of EPA's Air Toxic Emission Standards , 2011 .

[13]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[14]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[15]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[16]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[17]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[18]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[19]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[20]  Peter E. Kennedy A Guide to Econometrics , 1979 .