相关论文

Improving Ecological Inference by Predicting Individual Ethnicity from Voter Registration Records

Abstract:In both political behavior research and voting rights litigation, turnout and vote choice for different racial groups are often inferred using aggregate election results and racial composition. Over the past several decades, many statistical methods have been proposed to address this ecological inference problem. We propose an alternative method to reduce aggregation bias by predicting individual-level ethnicity from voter registration records. Building on the existing methodological literature, we use Bayes's rule to combine the Census Bureau's Surname List with various information from geocoded voter registration records. We evaluate the performance of the proposed methodology using approximately nine million voter registration records from Florida, where self-reported ethnicity is available. We find that it is possible to reduce the false positive rate among Black and Latino voters to 6% and 3%, respectively, while maintaining the true positive rate above 80%. Moreover, we use our predictions to estimate turnout by race and find that our estimates yields substantially less amounts of bias and root mean squared error than standard ecological inference estimates. We provide open-source software to implement the proposed methodology.

参考文献

[1]  J. Forster Ecological inference for 2 × 2 tables - Discussion , 2004 .

[2]  Leland Gerson Neuberg,et al.  A solution to the ecological inference problem: Reconstructing individual behavior from aggregate data , 1999 .

[3]  D. McCaffrey,et al.  Erratum to: Using the Census Bureau’s surname list to improve estimates of race/ethnicity and associated disparities , 2009, Health Services and Outcomes Research Methodology.

[4]  J. Hilbe Logistic Regression Models , 2009 .

[5]  Ying Lu,et al.  Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach , 2007, Political Analysis.

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  M. Elliott,et al.  A new method for estimating race/ethnicity and associated disparities where administrative records lack self-reported race/ethnicity. , 2008, Health services research.

[8]  E. Fieldhouse,et al.  Diversity, density and turnout: The effect of neighbourhood ethno-religious composition on voter turnout in Britain , 2008 .

[9]  Jon Wakefield,et al.  Ecological inference for 2 × 2 tables , 2004 .

[10]  M. Tanner,et al.  Ecological Inference: New Methodological Strategies , 2004 .

[11]  J. Wakefield Ecological inference for 2 × 2 tables (with discussion) , 2004 .

[12]  The Mobilizing Effect of Majority-Minority Districts , 2004 .

[13]  Melissa R. Michelson Getting Out the Latino Vote: How Door-to-Door Canvassing Influences Voter Turnout in Rural Central California , 2003 .

[14]  Kevin M. Quinn,et al.  Exit Polling and Racial Bloc Voting: Combining Individual-Level and R X C Ecological Data , 2010, 1101.0985.

[15]  L. A. Goodman Ecological Regressions and Behavior of Individuals , 1953 .

[16]  CLAUDINE GAY,et al.  The Effect of Black Congressional Representation on Political Participation , 2001, American Political Science Review.

[17]  John A. Henderson,et al.  Cause or Effect? Turnout in Hispanic Majority-Minority Districts , 2016, Political Analysis.

[18]  M. Barreto İSí Se Puede! Latino Candidates and the Mobilization of Latino Voters , 2007, American Political Science Review.

[19]  J. A. Harris,et al.  What's in a Name? A Method for Extracting Information about Ethnicity from Names , 2015 .

[20]  Kevin M. Quinn,et al.  R×C ecological inference: bounds, correlations, flexibility and transparency of assumptions , 2009 .

[21]  K. Fiscella,et al.  Use of geocoding and surname analysis to estimate race and ethnicity. , 2006, Health services research.

[22]  D. Greiner Ecological Inference in Voting Rights Act Disputes: Where are We Now, and Where Do We Want to Be? , 2007 .

[23]  D. McCaffrey,et al.  Using the Census Bureau’s surname list to improve estimates of race/ethnicity and associated disparities , 2009, Health Services and Outcomes Research Methodology.

[24]  Bernard L. Fraga Candidates or Districts? Reevaluating the Role of Race in Voter Turnout , 2016 .

[25]  Nathan D. Woods,et al.  The Mobilizing Effect of Majority–Minority Districts on Latino Turnout , 2004, American Political Science Review.

[26]  HajnalZoltan,et al.  Where Turnout Matters: The Consequences of Uneven Turnout in City Politics , 2014 .

[27]  Gary King,et al.  EI: A Program for Ecological Inference , 2004 .

[28]  Wendy K. Tam Cho,et al.  Residential Concentration, Political Socialization, and Voter Turnout , 2006, The Journal of Politics.

[29]  Stephen Ansolabehere,et al.  Gender, Race, Age and Voting: A Research Note , 2013 .

引用
Predicting Race and Ethnicity From the Sequence of Characters in a Name
1805.02109
2018
Assessing algorithmic fairness with unobserved protected class using data combination
FAT*
2019
Did the Black Panther Movie Make Blacks Blacker? Examining Black Racial Identity on Twitter Before and After the Black Panther Movie Release
SocInfo
2019
Risk of being killed by police use of force in the United States by age, race–ethnicity, and sex
Proceedings of the National Academy of Sciences
2019
Sensitive Survey Questions with Auxiliary Information
2020
Estimating Candidate Support in Voting Rights Act Cases: Comparing Iterative EI and EI-R×C Methods:
2019
Examining scientific writing styles from the perspective of linguistic complexity
J. Assoc. Inf. Sci. Technol.
2018
Tsallis Regularized Optimal Transport and Ecological Inference
AAAI
2016
Bayesian Inference for Optimal Transport with Stochastic Cost
ArXiv
2020
Fake news on Twitter during the 2016 U.S. presidential election
Science
2019
Analysis of ISCB honorees and keynotes reveals disparities
bioRxiv
2020
Fast and Flexible Inference of Joint Distributions from their Marginals
ICML
2019
Legislative Effectiveness in the American States
American Political Science Review
2024
The Effectiveness of a Neighbor-to-Neighbor Get-Out-the-Vote Program: Evidence from the 2017 Virginia State Elections
Journal of Experimental Political Science
2021
How Does Minority Political Representation Affect School District Administration and Student Outcomes?
2021
All-mail voting in Colorado increases turnout and reduces turnout inequality
Electoral studies
2021
The Democratic Deficit in U.S. Education Governance
American Political Science Review
2021
Machine Learning Predictions as Regression Covariates
Political Analysis
2020
Toward a More Just Feminism
2020
Racial and Gender Disparities among Evicted Americans
2020