Representativeness in Statistics, Politics, and Machine Learning

Representativeness is a foundational yet slippery concept. Though familiar at first blush, it lacks a single precise meaning. Instead, meanings range from typical or characteristic, to a proportionate match between sample and population, to a more general sense of accuracy, generalizability, coverage, or inclusiveness. Moreover, the concept has long been contested. In statistics, debates about the merits and methods of selecting a representative sample date back to the late 19th century; in politics, debates about the value of likeness as a logic of political representation are older still. Today, as the concept crops up in the study of fairness and accountability in machine learning, we need to carefully consider the term's meanings in order to communicate clearly and account for their normative implications. In this paper, we ask what representativeness means, how it is mobilized socially, and what values and ideals it communicates or confronts. We trace the concept's history in statistics and discuss normative tensions concerning its relationship to likeness, exclusion, authority, and aspiration. We draw on these analyses to think through how representativeness is used in FAccT debates, with emphasis on data, shift, participation, and power.

[1]  Robert D. Tortora,et al.  Sampling: Design and Analysis , 2000 .

[2]  L. Goddard First Course , 1969, Nature.

[3]  J. Bolker,et al.  Model systems in developmental biology , 1995, BioEssays : news and reviews in molecular, cellular and developmental biology.

[4]  Harini Suresh,et al.  A Framework for Understanding Unintended Consequences of Machine Learning , 2019, ArXiv.

[5]  H. Lüders,et al.  Comments , 2002, Clinical Neurophysiology.

[6]  A. Chouldechova,et al.  Toward Algorithmic Accountability in Public Services: A Qualitative Study of Affected Community Perspectives on Algorithmic Decision-making in Child Welfare Services , 2019, CHI.

[7]  Thomas L. Griffiths,et al.  Testing a Bayesian Measure of Representativeness Using a Large Image Database , 2011, NIPS.

[8]  Frederick Mosteller,et al.  Representative Sampling, III: The Current Statistical Literature , 1979 .

[9]  J Mark Elwood,et al.  Commentary: On representativeness. , 2013, International journal of epidemiology.

[10]  Stephan Günnemann,et al.  Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift , 2018, NeurIPS.

[11]  Kendra Albert,et al.  Ethical Testing in the Real World: Evaluating Physical Testing of Adversarial Machine Learning , 2020, ArXiv.

[12]  Danah Boyd,et al.  Fairness and Abstraction in Sociotechnical Systems , 2019, FAT.

[13]  Barbara Kiviat The Moral Limits of Predictive Practices: The Case of Credit-Based Insurance Scores , 2019, American Sociological Review.

[14]  Frederick Mosteller,et al.  Representative Sampling, IV: The History of the Concept in Statistics, 1895-1939 , 1980 .

[15]  Zachary C. Lipton,et al.  Troubling Trends in Machine Learning Scholarship , 2018, ACM Queue.

[16]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[17]  J. Overhage,et al.  Sorting Things Out: Classification and Its Consequences , 2001, Annals of Internal Medicine.

[18]  Amandalynne Paullada,et al.  Data and its (dis)contents: A survey of dataset development and use in machine learning research , 2020, Patterns.

[19]  Emily Denton,et al.  Bringing the People Back In: Contesting Benchmark Machine Learning Datasets , 2020, ArXiv.

[20]  S. Harding Rethinking Standpoint Epistemology : What is « Strong Objectivity? » , 1992 .

[21]  Henry E. Garrett The Representativeness of a Sample , 1942 .

[22]  Alexandra Chouldechova,et al.  The Frontiers of Fairness in Machine Learning , 2018, ArXiv.

[23]  Anne Phillips,et al.  The Politics of Presence , 1995 .

[24]  S. Dawson Keywords: a Vocabulary of Culture and Society , 1976 .

[25]  Jørn Olsen,et al.  Commentary: Epidemiologists have debated representativeness for more than 40 years--has the time come to move on? , 2013, International journal of epidemiology.

[26]  Andrew Smart,et al.  Participatory Problem Formulation for Fairer Machine Learning Through Community Based System Dynamics , 2020, ArXiv.

[27]  F. F. Stephan History of the Uses of Modern Sampling Procedures , 1948 .

[28]  T. Porter The Rise of Statistical Thinking, 1820-1900 , 2020 .

[29]  Frederick Mosteller,et al.  Representative Sampling, II: Scientific Literature, Excluding Statistics , 1979 .

[30]  G. U. Y.,et al.  A First Course in Statistics , 1922, Nature.

[31]  H. Pitkin The Concept of Representation , 1969 .

[32]  A. Desrosières,et al.  How to Make Things Which Hold Together: Social Science, Statistics and the State , 1990 .

[33]  Q. Mcnemar,et al.  Sampling in psychological research. , 1940 .

[34]  D. R. Bellhouse,et al.  1 A brief history of random sampling methods , 1988 .

[35]  Nisheeth K. Vishnoi,et al.  Fair and Diverse DPP-based Data Summarization , 2018, ICML.

[36]  Clark Wissler,et al.  Middletown : a study in contemporary American culture , 1929 .

[37]  J. Neyman On the Two Different Aspects of the Representative Method: the Method of Stratified Sampling and the Method of Purposive Selection , 1934 .

[38]  E. Herman :The Averaged American: Surveys, Citizens, and the Making of a Mass Public , 2008 .

[39]  Y. Seng Historical Survey of the Development of Sampling Theories and Practice , 1951 .

[40]  A. Tversky,et al.  Judgments of and by Representativeness , 1981 .

[41]  Frederick Mosteller,et al.  Representative Sampling, I: Non-Scientific Literature , 1979 .

[42]  Scott M. Williams,et al.  The Missing Diversity in Human Genetic Studies , 2019, Cell.

[43]  Adventures in Risk: Predicting Violent and Sexual Recidivism in Sentencing Law , 2014 .

[44]  R. Shweder,et al.  Ethnography and Human Development: Context and Meaning in Social Inquiry. , 1998 .

[45]  Matthew J. Salganik,et al.  Bit by bit: social research in the digital age , 2019, The Journal of mathematical sociology.

[46]  J. Kleinberg,et al.  Roles for computing in social change , 2019, FAT*.

[47]  Lael R. Keiser,et al.  Representative Bureaucracy and Attitudes Toward Automated Decision Making , 2020 .

[48]  John Beatty,et al.  The Empire of Chance: How Probability Changed Science and Everyday Life , 1989 .

[49]  E. Rosch,et al.  Categorization of Natural Objects , 1981 .

[50]  Anna Lauren Hoffmann,et al.  Terms of inclusion: Data, discourse, violence , 2020, New Media Soc..

[51]  Jon M. Kleinberg,et al.  Discrimination in the Age of Algorithms , 2018, SSRN Electronic Journal.

[52]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[53]  M. Couper Is the sky falling? new technology, changing media, and the future of surveys , 2013 .

[54]  A. L. Bowley Address to the Economic Science and Statistics Section of the British Association for the Advancement of Science, York, 1906 , 1906 .

[55]  D. G. Robinson,et al.  Danger Ahead: Risk Assessment and the Future of Bail Reform , 2018 .

[56]  Solon Barocas,et al.  Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions , 2018, 1811.07867.

[57]  D. Steinsaltz,et al.  Erosion of representativeness in a cohort study , 2020 .

[58]  L. Daston,et al.  The Image of Objectivity , 1992 .

[59]  Mary Dunnewold,et al.  How Many Cases Do I Need , 2001 .

[60]  Rico Neumann,et al.  Obfuscation: A user’s guide for privacy and protest , 2017, New Media Soc..

[61]  Frederick F. Stephan Stratification in Representative Sampling* , 1941 .

[62]  S. Merz Race after technology. Abolitionist tools for the new Jim Code , 2020, Ethnic and Racial Studies.

[63]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[64]  William Kruskal LANGUAGE: Formulas, Numbers, Words: Statistics in Prose , 2016 .

[65]  Bernard Manin,et al.  The principles of representative government , 1995 .

[66]  L. Schweber Disciplining Statistics: Demography and Vital Statistics in France and England, 1830–1885 , 2006 .

[67]  Archon Fung,et al.  Survey Article: Recipes for Public Spheres: Eight Institutional Design Choices and Their Consequences* , 2003 .

[68]  James J. Little,et al.  Does Your Model Know the Digit 6 Is Not a Cat? A Less Biased Evaluation of "Outlier" Detectors , 2018, ArXiv.

[69]  Emily Denton,et al.  Towards a critical race methodology in algorithmic fairness , 2019, FAT*.

[70]  Solon Barocas,et al.  Privacy Dependencies , 2020 .

[71]  Shurong Zheng,et al.  FUTURE OF STATISTICS , 2009 .

[72]  Inioluwa Deborah Raji,et al.  Actionable Auditing: Investigating the Impact of Publicly Naming Biased Performance Results of Commercial AI Products , 2019, AIES.

[73]  Ronald K. Mitchell,et al.  Toward a Theory of Stakeholder Identification and Salience: Defining the Principle of who and What Really Counts , 1997 .

[74]  Laura Forlano,et al.  Participation Is not a Design Fix for Machine Learning , 2020, EAAMO.

[75]  J. O'Neill Representing People, Representing Nature, Representing the World , 2001 .

[76]  H. Link The pulse of democracy: The public opinion poll and how it works. , 1941 .

[77]  Kevin D. Haggerty Methodology as a Knife Fight: The Process, Politics and Paradox of Evaluating Surveillance , 2009 .

[78]  Sean A. Munson,et al.  Unequal Representation and Gender Stereotypes in Image Search Results for Occupations , 2015, CHI.

[79]  Karthik Dinakar,et al.  Studying up: reorienting the study of algorithmic fairness around issues of power , 2020, FAT*.

[80]  T. Porter,et al.  Trust in Numbers , 2020 .

[81]  Lily Hu,et al.  What's sex got to do with machine learning? , 2020, FAT*.

[82]  W. Firestone Alternative Arguments for Generalizing From Data as Applied to Qualitative Research , 1993 .

[83]  Sasha Costanza-Chock,et al.  Design Justice , 2020 .

[84]  Marvin Minsky,et al.  The emotion machine: from pain to suffering , 1999, Creativity & Cognition.

[85]  A. L. Sir Bowley An elementary manual of statistics , 1910 .

[86]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[87]  Dan Bouk,et al.  How Our Days Became Numbered: Risk and the Rise of the Statistical Individual , 2015 .

[88]  Marcel Jirina,et al.  Selecting Representative Data Sets , 2012 .

[89]  P. Wolfe,et al.  The future of statistics and data science , 2018 .

[90]  David J. Hand,et al.  Classifier Technology and the Illusion of Progress , 2006, math/0606441.

[91]  J. A. Becker,et al.  Developments in Crop and Livestock Reporting since 1920 , 1939 .

[92]  Nathan Ensmenger,et al.  Is chess the drosophila of artificial intelligence? A social history of an algorithm , 2012, Social studies of science.

[93]  Gerd Gigerenzer,et al.  The empire of chance , 1989 .

[94]  Feiyang Sun,et al.  Beyond Open vs. Closed: Balancing Individual Privacy and Public Accountability in Data Sharing , 2019, FAT.

[95]  J. Teugels The Politics of Large Numbers: a History of Statistical Reasoning , 2003 .

[96]  Huan Liu,et al.  When is it biased?: assessing the representativeness of twitter's streaming API , 2014, WWW.

[97]  Jure Leskovec,et al.  WILDS: A Benchmark of in-the-Wild Distribution Shifts , 2021, ICML.

[98]  Cyrus Samii,et al.  Does Regression Produce Representative Estimates of Causal Effects? , 2015 .

[99]  E. Didier America by the Numbers , 2020 .

[100]  Natalia Kovalyova,et al.  Data feminism , 2020, Information, Communication & Society.

[101]  Alexandre de Paiva Rio Camargo The sociology of statistics: the possibilities of a new field of investigation. , 2009, Historia, ciencias, saude--Manguinhos.

[102]  Zeynep Tufekci,et al.  Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls , 2014, ICWSM.

[103]  Eric Gossett,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2015 .

[104]  J. Borges The Book of Sand , 1975 .

[105]  Tanja Aitamurto,et al.  Crowdsourced Deliberation: The Case of the Law on Off-Road Traffic in Finland , 2016 .

[106]  C Bain,et al.  Threats to Applicability of Randomised Trials: Exclusions and Selective Participation , 1999, Journal of health services research & policy.

[107]  J. Gallacher,et al.  Why representativeness should be avoided. , 2013, International journal of epidemiology.

[108]  Ariel D. Procaccia,et al.  WeBuildAI , 2019, Proc. ACM Hum. Comput. Interact..

[109]  Neil D. Lawrence,et al.  When Training and Test Sets Are Different: Characterizing Learning Transfer , 2009 .

[110]  Shah Ebrahim,et al.  Commentary: Should we always deliberately be non-representative? , 2013, International journal of epidemiology.

[111]  D. Sculley,et al.  No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World , 2017, 1711.08536.

[112]  Neil Pearce,et al.  Commentary: Representativeness is usually not necessary and often should be avoided. , 2013, International journal of epidemiology.

[113]  Anthony K. H. Tung,et al.  Finding representative set from massive data , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[114]  Jonas Lerman,et al.  Big Data and Its Exclusions , 2013 .

[115]  J. Fisher,et al.  Challenging assumptions about minority participation in US clinical research. , 2011, American journal of public health.

[116]  J. Tenenbaum,et al.  The Rational Basis of Representativeness , 2001 .

[117]  L. Nader Up the Anthropologist: Perspectives Gained From Studying Up. , 1972 .