The production of prediction: What does machine learning want?

Retail, media, finance, science, industry, security and government increasingly depend on predictions produced through techniques such as machine learning. How is it that machine learning can promise to predict with great specificity what differences matter or what people want in many different settings? We need, I suggest, an account of its generalization if we are to understand the contemporary production of prediction. This article maps the principal forms of material action, narrative and problematization that run across algorithmic modelling techniques such as logistic regression, decision trees and Naive Bayes classifiers. It highlights several interlinked modes of generalization that engender increasingly vast data infrastructures and platforms, and intensified mathematical and statistical treatments of differences. Such an account also points to some key sites of instability or problematization inherent to the process of generalization. If movement through data is becoming a principal intersection of power relations, economic value and valid knowledge, an account of the production of prediction might also help us begin to ask how its generalization potentially gives rise to new forms of agency, experience or individuations.

[1]  Ashish Agarwal,et al.  Overlapping experiment infrastructure: more, better, faster experimentation , 2010, KDD.

[2]  Charles Anderson,et al.  The end of theory: The data deluge makes the scientific method obsolete , 2008 .

[3]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[4]  Eric Gossett,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2015 .

[5]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[6]  Adrian Mackenzie,et al.  Programming subjects in the regime of anticipation: Software studies and subjectivity , 2013 .

[7]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[8]  M. Foucault,et al.  The Order of Things , 2017 .

[9]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[10]  T. Davenport,et al.  Data scientist: the sexiest job of the 21st century. , 2012, Harvard business review.

[11]  Chris Arney Social Physics: How Good Ideas Spread - the Lessons from a New Science , 2014 .

[12]  R. Fisher THE STATISTICAL UTILIZATION OF MULTIPLE MEASUREMENTS , 1938 .

[13]  Matthew A. Russell,et al.  Mining the social web , 2011 .

[14]  Richard A. Olshen,et al.  CART: Classification and Regression Trees , 1984 .

[15]  Javier Solana,et al.  Big Data: A Revolution that Will Transform How We Work, Live and Think , 2014 .

[16]  Drew Conway,et al.  Machine Learning for Hackers , 2012 .

[17]  Christopher Kelty,et al.  Ten Thousand Journal Articles Later: Ethnography of «The Literature» in Science , 2009 .

[18]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[19]  S. Lash Power after Hegemony , 2007 .

[20]  P. Rabinow Anthropos Today: Reflections on Modern Equipment , 2003 .

[21]  D. Steinberg CART: Classification and Regression Trees , 2009 .

[22]  Thomas J. Steenburgh,et al.  Motivating Salespeople: What Really Works , 2012, Harvard business review.

[23]  Peter A. Flach,et al.  Machine Learning - The Art and Science of Algorithms that Make Sense of Data , 2012 .

[24]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[25]  Rachel Schutt,et al.  Doing Data Science , 2013 .