Machine Learning that Matters

Much of current machine learning (ML) research has lost its connection to problems of import to the larger world of science and society. From this perspective, there exist glaring limitations in the data sets we investigate, the metrics we employ for evaluation, and the degree to which results are communicated back to their originating domains. What changes are needed to how we conduct research to increase the impact that ML has? We present six Impact Challenges to explicitly focus the field’s energy and attention, and we discuss existing obstacles that must be addressed. We aim to inspire ongoing discussion and focus on ML that matters.

[1]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[2]  Doina Precup,et al.  A Machine Learning Approach to the Detection of Fetal Hypoxia during Labor and Delivery , 2010, AI Mag..

[3]  Amartya Sen,et al.  Human development Index: Methodology and Measurement , 1994 .

[4]  Yi Chang,et al.  Yahoo! Learning to Rank Challenge Overview , 2010, Yahoo! Learning to Rank Challenge.

[5]  Jelle J Goeman,et al.  Resolving confusion of tongues in statistics and machine learning: A primer for biologists and bioinformaticians , 2012, Proteomics.

[6]  James Bennett,et al.  The Netflix Prize , 2007 .

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Pat Langley,et al.  The changing science of machine learning , 2011, Machine Learning.

[9]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[10]  R. Real,et al.  AUC: a misleading measure of the performance of predictive distribution models , 2008 .

[11]  Jonathan A. Zdziarski,et al.  Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification , 2005 .

[12]  C. Gomes Computational Sustainability: Computational methods for a sustainable environment, economy, and society , 2009 .

[13]  Student,et al.  THE PROBABLE ERROR OF A MEAN , 1908 .

[14]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.