Putting the Scientist in the Loop -- Accelerating Scientific Progress with Interactive Machine Learning

Technology drives advances in science. Giving scientists access to more powerful tools for collecting and understanding data enables them to both ask and answer new kinds questions that were previously beyond their reach. Of these new tools at their disposal, machine learning offers the opportunity to understand and analyze data at unprecedented scales and levels of detail. The standard machine learning pipeline consists of data labeling, feature extraction, training, and evaluation. However, without expert machine learning knowledge, it is difficult for scientists to optimally construct this pipeline to fully leverage machine learning in their work. Using ecology as a motivating example, we analyze a typical scientist's data collection and processing workflow and highlight many problems facing practitioners when attempting to capitalize on advances in machine learning and pattern recognition. Understanding these shortcomings allows us to outline several novel and underexplored research directions. We end with recommendations to motivate progress in future cross-disciplinary work.

[1]  Luc Van Gool,et al.  Interactive object detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Tim Weyrich,et al.  Capturing Time-of-Flight data with confidence , 2011, CVPR 2011.

[3]  Radford M. Neal Probabilistic Inference Using Markov Chain Monte Carlo Methods , 2011 .

[4]  Roberto Cipolla,et al.  Semantic object classes in video: A high-definition ground truth database , 2009, Pattern Recognit. Lett..

[5]  Marc Pollefeys,et al.  Learning a Confidence Measure for Optical Flow , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[7]  Tao Xiang,et al.  Active Rare Class Discovery and Classification Using Dirichlet Processes , 2014, International Journal of Computer Vision.

[8]  James Davey,et al.  Guiding feature subset selection with an interactive visualization , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[9]  Charless C. Fowlkes,et al.  Do We Need More Training Data or Better Models for Object Detection? , 2012, BMVC.

[10]  Kristen Grauman,et al.  Active Learning of an Action Detector from Untrimmed Videos , 2013, 2013 IEEE International Conference on Computer Vision.

[11]  Desney S. Tan,et al.  EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers , 2009, CHI.

[12]  Friedrich Recknagel,et al.  Applications of machine learning to ecological modelling , 2001 .

[13]  Srinivas C. Turaga,et al.  Space-time wiring specificity supports direction selectivity in the retina , 2014, Nature.

[14]  Kate E. Jones,et al.  Challenges of Using Bioacoustics to Globally Monitor Bats , 2013 .

[15]  Kate E. Jones,et al.  Indicator bats program: A system for the global acoustic monitoring of bats , 2013 .

[16]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17]  Weng-Keen Wong,et al.  Category detection using hierarchical mean shift , 2009, KDD.

[18]  Christian Dietz,et al.  A continental-scale tool for acoustic identification of European bats , 2012 .

[19]  Kristen Grauman,et al.  Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds , 2011, CVPR 2011.

[20]  Gerardo Hermosillo,et al.  Supervised learning from multiple experts: whom to trust when everyone lies a bit , 2009, ICML '09.

[21]  P. Leadley,et al.  Impacts of climate change on the future of biodiversity. , 2012, Ecology letters.

[22]  Jan Kautz,et al.  Hierarchical Subquery Evaluation for Active Learning on a Graph , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Hagai Attias,et al.  A Variational Bayesian Framework for Graphical Models , 1999 .

[24]  Dong Yu,et al.  Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Cordelia Schmid,et al.  Action and Event Recognition with Fisher Vectors on a Compact Feature Set , 2013, 2013 IEEE International Conference on Computer Vision.

[26]  J. Lamarque,et al.  Global Biodiversity: Indicators of Recent Declines , 2010, Science.

[27]  G. Mace,et al.  Biodiversity and ecosystem services: a multilayered relationship. , 2012, Trends in ecology & evolution.

[28]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[29]  David Cohn,et al.  Active Learning , 2010, Encyclopedia of Machine Learning.

[30]  Ronald Marsh,et al.  Wildlife@Home: Combining Crowd Sourcing and Volunteer Computing to Analyze Avian Nesting Video , 2013, 2013 IEEE 9th International Conference on e-Science.

[31]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[32]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[33]  Deva Ramanan,et al.  Efficiently Scaling up Crowdsourced Video Annotation , 2012, International Journal of Computer Vision.

[34]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[35]  Gabriel J. Brostow,et al.  Revisiting Example Dependent Cost-Sensitive Learning with Decision Trees , 2013, 2013 IEEE International Conference on Computer Vision.

[36]  Todd Kulesza,et al.  Structured labeling for facilitating concept evolution in machine learning , 2014, CHI.

[37]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[38]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[39]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[40]  T. Scott Brandes,et al.  Automated sound recording and analysis techniques for bird surveys and conservation , 2008, Bird Conservation International.

[41]  Joachim M. Buhmann,et al.  Active learning for semantic segmentation with expected change , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Deva Ramanan,et al.  Video Annotation and Tracking with Active Learning , 2011, NIPS.

[43]  Kiri Wagstaff,et al.  Machine Learning that Matters , 2012, ICML.

[44]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[45]  Mark A. Girolami,et al.  Bat Call Identification with Gaussian Process Multinomial Probit Regression and a Dynamic Time Warping Kernel , 2014, AISTATS.

[46]  Alan H. Fielding,et al.  Machine Learning Methods for Ecological Applications , 2012, Springer US.

[47]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[48]  Kristin Branson,et al.  JAABA: interactive machine learning for automatic annotation of animal behavior , 2013, Nature Methods.

[49]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[50]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[51]  C. Carbone,et al.  Surveys using camera traps: are we looking to a brighter future? , 2008 .

[52]  Pietro Perona,et al.  Social behavior recognition in continuous video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[54]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[55]  Jerry Alan Fails,et al.  Interactive machine learning , 2003, IUI '03.

[56]  Pietro Perona,et al.  The Multidimensional Wisdom of Crowds , 2010, NIPS.

[57]  Yuzhen Niu,et al.  Video summagator: an interface for video summarization and navigation , 2012, CHI.

[58]  Panagiotis G. Ipeirotis,et al.  Quality management on Amazon Mechanical Turk , 2010, HCOMP '10.

[59]  Frédéric Jurie,et al.  Motion Models that Only Work Sometimes , 2012, BMVC.

[60]  M. Girolami,et al.  Riemann manifold Langevin and Hamiltonian Monte Carlo methods , 2011, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[61]  Tom Minka,et al.  Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[62]  G. Mace,et al.  Bringing Ecosystem Services into Economic Decision-Making: Land Use in the United Kingdom , 2013, Science.

[63]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[64]  Antonio Torralba,et al.  Are all training examples equally valuable? , 2013, ArXiv.

[65]  Kristen Grauman,et al.  Cost-Sensitive Active Visual Category Learning , 2010, International Journal of Computer Vision.