Stream-based joint exploration-exploitation active learning

Learning from streams of evolving and unbounded data is an important problem, for example in visual surveillance or internet scale data. For such large and evolving real-world data, exhaustive supervision is impractical, particularly so when the full space of classes is not known in advance therefore joint class discovery (exploration) and boundary learning (exploitation) becomes critical. Active learning has shown promise in jointly optimising exploration-exploitation with minimal human supervision. However, existing active learning methods either rely on heuristic multi-criteria weighting or are limited to batch processing. In this paper, we present a new unified framework for joint exploration-exploitation active learning in streams without any heuristic weighting. Extensive evaluation on classification of various image and surveillance video datasets demonstrates the superiority of our framework over existing methods.

[1]  Shlomo Argamon,et al.  Committee-Based Sample Selection for Probabilistic Classifiers , 1999, J. Artif. Intell. Res..

[2]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[3]  Shaogang Gong,et al.  Finding Rare Classes: Adapting Generative and Discriminative Models in Active Learning , 2011, PAKDD.

[4]  Sanjay Kumar Madria,et al.  Sensor networks: an overview , 2003 .

[5]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[6]  Adriana Kovashka,et al.  Actively selecting annotations among objects and attributes , 2011, 2011 International Conference on Computer Vision.

[7]  Tao Xiang,et al.  Active Learning using Dirichlet Processes for Rare Class Discovery and Classification , 2011, BMVC.

[8]  Simon J. D. Prince,et al.  Computer Vision: Models, Learning, and Inference , 2012 .

[9]  Frank D. Wood,et al.  The sequence memoizer , 2011, Commun. ACM.

[10]  J. Pitman,et al.  The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator , 1997 .

[11]  Pietro Perona,et al.  Incremental learning of nonparametric Bayesian mixture models , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Yee Whye Teh,et al.  A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes , 2006, ACL.

[13]  R. Schiffer Psychobiology of Language , 1986 .

[14]  Shaogang Gong,et al.  Stream-Based Active Unusual Event Detection , 2010, ACCV.

[15]  John Platt,et al.  ALADIN: Active Learning of Anomalies to Detect Intrusion , 2008 .

[16]  Yee Whye Teh,et al.  A Bayesian Interpretation of Interpolated Kneser-Ney , 2006 .

[17]  Trevor Darrell,et al.  Gaussian Processes for Object Categorization , 2010, International Journal of Computer Vision.

[18]  Harry Wechsler,et al.  Query by Transduction , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[20]  Andrew W. Moore,et al.  Active Learning for Anomaly and Rare-Category Detection , 2004, NIPS.

[21]  Pietro Perona,et al.  Online crowdsourcing: Rating annotators and obtaining cost-effective labels , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[22]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[23]  Jingrui He,et al.  Nearest-Neighbor-Based Active Learning for Rare Category Detection , 2007, NIPS.

[24]  Max Welling,et al.  Accelerated Variational Dirichlet Process Mixtures , 2006, NIPS.

[25]  Emin Orhan Dirichlet Processes , 2012 .

[26]  George Kingsley Zipf,et al.  The Psychobiology of Language , 2022 .

[27]  Kristen Grauman,et al.  Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds , 2011, CVPR 2011.

[28]  Joshua B. Tenenbaum,et al.  Learning to share visual appearance for multiclass object detection , 2011, CVPR 2011.

[29]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[30]  Tao Xiang,et al.  Finding Rare Classes: Active Learning with Generative and Discriminative Models , 2013, IEEE Transactions on Knowledge and Data Engineering.

[31]  John Langford,et al.  Importance weighted active learning , 2008, ICML '09.

[32]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[33]  Simon J. D. Prince,et al.  Computer Vision: Index , 2012 .

[34]  Trevor Darrell,et al.  Supervised hierarchical Pitman-Yor process for natural scene segmentation , 2011, CVPR 2011.