Less annotation on active learning using confidence-weighted predictions

Abstract This paper proposes an efficient and effective active online sequential learning approach, named as Less Annotated Active Learning Extreme Learning Machine (LAAL-ELM). It leverages the predictions’ confidence of the new arriving data to actively select both query-annotated samples and confidence-weighted predict-annotated ones to update the classifier, which contributes to less actively query annotation, and applies WOS-ELM, a discriminant model, to significantly reduce the computation complexity for doing online updating in one step. The proposed approach firstly gives a principle to evaluate confidence of the prediction in WOS-ELM; then determines what and how to update the model with new arriving data in the online phase: the uncertain instances are annotated by query their classes, almost-certain ones are weighted on its prediction’s confidence and the certain ones are discarded directly for reducing over-fitting; at last, the weighted and query-annotated samples are used to update the classifier. The proposed approach is evaluated on five real-world benchmark classification issues. And the experimental results demonstrate that the proposed LAAL-ELM can effectively reduce the number of queried samples while maintaining high level of classification performance.

[1]  Mary Czerwinski,et al.  Interactions with big data analytics , 2012, INTR.

[2]  Yiqiang Chen,et al.  Weighted extreme learning machine for imbalance learning , 2013, Neurocomputing.

[3]  Yiqiang Chen,et al.  Constraint Online Sequential Extreme Learning Machine for lifelong indoor localization system , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[4]  Zhiping Lin,et al.  Weighted Online Sequential Extreme Learning Machine for Class Imbalance Learning , 2013, Neural Processing Letters.

[5]  Kenji Fukumizu,et al.  Active Learning in Multilayer Perceptrons , 1995, NIPS.

[6]  Yong Zhang,et al.  Sequential active learning using meta-cognitive extreme learning machine , 2016, Neurocomputing.

[7]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[8]  Edwin Lughofer,et al.  Single-pass active learning with conflict and ignorance , 2012, Evolving Systems.

[9]  Jason Weston,et al.  Fast Kernel Classifiers with Online and Active Learning , 2005, J. Mach. Learn. Res..

[10]  Raymond J. Mooney,et al.  Active Learning for Natural Language Parsing and Information Extraction , 1999, ICML.

[11]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[12]  Edwin Lughofer,et al.  Hybrid active learning for reducing the annotation effort of operators in classification systems , 2012, Pattern Recognit..

[13]  Edwin Lughofer,et al.  On-line evolving image classifiers and their application to surface inspection , 2010, Image Vis. Comput..

[14]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[15]  Iman Saleh,et al.  Social-Network-Sourced Big Data Analytics , 2013, IEEE Internet Computing.

[16]  Za'er Salim Abo-Hammour,et al.  Numerical solution of systems of second-order boundary value problems using continuous genetic algorithm , 2014, Inf. Sci..

[17]  Peter Groves,et al.  The 'big data' revolution in healthcare: Accelerating value and innovation , 2016 .

[18]  D. Sculley,et al.  Online Active Learning Methods for Fast Label-Efficient Spam Filtering , 2007, CEAS.

[19]  Eyke Hüllermeier,et al.  FR3: A Fuzzy Rule Learner for Inducing Reliable Classifiers , 2009, IEEE Transactions on Fuzzy Systems.

[20]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[21]  Robert van Liere,et al.  Overview of interactive visualization , 2009 .

[22]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.

[23]  Gert Cauwenberghs,et al.  SVM incremental learning, adaptation and optimization , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[24]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[25]  Gisele Roesems-Kerremans,et al.  Big Data in Healthcare , 2016 .

[26]  Lihong Li,et al.  Unbiased online active learning in data streams , 2011, KDD.

[27]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[28]  Guang-Bin Huang,et al.  An Insight into Extreme Learning Machines: Random Neurons, Random Features and Kernels , 2014, Cognitive Computation.