A Simple Unlearning Framework for Online Learning Under Concept Drifts

Real-world online learning applications often face data coming from changing target functions or distributions. Such changes, called concept drift, degrade the performance of traditional online learning algorithms. Thus, many existing works focus on detecting concept drift based on statistical evidence. Other works use sliding windows or similar mechanisms to select the data that most closely reflect the current concept. Nevertheless, few works study how the detection and selection techniques can be combined to improve learning performance. We propose a novel framework on top of existing online learning algorithms to improve learning performance under concept drifts. The framework detects a possible concept drift by checking whether forgetting some older data may be helpful, and then conducts the forgetting through a step called unlearning. The framework effectively results in a dynamic sliding window that flexibly selects data for different kinds of concept drifts. We design concrete approaches from the framework based on three popular online learning algorithms. Empirical results show that the framework consistently improves those algorithms on ten synthetic data sets and two real-world data sets.
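The detect-then-unlearn idea described above can be illustrated with a small sketch. It is not the procedure from the paper: the base learner (a plain perceptron), the hold-out comparison used as the "would forgetting help?" test, and every name in it (`UnlearningPerceptron`, `holdout`, `max_buffer`, `unlearn_count`) are assumptions made purely for illustration.

```python
def predict(w, x):
    """Linear classifier: the sign of the dot product, with ties going to +1."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0 else -1


def perceptron_update(w, x, y):
    """Standard perceptron rule: change the weights only on a mistake."""
    if predict(w, x) != y:
        return [wi + y * xi for wi, xi in zip(w, x)]
    return w


def train(examples, dim):
    """Train a fresh perceptron with one pass over `examples`."""
    w = [0.0] * dim
    for x, y in examples:
        w = perceptron_update(w, x, y)
    return w


def mistakes(w, examples):
    """Count the misclassified examples."""
    return sum(predict(w, x) != y for x, y in examples)


class UnlearningPerceptron:
    """Hypothetical wrapper: an online perceptron plus an unlearning test."""

    def __init__(self, dim, holdout=10, max_buffer=200):
        self.dim = dim
        self.holdout = holdout        # newest examples used for the comparison
        self.max_buffer = max_buffer  # hard cap on the number of stored examples
        self.w = [0.0] * dim
        self.buffer = []              # the dynamic window of retained examples
        self.unlearn_count = 0

    def step(self, x, y):
        """Predict on x, learn from (x, y), then test whether to unlearn."""
        y_hat = predict(self.w, x)
        self.w = perceptron_update(self.w, x, y)
        self.buffer.append((x, y))
        self.buffer = self.buffer[-self.max_buffer:]

        if len(self.buffer) >= 4 * self.holdout:
            # Candidate model: trained as if the older half had been forgotten.
            recent = self.buffer[len(self.buffer) // 2:]
            fit, check = recent[:-self.holdout], recent[-self.holdout:]
            candidate = train(fit, self.dim)
            # Assumed drift test: forgetting helps if the candidate makes
            # strictly fewer mistakes on the newest examples.
            if mistakes(candidate, check) < mistakes(self.w, check):
                self.w = train(recent, self.dim)  # unlearning step
                self.buffer = recent              # the window shrinks
                self.unlearn_count += 1
        return y_hat
```

Calling `step(x, y)` once per arriving example returns the prediction made before the update. The window grows while the concept appears stable and is cut in half whenever the forgetting test succeeds, which is one simple way to obtain a dynamic sliding window. Retraining the candidate at every step costs time linear in the buffer size, so a practical variant would likely run the test less often.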
