Detecting Concept Drift with Support Vector Machines

For many learning tasks where data is collected over an extended period of time, its underlying distribution is likely to change. A typical example is information ltering, i.e. the adaptive classiication of documents with respect to a particular user interest. Both the interest of the user and the document content change over time. A ltering system should be able to adapt to such concept changes. This paper proposes a new method to recognize and handle concept changes with support vector machines. The method maintains a window on the training data. The key idea is to automatically adjust the window size so that the estimated generalization error is minimized. The new approach is both theoretically well-founded as well as eeective and eecient in practice. Since it does not require complicated parameterization, it is simpler to use and more robust than comparable heuristics. Experiments with simulated concept drift scenarios based on real-world text data compare the new method with other window management approaches. We show that it can eeectively select an appropriate window size in a robust way.

[1]  Roger Fletcher,et al.  Practical methods of optimization; (2nd ed.) , 1987 .

[2]  R. Fletcher Practical Methods of Optimization , 1988 .

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  Ronald L. Rivest,et al.  Learning Time-Varying Concepts , 1990, NIPS.

[5]  IJsbrand Jan Aalbersberg,et al.  Incremental relevance feedback , 1992, SIGIR '92.

[6]  Tom M. Mitchell,et al.  Experience with a learning personal assistant , 1994, CACM.

[7]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[8]  Marko Balabanovic,et al.  An adaptive Web page recommendation service , 1997, AGENTS '97.

[9]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[10]  Ingrid Renz,et al.  Adaptive Information Filtering: Learning in the Presence of Concept Drifts , 1998 .

[11]  J. C. BurgesChristopher A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[12]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[13]  B. Schölkopf,et al.  Advances in kernel methods: support vector learning , 1999 .

[14]  Huan Liu,et al.  Handling concept drifts in incremental learning with support vector machines , 1999, KDD '99.

[15]  Ralf Klinkenberg Maschinelle Lernverfahren zum adaptiven Informationsfiltern bei sich verändernden Konzepten , 2000 .

[16]  Thorsten Joachims,et al.  Estimating the Generalization Performance of an SVM Efficiently , 2000, ICML.