Cost-sensitive Boosting for Concept Drift

Concept drift is a phenomenon typically experienced when data distributions change continuously over a period of time. In this paper we propose a cost-sensitive boosting approach for learning under concept drift. The proposed methodology estimates relevance costs of ‘old’ data samples w.r.t. to ‘newer’ samples and integrates it into the boosting process. We experiment this methodology on usenet1 and accelerometer based activity gesture datasets. The results demonstrate that the cost-sensitive boosting approach significantly improves classification performance over existing algorithms.

[1]  Grigorios Tsoumakas,et al.  An Ensemble of Classifiers for coping with Recurring Contexts in Data Streams , 2008, ECAI.

[2]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[3]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[4]  Yang Wang,et al.  Cost-sensitive boosting for classification of imbalanced data , 2007, Pattern Recognit..

[5]  Sethuraman Panchanathan,et al.  Recognition of hand movements using wearable accelerometers , 2009, J. Ambient Intell. Smart Environ..

[6]  ChengXiang Zhai,et al.  Instance Weighting for Domain Adaptation in NLP , 2007, ACL.

[7]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[8]  Eric Eaton,et al.  Set-Based Boosting for Instance-Level Transfer , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.