Testing Concept Drift Detection Technique on Data Stream

Data mutates dynamically, and these transmutations are so diverse that it affects the quality and reliability of the model. Concept Drift is the quandary of such dynamic cognitions and modifications in the data stream which leads to change in the behaviour of the model. The problem of concept drift affects the prognostication quality of the software and thus reduces its precision. In most of the drift detection methods, it is followed that there are given labels for the incipient data sample which however is not practically possible. In this paper, the performance and accuracy of the proposed concept drift detection technique for the classification of streaming data with undefined labels will be tested. Testing is followed with the creation of the centroid classification model by utilizing some training examples with defined labels and test its precision with the test set and then compare the accuracy of the prediction model with and without the proposed concept drift detection technique.

[1]  Heng Wang,et al.  Concept drift detection for streaming data , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[2]  Latifur Khan Data Stream Mining: Challenges and Techniques , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[3]  Koichiro Yamauchi,et al.  Detecting Concept Drift Using Statistical Testing , 2007, Discovery Science.

[4]  Shikha Mehta,et al.  Concept drift in Streaming Data Classification: Algorithms, Platforms and Issues , 2017, ITQM.

[5]  Eyke Hüllermeier,et al.  Open challenges for data stream mining research , 2014, SKDD.

[6]  Cheong Hee Park,et al.  Concept Drift Detection on Streaming Data under Limited Labeling , 2016, 2016 IEEE International Conference on Computer and Information Technology (CIT).