A First Attempt on Online Data Stream Classifier Using Context

The big data is characterized by 4Vs (volume, velocity, variety, and variability). In this paper we focus on the velocity, but actually it usually comes together with volume. It means, that the crucial problem of the contemporary data analytics is to answer the question how to discover useful knowledge from fast incoming data. The paper presents an online data stream classification method, which adapts the classification with context to recognize incoming examples and additionally takes into consideration the memory and processing time limitations. The proposed method was evaluated on the real medical diagnosis task. The preliminary results of the experiments encourage us to continue works on the proposed approach.

[1]  Frank Kirchner,et al.  Performance evaluation of EANT in the robocup keepaway benchmark , 2007, ICMLA 2007.

[2]  Geoff Hulten,et al.  A General Framework for Mining Massive Data Streams , 2003 .

[3]  Robert M. Haralick,et al.  Decision Making in Context , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[5]  Josef Raviv,et al.  Decision making in Markov chains applied to the problem of pattern recognition , 1967, IEEE Trans. Inf. Theory.

[6]  Andrzej Onierek Pattern recognition algorithms for controlled Markov chains and their application to medical diagnosis , 1983 .

[7]  Michal Wozniak,et al.  Active learning approach to concept drift problem , 2012, Log. J. IGPL.

[8]  Michal Wozniak,et al.  Proposition of common classifier construction for pattern recognition with context task , 2006, Knowl. Based Syst..

[9]  Matthew Goldstein,et al.  Kn -nearest Neighbor Classification , 1972, IEEE Trans. Inf. Theory.

[10]  Raj K. Bhatnagar,et al.  Tracking recurrent concept drift in streaming data using ensemble classifiers , 2007, ICMLA 2007.

[11]  Thomas Seidl,et al.  MOA: A Real-Time Analytics Open Source Framework , 2011, ECML/PKDD.

[12]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[13]  Godfried T. Toussaint,et al.  The use of context in pattern recognition , 1978, Pattern Recognit..

[14]  Burr Settles,et al.  Active Learning , 2012, Synthesis Lectures on Artificial Intelligence and Machine Learning.