Towards Hierarchical Classification of Data Streams

In data stream mining, state-of-the-art machine learning algorithms for the classification task associate each event with a class belonging to a finite, devoid of structural dependencies and usually small, set of classes. However, there are more complex dynamic problems where the classes we want to predict make up a hierarchal structure. In this paper, we propose an incremental method for hierarchical classification of data streams. We experimentally show that our stream hierarchical classifier present advantages to the traditional online setting in three real-world problems related to entomology, ichthyology, and audio processing.

[1]  Beth Logan,et al.  Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.

[2]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[3]  Niall M. Adams,et al.  The impact of changing populations on classifier performance , 1999, KDD '99.

[4]  João Gama,et al.  Data Stream Classification Guided by Clustering on Nonstationary Environments and Extreme Verification Latency , 2015, SDM.

[5]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[6]  Hervé Glotin,et al.  LifeCLEF 2016: Multimedia Life Species Identification Challenges , 2016, CLEF.

[7]  Fabrizio Sebastiani,et al.  On the Selection of Negative Examples for Hierarchical Text Categorization , 2007 .

[8]  Stan Matwin,et al.  Functional Annotation of Genes Using Hierarchical Text Categorization , 2005 .

[9]  Wee Keong Ng,et al.  A survey on data stream clustering and classification , 2015, Knowledge and Information Systems.

[10]  Rich Caruana,et al.  An empirical evaluation of supervised learning in high dimensions , 2008, ICML '08.

[11]  Newton Spolaôr,et al.  Dermoscopic assisted diagnosis in melanoma: Reviewing results, optimizing methodologies and quantifying empirical guidelines , 2018, Knowl. Based Syst..

[12]  Roberto Souto Maior de Barros,et al.  A comparative study on concept drift detectors , 2014, Expert Syst. Appl..

[13]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[14]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[15]  Duane Szafron,et al.  Improving Protein Function Prediction using the Hierarchical Structure of the Gene Ontology , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[16]  Vinícius M. A. de Souza,et al.  Classification of Data Streams Applied to Insect Recognition: Initial Results , 2013, 2013 Brazilian Conference on Intelligent Systems.