Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach

Hierarchical Classification (HC) is a supervised learning problem where unlabeled instances are classified into a taxonomy of classes. Several methods that utilize the hierarchical structure have been developed to improve the HC performance. However, in most cases apriori defined hierarchical structure by domain experts is inconsistent; as a consequence performance improvement is not noticeable in comparison to flat classification methods. We propose a scalable data-driven filter based rewiring approach to modify an expert-defined hierarchy. Experimental comparisons of top-down hierarchical classification with our modified hierarchy, on a wide range of datasets shows classification performance improvement over the baseline hierarchy (i.e., defined by expert), clustered hierarchy and flattening based hierarchy modification approaches. In comparison to existing rewiring approaches, our developed method (rewHier) is computationally efficient, enabling it to scale to datasets with large numbers of classes, instances and features. We also show that our modified hierarchy leads to improved classification performance for classes with few training samples in comparison to flat and state-of-the-art hierarchical classification approaches. Source Code: https://cs.gmu.edu/~mlbio/TaxMod/

[1]  Shui-Lung Chuang,et al.  A practical web-based approach to generating topic hierarchy for text segments , 2004, CIKM '04.

[2]  Philip S. Yu,et al.  On the merits of building categorization systems by supervised clustering , 1999, KDD '99.

[3]  Daphne Koller,et al.  Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.

[4]  Saso Dzeroski,et al.  Decision trees for hierarchical multi-label classification , 2008, Machine Learning.

[5]  Vipin Kumar,et al.  The Challenges of Clustering High Dimensional Data , 2004 .

[6]  Heng Tao Shen,et al.  Hashing for Similarity Search: A Survey , 2014, ArXiv.

[7]  Xiaolin Wang,et al.  Flatten hierarchies for large-scale hierarchical text categorization , 2010, 2010 Fifth International Conference on Digital Information Management (ICDIM).

[8]  Arthur Zimek,et al.  A Study of Hierarchical and Flat Classification of Proteins , 2010, IEEE/ACM Transactions on Computational Biology & Bioinformatics.

[9]  Tom M. Mitchell,et al.  Improving Text Classification by Shrinkage in a Hierarchy of Classes , 1998, ICML.

[10]  Thomas Hofmann,et al.  Hierarchical document categorization with support vector machines , 2004, CIKM '04.

[11]  Huzefa Rangwala,et al.  HierCost: Improving Large Scale Hierarchical Classification with Cost Sensitive Learning , 2015, ECML/PKDD.

[12]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[13]  Azad Naik,et al.  Integrated Framework for Improving Large-Scale Hierarchical Classification , 2017, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA).

[14]  Yiming Yang,et al.  Recursive regularization for large-scale classification with hierarchical and graphical dependencies , 2013, KDD.

[15]  Spyros Sioutas,et al.  CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and MapReduce , 2014, AIAI Workshops.

[16]  Saso Dzeroski,et al.  Hierarchical annotation of medical images , 2011, Pattern Recognit..

[17]  Kiyoshi Nitta,et al.  Improving taxonomies for large-scale hierarchical classifiers of web documents , 2010, CIKM.

[18]  Azad Naik,et al.  HierFlat: flattened hierarchies for improving top-down hierarchical classification , 2017, International Journal of Data Science and Analytics.

[19]  Ioannis Partalas,et al.  Maximum-Margin Framework for Training Data Synchronization in Large-Scale Hierarchical Classification , 2013, ICONIP.

[20]  Saso Dzeroski,et al.  Hierarchical classification of diatom images using ensembles of predictive clustering trees , 2012, Ecol. Informatics.

[21]  Georgios Paliouras,et al.  Evaluation measures for hierarchical classification: a unified view and novel approaches , 2013, Data Mining and Knowledge Discovery.

[22]  Huan Liu,et al.  Acclimatizing Taxonomic Semantics for Hierarchical Content Classification , 2006, KDD '06.

[23]  Joydeep Ghosh,et al.  Automatically learning document taxonomies for hierarchical classification , 2005, WWW '05.

[24]  Daphne Koller,et al.  Discriminative learning of relaxed hierarchy for large-scale visual recognition , 2011, 2011 International Conference on Computer Vision.

[25]  Brian D. Davison,et al.  Hierarchy evolution for improved classification , 2011, CIKM '11.

[26]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[27]  Ioannis Partalas,et al.  On Flat versus Hierarchical Classification in Large-Scale Taxonomies , 2013, NIPS.

[28]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[29]  Lin Xiao,et al.  Hierarchical Classification via Orthogonal Transfer , 2011, ICML.

[30]  Hassan H. Malik Improving Hierarchical SVMs by Hierarchy Flattening and Lazy Classification , 2010 .

[31]  Ee-Peng Lim,et al.  Hierarchical text classification and evaluation , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[32]  Azad Naik,et al.  Filter based Taxonomy Modification for Improving Hierarchical Classification , 2016, ArXiv.

[33]  Azad Naik,et al.  Inconsistent Node Flattening for Improving Top-Down Hierarchical Classification , 2016, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[34]  Tao Qin,et al.  Site abstraction for rare category classification in large-scale web directory , 2005, WWW '05.