Three-way decisions based blocking reduction models in hierarchical classification

Abstract Hierarchical classification (HC) is effective when categories are organized hierarchically. However, the blocking problem makes the effect of hierarchical classification greatly reduced. Blocking means that samples are easily getting misclassified in high-level classifiers so that the samples are blocked at the high-level of the hierarchy. This issue is caused by the inconsistency between the artificially defined hierarchy and the actual hierarchy of the raw data. Another issue is that it is flippant to strictly process data following the hierarchy. Therefore, special treatment is required for some uncertain data. To address the first issue, we learn category relationships and modify the hierarchy. To address the second issue, we introduce three-way decisions (3WD) to targetedly deal with the ambiguous data. We extend original studies and propose two HC models based on 3WD, collectively referred to as TriHC, for carefully modifying the hierarchy to alleviate the blocking problem. The proposed TriHC model learns new category hierarchies by the following three steps: (1) mining category relations; (2) modifying category hierarchies according to the latent category relations; and (3) using 3WD to divide observed objects into three regions: positive region, boundary region, and negative region, and making decisions based on different strategies. Specifically, based on different category relation mining methods, there are two versions of TriHC, cross-level blocking priori knowledge based TriHC (CLPK-TriHC) and expert classifier based TriHC (EC-TriHC). The CLPK-TriHC model defines a cross-level blocking distribution matrix to mine the category relations between the higher and lower levels. To better exploit category hierarchical relations, the EC-TriHC model builds expert classifiers using topic model to learn latent category topics. Experimental results validate that the proposed methods can simultaneously reduce the blocking and improve the classification accuracy.

[1]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[2]  Daoqiang Zhang,et al.  Joint Binary Classifier Learning for ECOC-Based Multi-Class Classification , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Ning Jia,et al.  Decoding design based on posterior probabilities in Ternary Error-Correcting Output Codes , 2012, Pattern Recognit..

[4]  Fei-Fei Li,et al.  Novel Dataset for Fine-Grained Image Categorization : Stanford Dogs , 2012 .

[5]  Chih-Fong Tsai,et al.  Soft estimation by hierarchical classification and regression , 2017, Neurocomputing.

[6]  G. Riecker,et al.  [Principles of medical decision making]. , 1993, Medizinische Klinik.

[7]  Yiyu Yao,et al.  An Outline of a Theory of Three-Way Decisions , 2012, RSCTC.

[8]  Zhihua Wei,et al.  A Self-adaptive Cascade ConvNets Model Based on Three-Way Decision Theory , 2017, CCCV.

[9]  H. Sox,et al.  Principles of medical decision making. , 1999, Spine.

[10]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[11]  Xiaofei Deng,et al.  Multistage Email Spam Filtering Based on Three-Way Decisions , 2013, RSKT.

[12]  Xiaogang Wang,et al.  DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yiyu Yao,et al.  Cost-sensitive three-way email spam filtering , 2013, Journal of Intelligent Information Systems.

[14]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[15]  Daphne Koller,et al.  Discriminative learning of relaxed hierarchy for large-scale visual recognition , 2011, 2011 International Conference on Computer Vision.

[16]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[17]  Yiyu Yao,et al.  Three-way decisions with probabilistic rough sets , 2010, Inf. Sci..

[18]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[19]  Yiyu Yao,et al.  Three-Way Decision: An Interpretation of Rules in Rough Set Theory , 2009, RSKT.

[20]  J. Kassirer,et al.  The threshold approach to clinical decision making. , 1980, The New England journal of medicine.

[21]  Andrey V. Savchenko,et al.  Sequential three-way decisions in multi-category image recognition with deep features based on distance factor , 2019, Inf. Sci..

[22]  Prabhakar Raghavan,et al.  Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies , 1998, The VLDB Journal.

[23]  Daphne Koller,et al.  Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.

[24]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[25]  Bing Huang,et al.  Sequential three-way decision and granulation for cost-sensitive face recognition , 2016, Knowl. Based Syst..

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Yiyu Yao,et al.  The superiority of three-way decisions in probabilistic rough set models , 2011, Inf. Sci..

[28]  Tianxing Wang,et al.  An optimization-based formulation for three-way decisions , 2019, Inf. Sci..

[29]  Ming Shao,et al.  Learning Consensus Representation for Weak Style Classification , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Xiangliang Zhang,et al.  Exploiting reject option in classification for social discrimination control , 2018, Inf. Sci..

[31]  Bing Huang,et al.  Cost-sensitive sequential three-way decision modeling using a deep neural network , 2017, Int. J. Approx. Reason..

[32]  Paolo Frasconi,et al.  New results on error correcting output codes of kernel machines , 2004, IEEE Transactions on Neural Networks.

[33]  Dominik Slezak,et al.  Rough Sets and Bayes Factor , 2005, Trans. Rough Sets.

[34]  Zhihua Wei,et al.  A self-adaptive cascade ConvNets model based on label relation mining , 2019, Neurocomputing.

[35]  Wei Zhang,et al.  Three-Way Decisions Based Multi-label Learning Algorithm with Label Dependency , 2016, IJCRS.

[36]  Miao Duo Hierarchical Text Classification Model Based on Blocking Priori Knowledge , 2010 .

[37]  Saso Dzeroski,et al.  Hierarchical annotation of medical images , 2011, Pattern Recognit..

[38]  Weisi Lin,et al.  Learning ECOC Code Matrix for Multiclass Classification with Application to Glaucoma Diagnosis , 2016, Journal of Medical Systems.

[39]  Jiaqi Wang,et al.  A cost-sensitive three-way combination technique for ensemble learning in sentiment classification , 2019, Int. J. Approx. Reason..

[40]  Alex Alves Freitas,et al.  A Hierarchical Classification Ant Colony Algorithm for Predicting Gene Ontology Terms , 2009, EvoBIO.

[41]  Azad Naik,et al.  Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach , 2018, Journal of Intelligent Information Systems.

[42]  Junsheng Qiao,et al.  Hesitant relations: Novel properties and applications in three-way decisions , 2019, Inf. Sci..

[43]  Azad Naik,et al.  Inconsistent Node Flattening for Improving Top-Down Hierarchical Classification , 2016, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[44]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[45]  Yiyu Yao,et al.  Sequential three-way decisions with probabilistic rough sets , 2011, IEEE 10th International Conference on Cognitive Informatics and Cognitive Computing (ICCI-CC'11).

[46]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Motoaki Kawanabe,et al.  Efficient Classification of Images with Taxonomies , 2009, ACCV.

[48]  Yiyu Yao,et al.  Detecting and refining overlapping regions in complex networks with three-way decisions , 2016, Inf. Sci..

[49]  Jaideep Srivastava,et al.  Blocking reduction strategies in hierarchical text classification , 2004, IEEE Transactions on Knowledge and Data Engineering.

[50]  Bing Huang,et al.  Sequential three-way decision based on multi-granular autoencoder features , 2020, Inf. Sci..

[51]  Vincent T. Y. Ng,et al.  A Hierarchical Ensemble of ECOC for cancer classification based on multi-class microarray data , 2016, Inf. Sci..