Top-k Hierarchical Classification

This paper studies a top-k hierarchical classification problem. In top-k classification, one is allowed to make k predictions and no penalty is incurred if at least one of k predictions is correct. In hierarchical classification, classes form a structured hierarchy, and misclassification costs depend on the relation between the correct class and the incorrect class in the hierarchy. Despite that the fact that both top-k classification and hierarchical classification have gained increasing interests, the two problems have always been studied separately. In this paper, we define a top-k hierarchical loss function using a real world application. We provide the Bayes-optimal solution that minimizes the expected top-k hierarchical misclassification cost. Via numerical experiments, we show that our solution outperforms two baseline methods that address only one of the two issues.

[1]  Maya R. Gupta,et al.  Training highly multiclass classifiers , 2014, J. Mach. Learn. Res..

[2]  Claudio Gentile,et al.  Hierarchical classification: combining Bayes with SVM , 2006, ICML.

[3]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[4]  Sechan Oh,et al.  Impact of cost uncertainty on pricing decisions under risk aversion , 2016, Eur. J. Oper. Res..

[5]  Amanda Clare,et al.  Predicting gene function in Saccharomyces cerevisiae , 2003, ECCB.

[6]  Bernt Schiele,et al.  Top-k Multiclass SVM , 2015, NIPS.

[7]  Brendan J. Frey,et al.  Probabilistic n-Choose-k Models for Classification and Ranking , 2012, NIPS.

[8]  James T. Kwok,et al.  Bayes-Optimal Hierarchical Multilabel Classification , 2015, IEEE Transactions on Knowledge and Data Engineering.

[9]  Kush R. Varshney,et al.  Optigrow: People Analytics for Job Transfers , 2015, 2015 IEEE International Congress on Big Data.

[10]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[11]  Stephen P. Boyd,et al.  Accuracy at the Top , 2012, NIPS.

[12]  Saso Dzeroski,et al.  Decision trees for hierarchical multi-label classification , 2008, Machine Learning.

[13]  Claudio Gentile,et al.  Incremental Algorithms for Hierarchical Classification , 2004, J. Mach. Learn. Res..

[14]  Thomas Hofmann,et al.  Hierarchical document categorization with support vector machines , 2004, CIKM '04.

[15]  Yaniv Altshuler,et al.  Workforce optimization: Identification and assignment of professional workers using constraint programming , 2007, IBM J. Res. Dev..

[16]  Luis Enrique Sucar,et al.  Hierarchical multilabel classification based on path evaluation , 2016, Int. J. Approx. Reason..

[17]  Jonathan Krause,et al.  Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Luis Enrique Sucar,et al.  Multi-label Classification for Tree and Directed Acyclic Graphs Hierarchies , 2014, Probabilistic Graphical Models.

[19]  Andrei Z. Broder,et al.  A semantic approach to contextual advertising , 2007, SIGIR.

[20]  Patrick Gallinari,et al.  Ranking with ordered weighted pairwise classification , 2009, ICML '09.

[21]  Michael Schroeder,et al.  Automated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed , 2013, Journal of Biomedical Semantics.