Learning from data with uncertain labels by boosting credal classifiers

In this article, we investigate supervised learning when training data are associated with uncertain labels. We tackle this problem within the theory of belief functions. Each training pattern xi is thus associated with a basic belief assignment, representing partial knowledge of its actual class. Here, we propose to use the approach known as boosting to solve the classification problem. We propose a variant of the AdaBoost algorithm where the outputs of the classifiers are interpreted as belief functions. During training, our algorithm estimates the reliability of each classifier to identify patterns from the various classes. During test phase, the outputs of the classifiers are first discounted according to these reliabilities, and then combined using a suitable rule. Experiments conducted on classical datasets show that our algorithm is comparable to AdaBoost in accuracy. Processing EEG data with imperfect labels clearly demonstrates the interest of taking into account the reliability of the labelling, and thus the relevance of our approach.

[1]  Didier Dubois,et al.  The principle of minimum specificity as a basis for evidential reasoning , 1986, IPMU.

[2]  Thierry Denoeux,et al.  Adapting a Combination Rule to Non-Independent Information Sources , 2008, IPMU 2008.

[3]  Thierry Denoeux,et al.  Refined classifier combination using belief functions , 2008, 2008 11th International Conference on Information Fusion.

[4]  Thierry Denoeux A k -Nearest Neighbor Classification Rule Based on Dempster-Shafer Theory , 2008, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[5]  Alessandro Saffiotti,et al.  The Transferable Belief Model , 1991, ECSQARU.

[6]  Thierry Denoeux,et al.  Induction of decision trees from partially classified data using belief functions , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.

[7]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[8]  T. Denœux Conjunctive and disjunctive combination of belief functions induced by nondistinct bodies of evidence , 2008 .

[9]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[10]  P. Smets Data fusion in the transferable belief model , 2000, Proceedings of the Third International Conference on Information Fusion.

[11]  Thierry Denoeux,et al.  An evidence-theoretic k-NN rule with parameter optimization , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[12]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[13]  Thierry Denoeux,et al.  Refined modeling of sensor reliability in the belief function framework using contextual discounting , 2008, Inf. Fusion.

[14]  Didier Dubois,et al.  A definition of subjective possibility , 2008, Int. J. Approx. Reason..

[15]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[16]  Yaxin Bi,et al.  The combination of multiple classifiers using an evidential reasoning approach , 2008, Artif. Intell..

[17]  Philippe Smets,et al.  The Combination of Evidence in the Transferable Belief Model , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Radko Mesiar,et al.  Triangular Norms , 2000, Trends in Logic.

[19]  Cédric Richard Une méthodologie pour la détection à structure imposée : Applications au plan temps-fréquence , 1998 .

[20]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[21]  Thierry Denoeux,et al.  Pairwise classifier combination using belief functions , 2007, Pattern Recognit. Lett..

[22]  Thierry Denoeux,et al.  Handling possibilistic labels in pattern classification using evidential reasoning , 2001, Fuzzy Sets Syst..

[23]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[24]  P. Vannoorenberghe,et al.  Handling uncertain labels in multiclass problems using belief decision trees , 2002 .

[25]  Thierry Denoeux,et al.  Resample and combine: an approach to improving uncertainty representation in evidential pattern classification , 2003, Inf. Fusion.

[26]  Francesc Esteva,et al.  Review of Triangular norms by E. P. Klement, R. Mesiar and E. Pap. Kluwer Academic Publishers , 2003 .

[27]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[28]  Thierry Denoeux,et al.  Learning from partially supervised data using mixture models and belief functions , 2009, Pattern Recognit..

[29]  Ronald R. Yager,et al.  Building ensemble classifiers using belief functions and OWA operators , 2008, Soft Comput..

[30]  Philippe Smets,et al.  Belief functions: The disjunctive rule of combination and the generalized Bayesian theorem , 1993, Int. J. Approx. Reason..

[31]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1990, COLT '90.