Uncertainty Quantification for Text Classification

This full-day tutorial introduces modern techniques for practical uncertainty quantification in the context of multi-class and multi-label text classification. First, we explain why estimating aleatoric uncertainty (irreducible noise inherent in the data) and epistemic uncertainty (model uncertainty that shrinks as more data is observed) is useful for text classification models.

Then, we describe several state-of-the-art approaches to uncertainty quantification and analyze their scalability to big text data: virtual ensembles in gradient-boosted decision trees (GBDT); Bayesian deep learning, including Deep Ensembles, Monte-Carlo Dropout, Bayes by Backprop, and their generalization, Epistemic Neural Networks; evidential deep learning, including Prior Networks and Posterior Networks; and distance awareness, including the Spectral-normalized Neural Gaussian Process and Deep Deterministic Uncertainty.

Next, we cover the latest advances in uncertainty quantification for pre-trained language models, including prompting language models to express their own uncertainty, interpreting the uncertainties of text classifiers built on large-scale language models, uncertainty estimation in text generation, calibration of language models, and calibration for in-context learning.

After that, we discuss typical application scenarios of uncertainty quantification in text classification, including in-domain calibration, cross-domain robustness, and novel class detection. Finally, we list popular performance metrics for evaluating the effectiveness of uncertainty quantification in text classification.

Practical hands-on examples and exercises let attendees experiment with different uncertainty quantification methods on real-world text classification datasets such as CLINC150; a few minimal sketches in that spirit follow below.
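To make the Bayesian deep learning part concrete, here is a minimal Monte-Carlo Dropout sketch with the standard entropy-based decomposition of predictive uncertainty into aleatoric and epistemic parts. The tiny bag-of-words classifier, its hyperparameters, and the 150-class output (mirroring CLINC150's intent count) are illustrative assumptions, not the tutorial's reference implementation.

```python
import torch
import torch.nn as nn

class TextClassifier(nn.Module):
    """Deliberately small bag-of-words classifier (illustrative only)."""
    def __init__(self, vocab_size=10000, embed_dim=128, num_classes=150):
        super().__init__()
        self.embedding = nn.EmbeddingBag(vocab_size, embed_dim)
        self.dropout = nn.Dropout(p=0.3)
        self.fc = nn.Linear(embed_dim, num_classes)

    def forward(self, token_ids, offsets):
        return self.fc(self.dropout(self.embedding(token_ids, offsets)))

@torch.no_grad()
def mc_dropout_predict(model, token_ids, offsets, n_samples=20):
    """Keep dropout active at test time and average the sampled softmaxes."""
    model.train()  # enables dropout; in practice, keep batch-norm frozen
    probs = torch.stack([
        torch.softmax(model(token_ids, offsets), dim=-1)
        for _ in range(n_samples)
    ])                                    # (n_samples, batch, classes)
    mean_probs = probs.mean(dim=0)
    # Total uncertainty: entropy of the averaged predictive distribution.
    total = -(mean_probs * mean_probs.clamp_min(1e-12).log()).sum(-1)
    # Aleatoric uncertainty: expected entropy of the individual samples.
    aleatoric = -(probs * probs.clamp_min(1e-12).log()).sum(-1).mean(0)
    # Epistemic uncertainty: the mutual-information gap between the two.
    return mean_probs, aleatoric, total - aleatoric
```

The same decomposition applies verbatim to a Deep Ensemble: replace the dropout samples with the softmax outputs of independently trained ensemble members.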

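Evidential deep learning replaces a point softmax with a Dirichlet distribution over class probabilities. Below is a minimal sketch in the style of Sensoy et al. (2018); the ReLU evidence transform is one common choice among several, and the function name is ours.

```python
import torch

def dirichlet_uncertainty(logits):
    """Read class probabilities and vacuity from Dirichlet evidence."""
    evidence = torch.relu(logits)               # non-negative evidence per class
    alpha = evidence + 1.0                      # Dirichlet concentration parameters
    strength = alpha.sum(dim=-1, keepdim=True)  # total evidence + num_classes
    probs = alpha / strength                    # expected class probabilities
    num_classes = logits.shape[-1]
    vacuity = num_classes / strength.squeeze(-1)  # high when evidence is scarce
    return probs, vacuity
```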
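For the calibration part, post-hoc temperature scaling is the usual starting point: a single scalar T is fit on held-out logits to minimize negative log-likelihood, leaving the model's predictions unchanged. A minimal PyTorch sketch, assuming logits and integer labels have already been collected on a validation set:

```python
import torch

def fit_temperature(logits, labels, max_iter=50):
    """Learn log(T) so that softmax(logits / T) is well calibrated."""
    log_t = torch.zeros(1, requires_grad=True)  # T = exp(log_t) stays positive
    optimizer = torch.optim.LBFGS([log_t], max_iter=max_iter)
    nll = torch.nn.CrossEntropyLoss()

    def closure():
        optimizer.zero_grad()
        loss = nll(logits / log_t.exp(), labels)
        loss.backward()
        return loss

    optimizer.step(closure)
    return log_t.exp().item()  # divide test-time logits by this temperature
```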
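Among the evaluation metrics, expected calibration error (ECE) is the most widely reported. A minimal NumPy sketch of the standard equal-width binned estimator (the bin count is a free choice; 15 is common):

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=15):
    """confidences: max predicted probability per example, shape (N,)
    correct: 1.0 if the top prediction was right else 0.0, shape (N,)"""
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece, n = 0.0, len(confidences)
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += (mask.sum() / n) * gap  # weight bins by their population
    return ece
```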
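Finally, for novel class detection, the maximum-softmax-probability baseline of Hendrycks and Gimpel remains a strong reference point, e.g., for separating CLINC150's in-scope intents from its out-of-scope queries. A minimal sketch, assuming predicted probability matrices for both splits are already available:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def ood_auroc(in_scope_probs, out_of_scope_probs):
    """AUROC of maximum softmax probability as an in-scope score."""
    msp_in = in_scope_probs.max(axis=1)       # high MSP -> looks in-scope
    msp_out = out_of_scope_probs.max(axis=1)
    scores = np.concatenate([msp_in, msp_out])
    labels = np.concatenate([np.ones_like(msp_in),     # 1 = in-scope
                             np.zeros_like(msp_out)])  # 0 = out-of-scope
    return roc_auc_score(labels, scores)
```

Any of the scores from the earlier sketches, such as epistemic uncertainty or Dirichlet vacuity, can be dropped into the same evaluation by negating them so that higher still means in-scope.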