Useful Confidence Measures: Beyond the Max Score

Yona, Gal; Feder, Amir; Laish, Itay

Computer Science > Machine Learning

arXiv:2210.14070 (cs)

[Submitted on 25 Oct 2022]

Title:Useful Confidence Measures: Beyond the Max Score

Authors:Gal Yona, Amir Feder, Itay Laish

View PDF

Abstract:An important component in deploying machine learning (ML) in safety-critic applications is having a reliable measure of confidence in the ML model's predictions. For a classifier $f$ producing a probability vector $f(x)$ over the candidate classes, the confidence is typically taken to be $\max_i f(x)_i$. This approach is potentially limited, as it disregards the rest of the probability vector. In this work, we derive several confidence measures that depend on information beyond the maximum score, such as margin-based and entropy-based measures, and empirically evaluate their usefulness, focusing on NLP tasks with distribution shifts and Transformer-based models. We show that when models are evaluated on the out-of-distribution data ``out of the box'', using only the maximum score to inform the confidence measure is highly suboptimal. In the post-processing regime (where the scores of $f$ can be improved using additional in-distribution held-out data), this remains true, albeit less significant. Overall, our results suggest that entropy-based confidence is a surprisingly useful measure.

Comments:	Short paper; appeared in the Workshop on Distribution Shifts @ NeurIPS 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2210.14070 [cs.LG]
	(or arXiv:2210.14070v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.14070

Submission history

From: Gal Yona [view email]
[v1] Tue, 25 Oct 2022 14:54:44 UTC (112 KB)

Computer Science > Machine Learning

Title:Useful Confidence Measures: Beyond the Max Score

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Useful Confidence Measures: Beyond the Max Score

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators