A Consistent and Differentiable Lp Canonical Calibration Error Estimator

Popordanoska, Teodora; Sayer, Raphael; Blaschko, Matthew B.

Abstract: Calibrated probabilistic classifiers are models whose predicted probabilities can directly be interpreted as uncertainty estimates. It has been shown recently that deep neural networks are poorly calibrated and tend to output overconfident predictions. As a remedy, we propose a low-bias, trainable calibration error estimator based on Dirichlet kernel density estimates, which asymptotically converges to the true $L_p$ calibration error. This novel estimator enables us to tackle the strongest notion of multiclass calibration, called canonical (or distribution) calibration, while other common calibration methods are tractable only for top-label and marginal calibration. The computational complexity of our estimator is $\mathcal{O}(n^2)$, the convergence rate is $\mathcal{O}(n^{-1/2})$, and it is unbiased up to $\mathcal{O}(n^{-2})$, achieved by a geometric series debiasing scheme. In practice, this means that the estimator can be applied to small subsets of data, enabling efficient estimation and mini-batch updates. The proposed method has a natural choice of kernel, and can be used to generate consistent estimates of other quantities based on conditional expectation, such as the sharpness of a probabilistic classifier. Empirical results validate the correctness of our estimator, and demonstrate its utility in canonical calibration error estimation and calibration error regularized risk minimization.

Comments:	To appear at NeurIPS 2022
Subjects:	Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2210.07810 [stat.ML]
	(or arXiv:2210.07810v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2210.07810
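
The abstract describes a calibration error estimator built from Dirichlet kernel density estimates with $\mathcal{O}(n^2)$ cost. The following is a minimal sketch of that idea, not the authors' implementation. It assumes a leave-one-out kernel-weighted average of the one-hot labels as the estimate of the conditional expectation of the label given the prediction, and it assumes each sample's kernel is a Dirichlet density with parameters `prediction / bandwidth + 1`. It returns the mean $p$-th power of the $L_p$ distance between that estimate and the prediction. The geometric series debiasing scheme mentioned in the abstract is omitted. The names `dirichlet_log_pdf`, `kde_calibration_error`, `bandwidth`, and `eps` are hypothetical and chosen only for illustration.

```python
import math


def dirichlet_log_pdf(x, alpha):
    """Log-density of Dirichlet(alpha) evaluated at a point x on the simplex."""
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(v) for a, v in zip(alpha, x))


def kde_calibration_error(probs, labels, bandwidth=0.1, p=2, eps=1e-12):
    """Plug-in estimate of the p-th power of the L_p canonical calibration error.

    probs  : list of n predicted probability vectors (each sums to 1)
    labels : list of n integer class labels in range(k)
    Assumption: no debiasing is applied; this is the plain ratio estimator.
    """
    n, k = len(probs), len(probs[0])
    if n < 2:
        raise ValueError("need at least two samples for leave-one-out weights")

    # Clip exact zeros so log() is defined, then renormalise onto the simplex.
    clipped = []
    for q in probs:
        c = [max(v, eps) for v in q]
        s = sum(c)
        clipped.append([v / s for v in c])

    # Assumed kernel: one Dirichlet density per sample, centred on its prediction.
    alphas = [[v / bandwidth + 1.0 for v in q] for q in clipped]

    total = 0.0
    for j in range(n):
        # Pairwise kernel evaluations, excluding sample j itself: the O(n^2) step.
        others = [i for i in range(n) if i != j]
        log_w = [dirichlet_log_pdf(clipped[j], alphas[i]) for i in others]
        top = max(log_w)  # subtract the max before exponentiating for stability
        w = [math.exp(lw - top) for lw in log_w]
        z = sum(w)

        # Kernel-weighted average of one-hot labels approximates E[y | f(x_j)].
        cond = [0.0] * k
        for wi, i in zip(w, others):
            cond[labels[i]] += wi / z

        total += sum(abs(cond[c] - clipped[j][c]) ** p for c in range(k))
    return total / n
```

Calling `kde_calibration_error(probs, labels)` on a held-out batch gives a single non-negative number, with smaller values indicating predictions that agree more closely with the kernel-smoothed label frequencies. The pure-Python loops keep the sketch dependency-free; a trainable version would express the same computation in an automatic differentiation framework.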
