A Consistent and Differentiable Lp Canonical Calibration Error Estimator

Teodora Popordanoska; Raphael Sayer; Matthew B. Blaschko

Calibrated probabilistic classifiers are models whose predicted probabilities can directly be interpreted as uncertainty estimates. It has been shown recently that deep neural networks are poorly calibrated and tend to output overconfident predictions. As a remedy, we propose a low-bias, trainable calibration error estimator based on Dirichlet kernel density estimates, which asymptotically converges to the true L_p calibration error. This novel estimator enables us to tackle the strongest notion of multiclass calibration, called canonical (or distribution) calibration, while other common calibration methods are tractable only for top-label and marginal calibration. The computational complexity of our estimator is O(n^2), the convergence rate is O(n^(-1/2)), and it is unbiased up to O(n^(-2)), achieved by a geometric series debiasing scheme. In practice, this means that the estimator can be applied to small subsets of data, enabling efficient estimation and mini-batch updates. The proposed method has a natural choice of kernel, and can be used to generate consistent estimates of other quantities based on conditional expectation, such as the sharpness of a probabilistic classifier. Empirical results validate the correctness of our estimator, and demonstrate its utility in canonical calibration error estimation and calibration-error-regularized risk minimization.
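
To make the idea in the abstract concrete, the following is a minimal sketch of a plug-in L_p canonical calibration error estimate built from Dirichlet kernels. It is not the authors' implementation: the function names, the kernel parameterization alpha = f(x_i)/h + 1, the bandwidth value, and the synthetic data are all assumptions made for illustration, and the geometric series debiasing scheme mentioned above is omitted. The estimate averages the p-th power of the L_p distance between each prediction and a leave-one-out kernel estimate of the conditional label mean, which costs O(n^2) kernel evaluations.

```python
import math
import numpy as np

_gammaln = np.vectorize(math.lgamma)  # elementwise log-gamma without SciPy

def dirichlet_log_kernel(points, centers, bandwidth):
    """Entry [j, i] is the log Dirichlet density with parameters
    alpha_i = centers[i] / bandwidth + 1 (assumed parameterization),
    evaluated at points[j]."""
    alpha = centers / bandwidth + 1.0
    log_norm = _gammaln(alpha.sum(axis=1)) - _gammaln(alpha).sum(axis=1)
    return log_norm[None, :] + np.log(points) @ (alpha - 1.0).T

def lp_canonical_ce(probs, labels_onehot, bandwidth=0.1, p=2):
    """Plug-in estimate of E[ ||E[y | f(x)] - f(x)||_p^p ] (no debiasing)."""
    probs = np.clip(probs, 1e-12, 1.0)          # keep log() finite on the boundary
    log_k = dirichlet_log_kernel(probs, probs, bandwidth)
    np.fill_diagonal(log_k, -np.inf)            # leave-one-out: drop the self term
    log_k -= log_k.max(axis=1, keepdims=True)   # stabilize before exponentiating
    weights = np.exp(log_k)
    weights /= weights.sum(axis=1, keepdims=True)
    cond_mean = weights @ labels_onehot         # kernel estimate of E[y | f(x_j)]
    return float(np.mean(np.sum(np.abs(cond_mean - probs) ** p, axis=1)))

# Synthetic check (hypothetical data): labels are drawn from true_probs, so
# predicting true_probs is calibrated, while sharpened predictions are overconfident.
rng = np.random.default_rng(0)
n, k = 400, 3
true_probs = rng.dirichlet(np.ones(k), size=n)
labels = (rng.random((n, 1)) < np.cumsum(true_probs, axis=1)).argmax(axis=1)
onehot = np.eye(k)[labels]

q_cal = true_probs
q_over = true_probs ** 4
q_over /= q_over.sum(axis=1, keepdims=True)

ce_cal = lp_canonical_ce(q_cal, onehot)
ce_over = lp_canonical_ce(q_over, onehot)
print(f"calibrated: {ce_cal:.4f}  overconfident: {ce_over:.4f}")
```

Because every operation is a smooth function of the predicted probabilities, the same computation written in an autodiff framework could in principle be added to a training loss on each mini-batch, which is the kind of use the abstract describes as calibration error regularized risk minimization.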

updated: Thu Oct 13 2022 15:11:11 GMT+0000 (UTC)

published: Thu Oct 13 2022 15:11:11 GMT+0000 (UTC)

arXiv
