Class-Distribution-Aware Calibration for Long-Tailed Visual Recognition

Mobarakol Islam; Lalithkumar Seenivasan; Hongliang Ren; Ben Glocker

ロングテール視覚認識のためのクラス分布対応キャリブレーション

印象的な精度にもかかわらず、ディープニューラルネットワークはしばしば誤って調整され、過度に自信のある予測をする傾向があります。温度スケーリング（TS）やラベル平滑化（LS）などの最近の手法は、それぞれスカラー係数を使用してロジットとハードラベルを平滑化することにより、適切に調整されたモデルを取得する効果を示しています。ただし、均一なTSまたはLS係数の使用は、モデルが高周波クラスに対して過度に信頼できる確率を生成するロングテールデータセットでトレーニングされたモデルのキャリブレーションには最適でない場合があります。この研究では、ロングテール分布のコンテキストでモデルのキャリブレーションにクラス頻度情報を組み込むことにより、クラス分布対応TS（CDA-TS）およびLS（CDA-LS）を提案します。 CDA-TSでは、スカラー温度値は、自信過剰を補うためにクラス周波数でエンコードされたCDA温度ベクトルに置き換えられます。同様に、CDA-LSはベクトル平滑化係数を使用し、対応するクラス分布に従ってハードラベルを平坦化します。また、CDA最適温度ベクトルを蒸留損失と統合します。これにより、自己蒸留（SD）の誤校正が減少します。クラス分布を意識したTSとLSは、不均衡なデータ分布に対応でき、キャリブレーションエラーと予測精度の両方で優れたパフォーマンスを発揮できることを経験的に示しています。また、データセットのバランスが極端に悪いSDは、キャリブレーションパフォーマンスの点で効果が低いこともわかりました。コードはhttps://github.com/mobarakol/Class-Distribution-Aware-TS-LSで入手できます。

Despite impressive accuracy, deep neural networks are often miscalibrated and tend to overly confident predictions. Recent techniques like temperature scaling (TS) and label smoothing (LS) show effectiveness in obtaining a well-calibrated model by smoothing logits and hard labels with scalar factors, respectively. However, the use of uniform TS or LS factor may not be optimal for calibrating models trained on a long-tailed dataset where the model produces overly confident probabilities for high-frequency classes. In this study, we propose class-distribution-aware TS (CDA-TS) and LS (CDA-LS) by incorporating class frequency information in model calibration in the context of long-tailed distribution. In CDA-TS, the scalar temperature value is replaced with the CDA temperature vector encoded with class frequency to compensate for the over-confidence. Similarly, CDA-LS uses a vector smoothing factor and flattens the hard labels according to their corresponding class distribution. We also integrate CDA optimal temperature vector with distillation loss, which reduces miscalibration in self-distillation (SD). We empirically show that class-distribution-aware TS and LS can accommodate the imbalanced data distribution yielding superior performance in both calibration error and predictive accuracy. We also observe that SD with an extremely imbalanced dataset is less effective in terms of calibration performance. Code is available in https://github.com/mobarakol/Class-Distribution-Aware-TS-LS.

updated: Sat Sep 11 2021 11:46:56 GMT+0000 (UTC)

published: Sat Sep 11 2021 11:46:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト