Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection

Muhammad Akhtar Munir; Muhammad Haris Khan; Salman Khan; Fahad Shahbaz Khan

精度と信頼性の橋渡し: 物体検出を調整するための学習時間の損失

ディープニューラルネットワーク (DNN) は、視覚に基づくいくつかの問題で驚異的な進歩を遂げました。高い予測精度を示しているにもかかわらず、最近、いくつかの研究では、自信過剰な予測を提供する傾向があり、キャリブレーションが不十分であることが明らかになりました。 DNN のミスキャリブレーションに対処する作業の大部分は、分類の範囲に該当し、ドメイン内の予測のみを考慮します。ただし、多くのビジョンベースの安全性が重要なアプリケーションの中心である DNN ベースのオブジェクト検出モデルのキャリブレーションの研究はほとんど、またはまったく進んでいません。このホワイトペーパーでは、トレーニング時のキャリブレーション方法に着想を得て、バウンディングボックスのクラス信頼度を予測の正確さ (つまり、精度) に合わせることを明示的に目的とする、新しい補助損失定式化を提案します。損失の元の定式化は、ミニバッチの真陽性と偽陽性の数に依存するため、他のアプリケーション固有の損失関数でトレーニング中に使用できる損失の微分可能なプロキシを開発します。 MS-COCO、Cityscapes、Sim10k、BDD100k を含む 6 つのベンチマークデータセットを使用して、困難なドメイン内およびドメイン外のシナリオで広範な実験を行います。私たちの結果は、トレーニング時間の損失が、ドメイン内とドメイン外の両方のシナリオでのキャリブレーションエラーの削減において、強力なキャリブレーションベースラインを上回っていることを明らかにしています。ソースコードと事前トレーニング済みモデルは、https://github.com/akhtarvision/bpc_calibration で入手できます。

Deep neural networks (DNNs) have enabled astounding progress in several vision-based problems. Despite showing high predictive accuracy, recently, several works have revealed that they tend to provide overconfident predictions and thus are poorly calibrated. The majority of the works addressing the miscalibration of DNNs fall under the scope of classification and consider only in-domain predictions. However, there is little to no progress in studying the calibration of DNN-based object detection models, which are central to many vision-based safety-critical applications. In this paper, inspired by the train-time calibration methods, we propose a novel auxiliary loss formulation that explicitly aims to align the class confidence of bounding boxes with the accurateness of predictions (i.e. precision). Since the original formulation of our loss depends on the counts of true positives and false positives in a minibatch, we develop a differentiable proxy of our loss that can be used during training with other application-specific loss functions. We perform extensive experiments on challenging in-domain and out-domain scenarios with six benchmark datasets including MS-COCO, Cityscapes, Sim10k, and BDD100k. Our results reveal that our train-time loss surpasses strong calibration baselines in reducing calibration error for both in and out-domain scenarios. Our source code and pre-trained models are available at https://github.com/akhtarvision/bpc_calibration

updated: Sat Mar 25 2023 08:56:21 GMT+0000 (UTC)

published: Sat Mar 25 2023 08:56:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト