Identifying Label Errors in Object Detection Datasets by Loss Inspection

Marius Schubert; Tobias Riedlinger; Karsten Kahl; Daniel Kröll; Sebastian Schoenen; Siniša Šegvić; Matthias Rottmann

損失検査によるオブジェクト検出データセットのラベルエラーの識別

監視対象オブジェクト検出用のデータセットのラベル付けは、単調で時間のかかる作業です。アノテーション中にエラーが簡単に発生し、レビュー中に見過ごされる可能性があるため、ベンチマークが不正確になり、ノイズの多いラベルでトレーニングされたディープニューラルネットワークのパフォーマンスが低下します。この作業では、オブジェクト検出データセットに対するラベルエラー検出方法のベンチマークと、ラベルエラー検出方法および多数のベースラインを初めて紹介します。適切にラベル付けされたオブジェクト検出データセットのトレーニングセットとテストセットで、4 種類のランダムに導入されたラベルエラーをシミュレートします。ラベルエラー検出方法では、2 段階のオブジェクト検出器が与えられると仮定し、両方の段階の分類損失と回帰損失の合計を考慮します。損失は、後者を検出することを目的として、シミュレートされたラベルエラーを含む予測とノイズの多いラベルに関して計算されます。私たちの方法を 3 つのベースラインと比較します: 深層学習のない単純なもの、オブジェクト検出器のスコア、および分類のソフトマックス分布のエントロピーです。私たちはすべてのベースラインよりも優れており、検討した方法の中で、4 つのタイプすべてのラベルエラーを効率的に検出できるのは私たちの方法だけであることを示しています。さらに、a) オブジェクト検出で一般的に使用されるテストデータセット、および b) 独自のデータセットで実際のラベルエラーを検出します。どちらの場合も、低い偽陽性率を達成しています。つまり、メソッドからの 200 の提案を考慮すると、a) では最大 71.5%、b) では 97% の精度でラベルエラーを検出します。

Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as well as a label error detection method and a number of baselines. We simulate four different types of randomly introduced label errors on train and test sets of well-labeled object detection datasets. For our label error detection method we assume a two-stage object detector to be given and consider the sum of both stages' classification and regression losses. The losses are computed with respect to the predictions and the noisy labels including simulated label errors, aiming at detecting the latter. We compare our method to three baselines: a naive one without deep learning, the object detector's score and the entropy of the classification softmax distribution. We outperform all baselines and demonstrate that among the considered methods, ours is the only one that detects label errors of all four types efficiently. Furthermore, we detect real label errors a) on commonly used test datasets in object detection and b) on a proprietary dataset. In both cases we achieve low false positives rates, i.e., when considering 200 proposals from our method, we detect label errors with a precision for a) of up to 71.5% and for b) with 97%.

updated: Mon Mar 13 2023 10:54:52 GMT+0000 (UTC)

published: Mon Mar 13 2023 10:54:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト