Striking the Right Balance: Recall Loss for Semantic Segmentation

Junjiao Tian; Niluthpol Mithun; Zach Seymour; Han-Pang Chiu; Zsolt Kira

Class imbalance is a fundamental problem in computer vision applications such as semantic segmentation. Specifically, uneven class distributions in a training dataset often result in unsatisfactory performance on under-represented classes. Many works have proposed weighting the standard cross entropy loss function with pre-computed weights based on class statistics, such as the number of samples and class margins. These methods have two major drawbacks: 1) constantly up-weighting minority classes can introduce excessive false positives in semantic segmentation; 2) a minority class is not necessarily a hard class. The consequence is low precision due to excessive false positives. To address this, we propose a hard-class mining loss that reshapes the vanilla cross entropy loss so that it weights the loss for each class dynamically, based on instantaneous recall performance. We show that the novel recall loss changes gradually between the standard cross entropy loss and the inverse frequency weighted loss. Recall loss also leads to improved mean accuracy while offering competitive mean Intersection over Union (IoU) performance. On the Synthia dataset, recall loss achieves a 9% relative improvement in mean accuracy with competitive mean IoU using DeepLab-ResNet18, compared to the cross entropy loss. Code is available at https://github.com/PotatoTian/recall-semseg.
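The abstract does not spell out the weighting formula, so the following is only a minimal sketch of the idea of weighting each class by its instantaneous recall. It assumes the per-class weight is 1 - recall, measured on the current batch; the function name `recall_weighted_cross_entropy` and its parameters are hypothetical and are not taken from the authors' repository.

```python
import math


def recall_weighted_cross_entropy(probs, labels, num_classes, eps=1e-8):
    """Mean cross entropy with each pixel weighted by (1 - recall) of its true class.

    ASSUMPTION: the weight is 1 - batch recall; the abstract only says the
    weights depend on "instantaneous recall performance".

    probs:  list of per-pixel probability vectors (each sums to 1).
    labels: list of ground-truth class indices, same length as probs.
    """
    true_pos = [0] * num_classes  # pixels of class c that were predicted as c
    support = [0] * num_classes   # pixels whose ground truth is class c
    for p, y in zip(probs, labels):
        pred = max(range(num_classes), key=lambda c: p[c])
        support[y] += 1
        if pred == y:
            true_pos[y] += 1

    # A class absent from the batch contributes no pixels, so its weight is unused.
    weights = [
        1.0 - true_pos[c] / support[c] if support[c] > 0 else 0.0
        for c in range(num_classes)
    ]

    # Standard cross entropy per pixel, scaled by the weight of its true class.
    total = sum(weights[y] * -math.log(p[y] + eps) for p, y in zip(probs, labels))
    return total / len(labels), weights


# Toy batch of four "pixels" and two classes.
probs = [[0.9, 0.1], [0.8, 0.2], [0.6, 0.4], [0.3, 0.7]]
labels = [0, 0, 1, 1]
loss, weights = recall_weighted_cross_entropy(probs, labels, num_classes=2)
print(weights, loss)
```

In this toy batch, class 0 is fully recalled and receives weight 0, while class 1 is recalled half the time and receives weight 0.5, so only the harder class contributes to the loss. Under this assumed weighting, a class that is already well recalled is not up-weighted, regardless of how rare it is.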

updated: Mon Jun 28 2021 18:02:03 GMT+0000 (UTC)

published: Mon Jun 28 2021 18:02:03 GMT+0000 (UTC)

arXiv
