ThreshNet: Segmentation Refinement Inspired by Region-Specific Thresholding

Savinay Nagendra; Chaopeng Shen; Daniel Kifer

We present ThreshNet, a post-processing method to refine the output of neural networks designed for binary segmentation tasks. ThreshNet uses the confidence map produced by a base network along with global and local patch information to significantly improve the performance of even state-of-the-art methods. Binary segmentation models typically convert confidence maps into predictions by thresholding the confidence scores at 0.5 (or some other fixed number). However, we observe that the best threshold is image-dependent and often even region-specific -- different parts of the image benefit from using different thresholds. Thus, ThreshNet takes a trained segmentation model and learns to correct its predictions by using a memory-efficient post-processing architecture that incorporates region-specific thresholds as part of the training mechanism. Our experiments show that ThreshNet consistently improves over the current state-of-the-art methods in binary segmentation and saliency detection, typically by 3 to 5% in mIoU and mBA.
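
The difference between a fixed threshold and region-specific thresholds can be illustrated with a minimal NumPy sketch. This is not ThreshNet itself: ThreshNet learns its correction, whereas the per-patch thresholds, the patch size, and the function names below are hand-picked, hypothetical values chosen only to show the idea.

```python
import numpy as np

def threshold_global(conf, t=0.5):
    """Binarize a confidence map with a single fixed threshold."""
    return conf >= t

def threshold_by_region(conf, thresholds, patch):
    """Binarize with one threshold per non-overlapping patch x patch region.

    `thresholds` has shape (H // patch, W // patch). In this sketch the
    values are supplied by hand (an assumption), not learned.
    """
    # Expand each per-patch threshold so it covers its block of pixels.
    t_map = np.kron(thresholds, np.ones((patch, patch)))
    return conf >= t_map

# Toy 4x4 confidence map, as a base segmentation network might produce.
conf = np.array([
    [0.9, 0.8, 0.45, 0.40],
    [0.7, 0.6, 0.35, 0.45],
    [0.2, 0.1, 0.55, 0.60],
    [0.3, 0.2, 0.52, 0.58],
])

# Hypothetical thresholds, one per 2x2 patch: lower where the base model
# is under-confident, higher where it is over-confident.
thresholds = np.array([[0.5, 0.3],
                       [0.5, 0.7]])

global_mask = threshold_global(conf)
region_mask = threshold_by_region(conf, thresholds, patch=2)
```

With these toy values, the top-right patch is foreground only under the region-specific thresholds, and the bottom-right patch is foreground only under the fixed 0.5 threshold, so the two masks disagree exactly where a single global threshold fits the region poorly.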

updated: Sat Nov 12 2022 03:49:22 GMT+0000 (UTC)

published: Sat Nov 12 2022 03:49:22 GMT+0000 (UTC)

arXiv
