DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Xing Shen; Jirui Yang; Chunbo Wei; Bing Deng; Jianqiang Huang; Xiansheng Hua; Xiaoliang Cheng; Kewei Liang

DCTマスク：インスタンスセグメンテーションのための離散コサイン変換マスク表現

バイナリグリッドマスク表現は、インスタンスのセグメンテーションで広く使用されています。代表的なインスタンス化は、28×28バイナリグリッド上のマスクを予測するマスクR-CNNです。一般に、低解像度のグリッドでは詳細をキャプチャするのに十分ではありませんが、高解像度のグリッドではトレーニングの複雑さが劇的に増加します。本論文では、離散コサイン変換（DCT）を適用して、高解像度のバイナリグリッドマスクをコンパクトなベクトルにエンコードすることにより、新しいマスク表現を提案します。 DCT-Maskと呼ばれる私たちの方法は、ほとんどのピクセルベースのインスタンスセグメンテーション方法に簡単に統合できます。 DCT-Maskは、ベルやホイッスルがなくても、さまざまなフレームワーク、バックボーン、データセット、およびトレーニングスケジュールで大幅な向上をもたらします。前処理や事前トレーニングは不要で、走行速度への悪影響はほとんどありません。特に、より高品質のアノテーションとより複雑なバックボーンの場合、私たちの方法は大幅に改善されています。さらに、マスク表現の品質の観点から、メソッドのパフォーマンスを分析します。 DCT-Maskがうまく機能する主な理由は、複雑さが低く、高品質のマスク表現が得られるためです。コードが利用可能になります。

Binary grid mask representation is broadly used in instance segmentation. A representative instantiation is Mask R-CNN which predicts masks on a 28×28 binary grid. Generally, a low-resolution grid is not sufficient to capture the details, while a high-resolution grid dramatically increases the training complexity. In this paper, we propose a new mask representation by applying the discrete cosine transform(DCT) to encode the high-resolution binary grid mask into a compact vector. Our method, termed DCT-Mask, could be easily integrated into most pixel-based instance segmentation methods. Without any bells and whistles, DCT-Mask yields significant gains on different frameworks, backbones, datasets, and training schedules. It does not require any pre-processing or pre-training, and almost no harm to the running speed. Especially, for higher-quality annotations and more complex backbones, our method has a greater improvement. Moreover, we analyze the performance of our method from the perspective of the quality of mask representation. The main reason why DCT-Mask works well is that it obtains a high-quality mask representation with low complexity. Code will be made available.

updated: Wed Apr 14 2021 11:46:21 GMT+0000 (UTC)

published: Thu Nov 19 2020 15:00:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト