Squeeze-and-Attention Networks for Semantic Segmentation

Zilong Zhong; Zhong Qiu Lin; Rene Bidart; Xiaodan Hu; Ibrahim Ben Daya; Zhifeng Li; Wei-Shi Zheng; Jonathan Li; Alexander Wong

セマンティックセグメンテーションのためのスクイーズアンドアテンションネットワーク

最近のアテンションメカニズムのセグメンテーションネットワークへの統合により、より有益な機能に大きな重点が置かれることで、表現能力が向上しています。ただし、これらのアテンションメカニズムは、セマンティックセグメンテーションの暗黙的なサブタスクを無視し、畳み込みカーネルのグリッド構造によって制約されます。この論文では、効果的なスクイーズアンドアテンション（SA）モジュールを活用して、セグメンテーションの2つの特徴を説明する新しいスクイーズアンドアテンションネットワーク（SANet）アーキテクチャを提案します。i）ピクセルグループアテンション、ii）ピクセル単位の予測。具体的には、提案されたSAモジュールは、「注意」畳み込みチャネルを導入することにより、従来の畳み込みにピクセルグループの注意を課し、空間チャネル相互依存性を効率的に考慮します。最終的なセグメンテーション結果は、SANetの4つの階層ステージからの出力をマージして、マルチスケールコンテキストを統合し、ピクセル単位の予測を強化することで生成されます。 2つの困難な公開データセットの実証実験により、提案されたSANetsの有効性が検証され、PASCAL VOCで83.2％mIoU（COCO事前トレーニングなし）およびPASCAL Contextで54.4％の最先端mIoUが達成されました。

The recent integration of attention mechanisms into segmentation networks improves their representational capabilities through a great emphasis on more informative features. However, these attention mechanisms ignore an implicit sub-task of semantic segmentation and are constrained by the grid structure of convolution kernels. In this paper, we propose a novel squeeze-and-attention network (SANet) architecture that leverages an effective squeeze-and-attention (SA) module to account for two distinctive characteristics of segmentation: i) pixel-group attention, and ii) pixel-wise prediction. Specifically, the proposed SA modules impose pixel-group attention on conventional convolution by introducing an 'attention' convolutional channel, thus taking into account spatial-channel inter-dependencies in an efficient manner. The final segmentation results are produced by merging outputs from four hierarchical stages of a SANet to integrate multi-scale contexts for obtaining an enhanced pixel-wise prediction. Empirical experiments on two challenging public datasets validate the effectiveness of the proposed SANets, which achieves 83.2% mIoU (without COCO pre-training) on PASCAL VOC and a state-of-the-art mIoU of 54.4% on PASCAL Context.

updated: Wed Apr 01 2020 05:50:33 GMT+0000 (UTC)

published: Sun Sep 08 2019 08:21:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト