Scale-aware Neural Network for Semantic Segmentation of Multi-resolution Remote Sensing Images

Libo Wang; Ce Zhang; Rui Li; Chenxi Duan; Xiaoliang Meng; Peter M. Atkinson

多重解像度リモートセンシング画像のセマンティックセグメンテーションのためのスケール認識ニューラルネットワーク

ピクセルレベルで特定のカテゴリを持つ地理空間オブジェクトを割り当てることは、リモートセンシング画像分析の基本的なタスクです。センサー技術の急速な発展に加えて、リモートセンシングされた画像は、さまざまなスケールで表される情報コンテンツを使用して、複数の空間解像度（MSR）でキャプチャできます。これらのMSR画像から情報を抽出することは、機能の表現と特性評価を強化するための大きなチャンスを表しています。ただし、MSR画像には2つの重大な問題があります。1）ジオオブジェクトのスケール変動の増加と、2）粗い空間解像度での詳細情報の損失です。これらのギャップを埋めるために、この論文では、MSRリモートセンシング画像のセマンティックセグメンテーションのための新しいスケール認識ニューラルネットワーク（SaNet）を提案します。 SaNetは、高密度接続機能ネットワーク（DCFFM）モジュールを展開して、高品質のマルチスケールコンテキストをキャプチャします。これにより、スケールの変動が適切に処理され、大小両方のオブジェクトのセグメンテーションの品質が向上します。空間機能再キャリブレーション（SFRM）モジュールがさらにネットワークに組み込まれ、情報損失の悪影響が除去された、強化された空間関係を持つインタクトなセマンティックコンテンツを学習します。 DCFFMとSFRMの組み合わせにより、SaNetは、既存のマルチスケール機能表現よりも優れたスケール認識機能表現を学習できます。 3つのセマンティックセグメンテーションデータセットに関する広範な実験により、クロスレゾリューションセグメンテーションにおける提案されたSaNetの有効性が実証されました。

Assigning geospatial objects with specific categories at the pixel level is a fundamental task in remote sensing image analysis. Along with rapid development in sensor technologies, remotely sensed images can be captured at multiple spatial resolutions (MSR) with information content manifested at different scales. Extracting information from these MSR images represents huge opportunities for enhanced feature representation and characterisation. However, MSR images suffer from two critical issues: 1) increased scale variation of geo-objects and 2) loss of detailed information at coarse spatial resolutions. To bridge these gaps, in this paper, we propose a novel scale-aware neural network (SaNet) for semantic segmentation of MSR remotely sensed imagery. SaNet deploys a densely connected feature network (DCFFM) module to capture high-quality multi-scale context, such that the scale variation is handled properly and the quality of segmentation is increased for both large and small objects. A spatial feature recalibration (SFRM) module is further incorporated into the network to learn intact semantic content with enhanced spatial relationships, where the negative effects of information loss are removed. The combination of DCFFM and SFRM allows SaNet to learn scale-aware feature representation, which outperforms the existing multi-scale feature representation. Extensive experiments on three semantic segmentation datasets demonstrated the effectiveness of the proposed SaNet in cross-resolution segmentation.

updated: Thu Nov 04 2021 07:12:49 GMT+0000 (UTC)

published: Sun Mar 14 2021 14:19:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト