LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images

Xiaoxiang Han; Yiman Liu; Gang Liu; Yuanjie Lin; Qiaohong Liu

LOANet: UAV 航空リモートセンシング画像から建物や道路を抽出するためのオブジェクトアテンションを使用した軽量ネットワーク

深層学習によって無人航空機 (UAV) のリモートセンシング画像から建物や道路を抽出するセマンティックセグメンテーションは、測量や地図作成の分野において、従来の手動セグメンテーションよりも効率的で便利な方法になります。モデルを軽量化し、モデルの精度を向上させるために、UAV 航空リモートセンシング画像からの建物および道路に対するオブジェクトアテンションを使用した軽量ネットワーク (LOANet) が提案されています。提案されたネットワークは、エンコーダとして軽量高密度接続ネットワーク (LDCNet) が開発されたエンコーダデコーダアーキテクチャを採用しています。デコーダ部分では、Atrous Spatial Pyramid Pooling モジュール (ASPP) と Object tention Module (OAM) で構成されるデュアルマルチスケールコンテキストモジュールが、UAV リモートセンシング画像の特徴マップからより多くのコンテキスト情報をキャプチャするように設計されています。 ASPP と OAM の間では、ASPP から抽出されたマルチスケール機能を融合するために、機能ピラミッドネットワーク (FPN) モジュールが使用されます。 2431 個のトレーニングセット、945 個の検証セット、および 475 個のテストセットを含む、UAV によって撮影されたリモートセンシング画像のプライベートデータセットが構築されます。提案された基本モデルは、わずか 140 万のパラメーターと 5.48G の浮動小数点演算 (FLOP) を使用して、このデータセットで良好なパフォーマンスを示し、優れた平均交差オーバーユニオン (mIoU) を達成しています。提案された基本モデルと大規模モデルの有効性をさらに検証するために、公開されている LoveDA および CITY-OSM データセットに対するさらなる実験が実施され、優れた mIoU 結果が達成されました。すべてのコードは https://github.com/GtLinyer/LOANet で入手できます。

Semantic segmentation for extracting buildings and roads from uncrewed aerial vehicle (UAV) remote sensing images by deep learning becomes a more efficient and convenient method than traditional manual segmentation in surveying and mapping fields. In order to make the model lightweight and improve the model accuracy, a Lightweight Network Using Object Attention (LOANet) for Buildings and Roads from UAV Aerial Remote Sensing Images is proposed. The proposed network adopts an encoder-decoder architecture in which a Lightweight Densely Connected Network (LDCNet) is developed as the encoder. In the decoder part, the dual multi-scale context modules which consist of the Atrous Spatial Pyramid Pooling module (ASPP) and the Object Attention Module (OAM) are designed to capture more context information from feature maps of UAV remote sensing images. Between ASPP and OAM, a Feature Pyramid Network (FPN) module is used to fuse multi-scale features extracted from ASPP. A private dataset of remote sensing images taken by UAV which contains 2431 training sets, 945 validation sets, and 475 test sets is constructed. The proposed basic model performs well on this dataset, with only 1.4M parameters and 5.48G floating point operations (FLOPs), achieving excellent mean Intersection-over-Union (mIoU). Further experiments on the publicly available LoveDA and CITY-OSM datasets have been conducted to further validate the effectiveness of the proposed basic and large model, and outstanding mIoU results have been achieved. All codes are available on https://github.com/GtLinyer/LOANet.

updated: Thu Jul 06 2023 12:06:26 GMT+0000 (UTC)

published: Fri Dec 16 2022 14:02:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト