LEDCNet: A Lightweight and Efficient Semantic Segmentation Algorithm Using Dual Context Module for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images

Xiaoxiang Han; Yiman Liu; Gang Liu; Qiaohong Liu

LEDCNet: UAV 空中リモートセンシング画像から建物と道路を抽出するためのデュアルコンテキストモジュールを使用した、軽量で効率的なセマンティックセグメンテーションアルゴリズム

無人航空機 (UAV) のリモートセンシング画像からディープラーニングによって建物や道路を抽出するセマンティックセグメンテーションは、測量およびマッピング分野での従来の手動セグメンテーションよりも効率的で便利な方法になります。モデルを軽量化し、モデルの精度を向上させるために、UAV 空中リモートセンシング画像からの建物と道路用のデュアルコンテキストモジュール (LEDCNet) を使用して実装された軽量で効率的なネットワークが提案されています。提案されたネットワークは、Lightweight Densely Connected Network (LDCNet) がエンコーダとして開発されたエンコーダ/デコーダアーキテクチャを採用しています。デコーダー部分では、Atrous Spatial Pyramid Pooling モジュール (ASPP) と Object Contextual Representation モジュール (OCR) で構成されるデュアルマルチスケールコンテキストモジュールは、UAV リモートセンシング画像の特徴マップからより多くのコンテキスト情報を取得するように設計されています。 ASPP と OCR の間には、Feature Pyramid Network (FPN) モジュールが使用され、ASPP から抽出されたマルチスケールフィーチャが融合されます。 2431 のトレーニングセット、945 の検証セット、および 475 のテストセットを含む、UAV によって取得されたリモートセンシング画像のプライベートデータセットが構築されます。提案されたモデルは、140 万個のパラメーターと 5.48G の浮動小数点演算 (FLOP) のみで、このデータセットでうまく機能し、71.12% の平均交差対結合比 (mIoU) を達成します。公開された LoveDA データセットと CITY-OSM データセットでのより広範な実験により、提案されたモデルの有効性がさらに検証され、mIoU でそれぞれ 65.27% と 74.39% という優れた結果が得られました。ソースコードは https://github.com/GtLinyer/LEDCNet で公開されます。

Semantic segmentation for extracting buildings and roads, from unmanned aerial vehicle (UAV) remote sensing images by deep learning becomes a more efficient and convenient method than traditional manual segmentation in surveying and mapping field. In order to make the model lightweight and improve the model accuracy, A Lightweight and Efficient Network implemented using Dual Context modules (LEDCNet) for Buildings and Roads from UAV Aerial Remote Sensing Images is proposed. The proposed network adopts an encoder-decoder architecture in which a Lightweight Densely Connected Network (LDCNet) is developed as the encoder. In the decoder part, the dual multi-scale context modules which consist of the Atrous Spatial Pyramid Pooling module (ASPP) and the Object Contextual Representation module (OCR) are designed to capture more context information from feature maps of UAV remote sensing images. Between ASPP and OCR, a Feature Pyramid Network (FPN) module is used to and fuse multi-scale features extracting from ASPP. A private dataset of remote sensing images taken by UAV which contains 2431 training sets, 945 validation sets, and 475 test sets is constructed. The proposed model performs well on this dataset, with only 1.4M parameters and 5.48G floating-point operations (FLOPs), achieving an mean intersection-over-union ratio (mIoU) of 71.12%. More extensive experiments on the public LoveDA dataset and CITY-OSM dataset to further verify the effectiveness of the proposed model with excellent results on mIoU of 65.27% and 74.39%, respectively. The source code will be made available on https://github.com/GtLinyer/LEDCNet .

updated: Sun Feb 19 2023 15:47:10 GMT+0000 (UTC)

published: Fri Dec 16 2022 14:02:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト