Building and Road Segmentation Using EffUNet and Transfer Learning Approach

Sahil Gangurde

EffUNet と転移学習アプローチを使用した建物と道路のセグメンテーション

都市においては、水道、鉄道、送電線、建物、道路などの都市物体に関する情報が都市計画に必要です。特に、政策立案者が影響力のある決定を下すには、これらの物体の広がり、場所、容量に関する情報が必要です。この論文の目的は、衛星や無人航空機によって捕捉された航空画像から建物や道路をセグメント化することです。セマンティックセグメンテーションタスクにはさまざまなアーキテクチャが提案されており、UNet もその 1 つです。本論文では、セグメンテーションマップを構築するためのUNetデコーダによる特徴抽出のためのエンコーダとして、Googleが新しく提案したEfficientNetV2に基づく新しいアーキテクチャを提案します。このアプローチを使用して、マサチューセッツ州の建物と道路のデータセットのベンチマークスコアをそれぞれ 0.8365 と 0.9153 の mIOU で達成しました。

In city, information about urban objects such as water supply, railway lines, power lines, buildings, roads, etc., is necessary for city planning. In particular, information about the spread of these objects, locations and capacity is needed for the policymakers to make impactful decisions. This thesis aims to segment the building and roads from the aerial image captured by the satellites and UAVs. Many different architectures have been proposed for the semantic segmentation task and UNet being one of them. In this thesis, we propose a novel architecture based on Google's newly proposed EfficientNetV2 as an encoder for feature extraction with UNet decoder for constructing the segmentation map. Using this approach we achieved a benchmark score for the Massachusetts Building and Road dataset with an mIOU of 0.8365 and 0.9153 respectively.

updated: Sat Jul 08 2023 14:08:37 GMT+0000 (UTC)

published: Sat Jul 08 2023 14:08:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト