Sci-Net: a Scale Invariant Model for Buildings Segmentation from Aerial Images

Hasan Nasrallah; Mustafa Shukor; Ali J. Ghandour

Building segmentation is a fundamental task in earth observation and aerial imagery analysis. Most existing deep learning-based methods in the literature can be applied only to imagery with a fixed or narrow range of spatial resolutions. In practical scenarios, users deal with a broad spectrum of image resolutions. Thus, a given aerial image often needs to be re-sampled to match the spatial resolution of the dataset used to train the deep learning model, which degrades segmentation performance. To overcome this, we propose a Scale-invariant Neural Network (Sci-Net) that can segment buildings present in aerial images at different spatial resolutions. Specifically, our approach leverages UNet hierarchical representations and dilated convolutions to extract fine-grained multi-scale representations. Our method significantly outperforms other state-of-the-art models on the Open Cities AI dataset, with a steady improvement margin across different resolutions.
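As a rough illustration of why dilated convolutions help with multi-scale context, the sketch below implements a plain 1D dilated convolution and computes the receptive field of a stack of such layers. It is not the Sci-Net architecture: the function names, the kernel size of 3, and the dilation rates (1, 2, 4) are assumptions chosen for illustration only.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid (unpadded) 1D dilated convolution in cross-correlation form.

    Hypothetical helper for illustration; kernel taps are spaced
    `dilation` samples apart instead of being adjacent.
    """
    span = (len(kernel) - 1) * dilation  # distance between first and last tap
    out = []
    for i in range(len(signal) - span):
        out.append(sum(w * signal[i + j * dilation] for j, w in enumerate(kernel)))
    return out


def receptive_field(kernel_size, dilations):
    """Receptive field (in samples) of stacked stride-1 dilated layers."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


signal = [1, 2, 3, 4, 5]
edge_kernel = [1, 0, -1]

dense = dilated_conv1d(signal, edge_kernel, dilation=1)
sparse = dilated_conv1d(signal, edge_kernel, dilation=2)

# Same number of weights per layer, very different context size.
rf_plain = receptive_field(3, [1, 1, 1])
rf_dilated = receptive_field(3, [1, 2, 4])
print(dense, sparse, rf_plain, rf_dilated)
```

With three layers of kernel size 3, increasing the dilation rate widens the context each output sees without adding weights or reducing resolution, which is the general property that makes dilation attractive when objects appear at varying ground sampling distances.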

updated: Mon Feb 28 2022 10:58:48 GMT+0000 (UTC)

published: Fri Nov 12 2021 16:45:20 GMT+0000 (UTC)

arXiv
