Building Segmentation through a Gated Graph Convolutional Neural Network with Deep Structured Feature Embedding

Yilei Shi; Qingyu Li; Xiao Xiang Zhu

深い構造化された特徴の埋め込みを備えたゲート付きグラフ畳み込みニューラルネットワークによるセグメンテーションの構築

光学画像からの建物の自動抽出は、たとえば建物の形状が複雑なため、課題のままです。セマンティックセグメンテーションは、このタスクの効率的なアプローチです。ディープコンボリューショナルニューラルネットワーク（DCNN）の最新の開発により、正確なピクセルレベルの分類タスクが可能になりました。それでも、中心的な問題が1つ残っています。境界の正確な描写です。一般に、ディープアーキテクチャは、プログレッシブダウンサンプリングのため、正確な境界を持つきめ細かなセグメンテーションを生成できません。したがって、グラフ畳み込みネットワーク（GCN）と深層構造化機能埋め込み（DSFE）をエンドツーエンドのワークフローに統合し、問題を克服するための汎用フレームワークを導入します。さらに、古典的なグラフ畳み込みニューラルネットワークを使用する代わりに、弱いグラフおよび粗いセマンティック予測を改良してシャープな境界線ときめの細かいピクセルレベルの分類を生成できるゲートグラフ畳み込みネットワークを提案します。建物のフットプリントのセマンティックセグメンテーションを実用的な例として、アーキテクチャとグラフニューラルネットワークを埋め込むさまざまな機能を比較しました。新しいGCNアーキテクチャを備えた提案されたフレームワークは、最先端のアプローチよりも優れています。この作業の主なタスクはフットプリント抽出の構築ですが、提案された方法は一般に他のバイナリまたはマルチラベルセグメンテーションタスクに適用できます。

Automatic building extraction from optical imagery remains a challenge due to, for example, the complexity of building shapes. Semantic segmentation is an efficient approach for this task. The latest development in deep convolutional neural networks (DCNNs) has made accurate pixel-level classification tasks possible. Yet one central issue remains: the precise delineation of boundaries. Deep architectures generally fail to produce fine-grained segmentation with accurate boundaries due to their progressive down-sampling. Hence, we introduce a generic framework to overcome the issue, integrating the graph convolutional network (GCN) and deep structured feature embedding (DSFE) into an end-to-end workflow. Furthermore, instead of using a classic graph convolutional neural network, we propose a gated graph convolutional network, which enables the refinement of weak and coarse semantic predictions to generate sharp borders and fine-grained pixel-level classification. Taking the semantic segmentation of building footprints as a practical example, we compared different feature embedding architectures and graph neural networks. Our proposed framework with the new GCN architecture outperforms state-of-the-art approaches. Although our main task in this work is building footprint extraction, the proposed method can be generally applied to other binary or multi-label segmentation tasks.

updated: Fri Nov 08 2019 10:19:13 GMT+0000 (UTC)

published: Fri Nov 08 2019 10:19:13 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト