Cross-Region Building Counting in Satellite Imagery using Counting Consistency

Muaaz Zakria; Hamza Rawal; Waqas Sultani; Mohsen Ali

カウントの一貫性を使用した衛星画像でのクロスリージョンの建物のカウント

地理的地域の建物の数を見積もることは、都市分析、災害管理、および公共政策決定の重要な要素です。衛星画像でローカリゼーションとカウントを構築するためのディープラーニング手法は、実行可能で安価な代替手段として役立ちます。ただし、これらのアルゴリズムは、トレーニングされていない領域に適用するとパフォーマンスが低下します。現在の大規模なデータセットは主に開発された地域をカバーしており、すべての地域についてそのようなデータセットを収集することは、費用と時間がかかり、困難な作業です。本論文では、ラベル付きソースドメイン（開発地域）を使用し、トレーニング済みモデルをラベルなしターゲットドメイン（開発地域）に適応させる、建物をカウントするための教師なしドメイン適応方法を提案します。最初に、敵対的な損失を通じて出力スペースの分布を調整することにより、ドメイン間で分布マップを調整します。次に、カウントの一貫性の制約、画像内のカウントの一貫性、および画像間のカウントの一貫性を活用して、ドメインシフトを減らします。イメージ内の一貫性により、イメージ全体の建物の数は、そのサブイメージのいずれかの数以上である必要があります。画像全体の一貫性の制約により、画像に他の画像よりもかなり多くの建物が含まれている場合、それらのサブ画像も同じ順序になります。これらの2つの制約により、スケールに関係なく、画像全体および画像内で動作の一貫性が保たれます。提案されたアプローチのパフォーマンスを評価するために、既存のデータセットと比較して、より高い建物密度と不規則な構造を持つ挑戦的な南アジア地域からなる大規模なデータセットを収集して注釈を付けました。私たちは、私たちのアプローチの有効性を検証するために広範な実験を行い、競合するベースライン方法よりも約7％から20％の改善を報告しています。

Estimating the number of buildings in any geographical region is a vital component of urban analysis, disaster management, and public policy decision. Deep learning methods for building localization and counting in satellite imagery, can serve as a viable and cheap alternative. However, these algorithms suffer performance degradation when applied to the regions on which they have not been trained. Current large datasets mostly cover the developed regions and collecting such datasets for every region is a costly, time-consuming, and difficult endeavor. In this paper, we propose an unsupervised domain adaptation method for counting buildings where we use a labeled source domain (developed regions) and adapt the trained model on an unlabeled target domain (developing regions). We initially align distribution maps across domains by aligning the output space distribution through adversarial loss. We then exploit counting consistency constraints, within-image count consistency, and across-image count consistency, to decrease the domain shift. Within-image consistency enforces that building count in the whole image should be greater than or equal to count in any of its sub-image. Across-image consistency constraint enforces that if an image contains considerably more buildings than the other image, then their sub-images shall also have the same order. These two constraints encourage the behavior to be consistent across and within the images, regardless of the scale. To evaluate the performance of our proposed approach, we collected and annotated a large-scale dataset consisting of challenging South Asian regions having higher building densities and irregular structures as compared to existing datasets. We perform extensive experiments to verify the efficacy of our approach and report improvements of approximately 7% to 20% over the competitive baseline methods.

updated: Tue Oct 26 2021 10:36:56 GMT+0000 (UTC)

published: Tue Oct 26 2021 10:36:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト