SCIDA: Self-Correction Integrated Domain Adaptation from Single- to Multi-label Aerial Images

Tianze Yu; Jianzhe Lin; Lichao Mou; Yuansheng Hua; Xiaoxiang Zhu; Z. Jane Wang

SCIDA：シングルラベルからマルチラベルの航空画像への自己修正統合ドメイン適応

画像分類のために公開されているデータセットのほとんどは単一のラベルを使用していますが、画像は私たちの日常生活では本質的に複数のラベルが付けられています。このような注釈のギャップにより、事前にトレーニングされた多くの単一ラベル分類モデルが実際のシナリオで失敗します。この注釈の問題は、航空画像に関係しています。センサーから収集された航空データは、当然、複数のラベルが付いた比較的広い土地をカバーしますが、公開されている注釈付きの航空データセット（UCM、AIDなど）は単一ラベルです。マルチラベル航空画像に手動で注釈を付けることは時間/労力を要するため、自動マルチラベル学習のための新しい自己修正統合ドメイン適応（SCIDA）法を提案します。 SCIDAは弱く監視されています。つまり、公開されている大量のシングルラベル画像を使用してマルチラベル画像分類モデルを自動的に学習します。この目標を達成するために、基礎となるラベルの相関関係をよりよく調査するための新しいLabel-Wise self-Correction（LWC）モジュールを提案します。このモジュールは、シングルラベルデータからマルチラベルデータへの教師なしドメイン適応（UDA）も可能にします。モデルトレーニングの場合、提案されたモデルは単一ラベル情報のみを使用しますが、複数ラベルデータの事前知識は必要ありません。また、マルチラベル航空画像のラベルを予測します。単一ラベルのMAI-AID-sおよびMAI-UCM-sデータセットでトレーニングされた実験では、提案されたモデルは、収集されたマルチシーン空中画像（MAI）データセットで直接テストされます。

Most publicly available datasets for image classification are with single labels, while images are inherently multi-labeled in our daily life. Such an annotation gap makes many pre-trained single-label classification models fail in practical scenarios. This annotation issue is more concerned for aerial images: Aerial data collected from sensors naturally cover a relatively large land area with multiple labels, while annotated aerial datasets, which are publicly available (e.g., UCM, AID), are single-labeled. As manually annotating multi-label aerial images would be time/labor-consuming, we propose a novel self-correction integrated domain adaptation (SCIDA) method for automatic multi-label learning. SCIDA is weakly supervised, i.e., automatically learning the multi-label image classification model from using massive, publicly available single-label images. To achieve this goal, we propose a novel Label-Wise self-Correction (LWC) module to better explore underlying label correlations. This module also makes the unsupervised domain adaptation (UDA) from single- to multi-label data possible. For model training, the proposed model only uses single-label information yet requires no prior knowledge of multi-labeled data; and it predicts labels for multi-label aerial images. In our experiments, trained with single-labeled MAI-AID-s and MAI-UCM-s datasets, the proposed model is tested directly on our collected Multi-scene Aerial Image (MAI) dataset.

updated: Mon Nov 29 2021 23:09:49 GMT+0000 (UTC)

published: Sun Aug 15 2021 20:38:02 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト