Continual Coarse-to-Fine Domain Adaptation in Semantic Segmentation

Donald Shenaj; Francesco Barbato; Umberto Michieli; Pietro Zanuttigh

Deep neural networks are typically trained in a single shot for a specific task and data distribution, but in real-world settings both the task and the application domain can change. The problem is even more challenging in dense prediction tasks such as semantic segmentation, and most approaches tackle the two problems separately. In this paper we introduce the novel task of coarse-to-fine learning of semantic segmentation architectures in the presence of domain shift. We consider successive learning stages that progressively refine the task at the semantic level; i.e., the finer set of semantic labels at each learning step is hierarchically derived from the coarser set of the previous step. We propose a new approach (CCDA) to tackle this scenario. First, we employ the maximum squares loss to align source and target domains and, at the same time, to balance the gradients between well-classified and harder samples. Second, we introduce a novel coarse-to-fine knowledge distillation constraint to transfer network capabilities acquired on a coarser set of labels to a set of finer labels. Finally, we design a coarse-to-fine weight initialization rule to spread the importance from each coarse class to its respective finer classes. To evaluate our approach, we design two benchmarks in which source knowledge is extracted from the GTA5 dataset and transferred to either the Cityscapes or the IDD dataset, and we show that it outperforms the main competitors.
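The abstract names two ingredients that a small sketch can make concrete: the maximum squares loss and a coarse-to-fine initialization of the classifier. The Python below is an illustration only, not the authors' implementation. The loss follows the commonly used definition, the negative mean over pixels of half the sum of squared class probabilities. The initialization rule is an assumption: the abstract does not give the formula, so this sketch simply copies each coarse class's weights to its children and shifts the bias by -log(k), which makes the k children's probabilities sum to the parent's. All function names, the toy weights, and the `children` list are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def maximum_squares_loss(pixel_probs):
    """-(1 / 2N) * sum over N pixels and all classes of p^2."""
    n = len(pixel_probs)
    return -sum(p * p for probs in pixel_probs for p in probs) / (2 * n)

def init_fine_classifier(coarse_w, coarse_b, children):
    """Assumed rule: each fine class copies its parent's weights.

    children[i] is the number of fine classes derived from coarse class i.
    The bias is shifted by -log(k) so the k children share the parent's
    softmax mass equally.
    """
    fine_w, fine_b = [], []
    for parent, k in enumerate(children):
        for _ in range(k):
            fine_w.append(list(coarse_w[parent]))
            fine_b.append(coarse_b[parent] - math.log(k))
    return fine_w, fine_b

def fine_to_coarse(fine_probs, children):
    """Sum fine-class probabilities back into their coarse parents."""
    coarse, start = [], 0
    for k in children:
        coarse.append(sum(fine_probs[start:start + k]))
        start += k
    return coarse

def linear_logits(weights, biases, feature):
    """Per-pixel linear classifier: one logit per class."""
    return [sum(w * x for w, x in zip(row, feature)) + b
            for row, b in zip(weights, biases)]

# Toy setup: 2 coarse classes split into 2 and 3 fine classes.
coarse_w = [[1.0, 0.0], [0.0, 1.0]]
coarse_b = [0.0, 0.5]
children = [2, 3]
feature = [0.3, -0.2]

fine_w, fine_b = init_fine_classifier(coarse_w, coarse_b, children)
coarse_probs = softmax(linear_logits(coarse_w, coarse_b, feature))
fine_probs = softmax(linear_logits(fine_w, fine_b, feature))
merged = fine_to_coarse(fine_probs, children)  # matches coarse_probs
```

Summing fine probabilities per parent, as `fine_to_coarse` does, is also one plausible way to compare a fine-level model with its coarse-level predecessor in a distillation term; whether CCDA does exactly this is not stated in the abstract.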

updated: Tue Jan 18 2022 13:31:19 GMT+0000 (UTC)

published: Tue Jan 18 2022 13:31:19 GMT+0000 (UTC)

arXiv
