Semantic Human Parsing via Scalable Semantic Transfer over Multiple Label Domains

Jie Yang; Chaoqun Wang; Zhen Li; Junle Wang; Ruimao Zhang

複数のラベルドメインにわたるスケーラブルなセマンティック転送によるセマンティックヒューマンパーシング

このホワイトペーパーでは、新しいトレーニングパラダイムである Scalable Semantic Transfer (SST) を紹介し、さまざまなラベルドメイン (つまり、さまざまなレベルのラベル粒度) からのデータの相互利益を活用して、強力なヒューマンパーシングネットワークをトレーニングする方法を探ります。実際には、ユニバーサル解析と専用解析と呼ばれる 2 つの一般的なアプリケーションシナリオが扱われます。前者は複数のラベルドメインから均質な人間の表現を学習し、異なるセグメンテーションヘッドのみを使用して予測を切り替えることを目的とし、後者は特定のドメイン予測を学習することを目的としています。他のドメインからセマンティック知識を抽出しながら。提案された SST には、次の魅力的な利点があります。(1) 複数のラベルドメインから人間の身体部分のセマンティックな関連付けを人間の表現学習プロセスに埋め込むための効果的なトレーニングスキームとして機能することができます。（2）複数のラベルドメインの全体的な関係を事前に決定することなく、拡張可能なセマンティックトランスファーフレームワークであり、トレーニングを促進するために人間の解析データセットを継続的に追加できます。 (3) 関連するモジュールは補助トレーニングにのみ使用され、推論中に削除できるため、余分な推論コストが削減されます。実験結果は、SST が有望なユニバーサルヒューマンパーシングパフォーマンスを効果的に達成できるだけでなく、3 つのヒューマンパーシングベンチマーク (PASCAL-Person-Part、ATR、および CIHP) と比較して印象的な改善を達成できることを示しています。コードは https://github.com/yangjie-cv/SST で入手できます。

This paper presents Scalable Semantic Transfer (SST), a novel training paradigm, to explore how to leverage the mutual benefits of the data from different label domains (i.e. various levels of label granularity) to train a powerful human parsing network. In practice, two common application scenarios are addressed, termed universal parsing and dedicated parsing, where the former aims to learn homogeneous human representations from multiple label domains and switch predictions by only using different segmentation heads, and the latter aims to learn a specific domain prediction while distilling the semantic knowledge from other domains. The proposed SST has the following appealing benefits: (1) it can capably serve as an effective training scheme to embed semantic associations of human body parts from multiple label domains into the human representation learning process; (2) it is an extensible semantic transfer framework without predetermining the overall relations of multiple label domains, which allows continuously adding human parsing datasets to promote the training. (3) the relevant modules are only used for auxiliary training and can be removed during inference, eliminating the extra reasoning cost. Experimental results demonstrate SST can effectively achieve promising universal human parsing performance as well as impressive improvements compared to its counterparts on three human parsing benchmarks (i.e., PASCAL-Person-Part, ATR, and CIHP). Code is available at https://github.com/yangjie-cv/SST.

updated: Sun Apr 09 2023 02:44:29 GMT+0000 (UTC)

published: Sun Apr 09 2023 02:44:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト