Part-Aware Transformer for Generalizable Person Re-identification

Hao Ni; Yuke Li; Lianli Gao; Heng Tao Shen; Jingkuan Song

Domain generalization person re-identification (DG-ReID) aims to train a model on source domains and generalize well to unseen domains. Vision Transformers usually generalize better than common CNNs under distribution shifts. However, Transformer-based ReID models inevitably over-fit to domain-specific biases because of the supervised learning strategy used on the source domain. We observe that while the global images of different IDs should have different features, their similar local parts (e.g., a black backpack) are not bound by this constraint. Motivated by this, we propose a pure Transformer model (termed Part-aware Transformer) for DG-ReID by designing a proxy task, named Cross-ID Similarity Learning (CSL), to mine local visual information shared by different IDs. This proxy task allows the model to learn generic features because it considers only the visual similarity of the parts, regardless of the ID labels, thus alleviating the side effects of domain-specific biases. Based on the local similarity obtained in CSL, a Part-guided Self-Distillation (PSD) is proposed to further improve the generalization of global features. Our method achieves state-of-the-art performance under most DG-ReID settings. Under the Market→Duke setting, our method exceeds the previous state of the art by 10.9% and 12.8% in Rank-1 and mAP, respectively. The code is available at https://github.com/liyuke65535/Part-Aware-Transformer.
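
The idea of mining local parts that look alike across different IDs can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how CSL measures or uses similarity, so the cosine similarity, the nearest-neighbor search, and the names `cosine` and `mine_cross_id_neighbors` are all assumptions made for illustration. For each part feature, the sketch finds the most similar part that belongs to a different ID.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def mine_cross_id_neighbors(part_feats, ids):
    """Hypothetical helper: for each part feature, return (index, similarity)
    of the most similar part whose ID differs, or None if no such part exists."""
    neighbors = []
    for f, pid in zip(part_feats, ids):
        best = None
        for j, (g, qid) in enumerate(zip(part_feats, ids)):
            if qid == pid:
                continue  # skip parts from the same identity
            s = cosine(f, g)
            if best is None or s > best[1]:
                best = (j, s)
        neighbors.append(best)
    return neighbors

# Toy part features (2-D for readability) with their identity labels.
part_feats = [
    [1.0, 0.0],   # ID 0, e.g. a backpack-like part
    [0.0, 1.0],   # ID 0
    [0.9, 0.1],   # ID 1, visually close to the first part
    [-1.0, 0.0],  # ID 2
]
ids = [0, 0, 1, 2]
neighbors = mine_cross_id_neighbors(part_feats, ids)
for i, nb in enumerate(neighbors):
    print(i, nb)
```

In this toy example the first part (ID 0) is matched to the third part (ID 1), which mirrors the abstract's observation that similar local parts can be shared by different identities even though their global features must differ.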

updated: Mon Sep 18 2023 08:18:31 GMT+0000 (UTC)

published: Mon Aug 07 2023 06:15:51 GMT+0000 (UTC)

arXiv
