Adversarial Camera Alignment Network for Unsupervised Cross-camera Person Re-identification

Lei Qi; Lei Wang; Jing Huo; Yinghuan Shi; Xin Geng; Yang Gao

教師なしクロスカメラ人物再識別のための敵対的カメラ調整ネットワーク

個人の再識別（Re-ID）では、監視ありの方法では通常、大量の高価なラベル情報が必要ですが、監視なしの方法では、十分な識別パフォーマンスを提供できません。この論文では、カメラ内（カメラ内）ラベル情報のみを必要とし、カメラ間（カメラ間）ラベルを必要としない、監視されていないクロスカメラパーソンRe-IDと呼ばれる新しいパーソンRe-IDタスクを紹介します。入手するのにより高価です。実際のアプリケーションでは、カメラ内のラベル情報は、追跡アルゴリズムまたはいくつかの手動注釈によって簡単にキャプチャできます。この状況では、主な課題は、さまざまなカメラのさまざまな体のポーズ、オクルージョン、画像解像度、照明条件、およびバックグラウンドノイズによって引き起こされる、さまざまなカメラビュー間での分布の不一致になります。この状況に対処するために、教師なしクロスカメラパーソンRe-ID用の新しい敵対カメラアライメントネットワーク（ACAN）を提案します。これは、カメラの調整タスクと監視されたカメラ内学習タスクで構成されます。カメラの位置合わせを実現するために、マルチカメラ敵対学習（MCAL）を開発して、さまざまなカメラの画像を共有部分空間にマッピングします。特に、マルチカメラの敵対的タスクを実行するために、既存のGRL（つまり、勾配反転レイヤー）スキームと「その他のカメラ等確率」（OCE）と呼ばれる提案されたスキームを含む2つの異なるスキームを調査します。次に、この共有部分空間に基づいて、カメラ内ラベルを活用してネットワークをトレーニングします。 5つの大規模データセットでの広範な実験は、ラベル付けされたソースドメインとGANベースのモデルによって生成された画像を利用する複数の最先端の教師なし手法に対するACANの優位性を示しています。特に、提案されたマルチカメラの敵対的タスクが大幅な改善に貢献していることを確認します。

In person re-identification (Re-ID), supervised methods usually need a large amount of expensive label information, while unsupervised ones are still unable to deliver satisfactory identification performance. In this paper, we introduce a novel person Re-ID task called unsupervised cross-camera person Re-ID, which only needs the within-camera (intra-camera) label information but not cross-camera (inter-camera) labels which are more expensive to obtain. In real-world applications, the intra-camera label information can be easily captured by tracking algorithms or few manual annotations. In this situation, the main challenge becomes the distribution discrepancy across different camera views, caused by the various body pose, occlusion, image resolution, illumination conditions, and background noises in different cameras. To address this situation, we propose a novel Adversarial Camera Alignment Network (ACAN) for unsupervised cross-camera person Re-ID. It consists of the camera-alignment task and the supervised within-camera learning task. To achieve the camera alignment, we develop a Multi-Camera Adversarial Learning (MCAL) to map images of different cameras into a shared subspace. Particularly, we investigate two different schemes, including the existing GRL (i.e., gradient reversal layer) scheme and the proposed scheme called "other camera equiprobability" (OCE), to conduct the multi-camera adversarial task. Based on this shared subspace, we then leverage the within-camera labels to train the network. Extensive experiments on five large-scale datasets demonstrate the superiority of ACAN over multiple state-of-the-art unsupervised methods that take advantage of labeled source domains and generated images by GAN-based models. In particular, we verify that the proposed multi-camera adversarial task does contribute to the significant improvement.

updated: Fri Jul 09 2021 14:35:59 GMT+0000 (UTC)

published: Fri Aug 02 2019 13:58:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト