Unsupervised Person Re-identification by Deep Asymmetric Metric Embedding

Hong-Xing Yu; Ancong Wu; Wei-Shi Zheng

深層非対称メトリック埋め込みによる教師なし人物再識別

人物の再識別(Re-ID)は、重複しないカメラビュー間のアイデンティティの照合を目的としている。研究者は多くの教師付き再識別モデルを提案しているが、これらのモデルではクロスビューのペアでラベル付けされた大量のデータを必要とする。このため、複数のカメラビューからの大量のデータが利用可能なもののラベル付けされていない多くのアプリケーションでのスケーラビリティが制限されている。このスケーラビリティの問題を解決するために、教師なしRe-IDモデルがいくつか提案されているが、これらのモデルは多くの場合、異なるカメラビュー間の劇的な分散、例えば、異なる照明、視点、オクルージョンなどによって引き起こされるビュー固有のバイアス問題に悩まされている。劇的な分散は、異なるカメラビューで特定の特徴の歪みを誘発し、これは、バイアスを軽減するのに役立つラベル情報が利用できないため、教師なしシナリオでRe-IDのためのクロスビュー識別情報を見つける際に非常に邪魔になる可能性がある。我々は、クロスビュークラスタリングに基づく教師なしの非対称距離メトリックを学習することで、この問題に明示的に対処することを提案する。非対称距離メトリックは、各カメラビューに対して特定の特徴量を変換し、特定の特徴量の歪みに対処することを可能にする。次に、非対称距離メトリックをディープニューラルネットワークに埋め込むための新しい教師なし損失関数を設計し、その結果、ディープクラスタリングに基づく非対称メトリック学習(DEep Clustering-based Asymmetric MEtric Learning; DECAMEL)と名付けられた新しい教師なしディープフレームワークを開発した。このようにして、DECAMELは特徴表現と教師なしの非対称メトリックを共同で学習する。DECAMELは、再識別データのコンパクトなクロスビュークラスタ構造を学習することで、ビュー固有のバイアスを緩和し、教師なし再識別のための潜在的なクロスビュー識別情報のマイニングを容易にすることができる。異なるオーダのサイズを持つ7つのベンチマークデータセットを用いた広範な実験により、我々のフレームワークの有効性が示された。

Person re-identification (Re-ID) aims to match identities across non-overlapping camera views. Researchers have proposed many supervised Re-ID models which require quantities of cross-view pairwise labelled data. This limits their scalabilities to many applications where a large amount of data from multiple disjoint camera views is available but unlabelled. Although some unsupervised Re-ID models have been proposed to address the scalability problem, they often suffer from the view-specific bias problem which is caused by dramatic variances across different camera views, e.g., different illumination, viewpoints and occlusion. The dramatic variances induce specific feature distortions in different camera views, which can be very disturbing in finding cross-view discriminative information for Re-ID in the unsupervised scenarios, since no label information is available to help alleviate the bias. We propose to explicitly address this problem by learning an unsupervised asymmetric distance metric based on cross-view clustering. The asymmetric distance metric allows specific feature transformations for each camera view to tackle the specific feature distortions. We then design a novel unsupervised loss function to embed the asymmetric metric into a deep neural network, and therefore develop a novel unsupervised deep framework named the DEep Clustering-based Asymmetric MEtric Learning (DECAMEL). In such a way, DECAMEL jointly learns the feature representation and the unsupervised asymmetric metric. DECAMEL learns a compact cross-view cluster structure of Re-ID data, and thus help alleviate the view-specific bias and facilitate mining the potential cross-view discriminative information for unsupervised Re-ID. Extensive experiments on seven benchmark datasets whose sizes span several orders show the effectiveness of our framework.

updated: Tue Jan 29 2019 08:49:26 GMT+0000 (UTC)

published: Tue Jan 29 2019 08:49:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト