Graph Sampling Based Deep Metric Learning for Generalizable Person Re-Identification

Shengcai Liao; Ling Shao

一般化可能な人物の再識別のためのグラフサンプリングベースのディープメトリック学習

最近の研究によると、明示的な深い特徴のマッチングと大規模で多様なトレーニングデータの両方が、個人の再識別の一般化を大幅に改善できることが示されています。ただし、大規模なデータでディープマッチャーを学習する効率はまだ十分に研究されていません。分類パラメータまたはクラスメモリを使用した学習は一般的な方法ですが、大量のメモリと計算コストが発生します。対照的に、ミニバッチ内のペアワイズディープメトリック学習はより良い選択です。ただし、最も一般的なランダムサンプリング方法であるよく知られているPKサンプラーは、詳細なメトリック学習には有益で効率的ではありません。オンラインのハードサンプルマイニングは学習効率をある程度改善しましたが、ランダムサンプリング後のミニバッチでのマイニングはまだ制限されています。これにより、データサンプリングの段階で、ハードサンプルマイニングの使用を早期に検討するようになります。そのために、本論文では、大規模な深距離計量学習のために、グラフサンプリング（GS）と呼ばれる効率的なミニバッチサンプリング法を提案します。基本的な考え方は、各エポックの開始時にすべてのクラスの最近傍関係グラフを作成することです。次に、各ミニバッチは、ランダムに選択されたクラスとそれに最も近い隣接するクラスで構成され、学習のための有益でやりがいのある例を提供します。適応された競争力のあるベースラインとともに、一般化可能な人物の再識別における以前の最先端技術を大幅に改善し、ランク1で最大24％、mAPで最大13.8％向上します。さらに、提案された方法は、ランク1で最大6.2％、mAPで5.3％も競合ベースラインを上回っています。一方、トレーニング時間は最大5分の1に大幅に短縮されます。たとえば、8,000個のIDを持つ大規模なデータセットでトレーニングする場合は12.2時間から2.3時間に短縮されます。コードはhttps://github.com/ShengcaiLiao/QAConvで入手できます。

Recent studies show that, both explicit deep feature matching as well as large-scale and diverse training data can significantly improve the generalization of person re-identification. However, the efficiency of learning deep matchers on large-scale data has not yet been adequately studied. Though learning with classification parameters or class memory is a popular way, it incurs large memory and computational costs. In contrast, pairwise deep metric learning within mini batches would be a better choice. However, the most popular random sampling method, the well-known PK sampler, is not informative and efficient for deep metric learning. Though online hard example mining has improved the learning efficiency to some extent, the mining in mini batches after random sampling is still limited. This inspires us to explore the use of hard example mining earlier, in the data sampling stage. To do so, in this paper, we propose an efficient mini-batch sampling method, called graph sampling (GS), for large-scale deep metric learning. The basic idea is to build a nearest neighbor relationship graph for all classes at the beginning of each epoch. Then, each mini batch is composed of a randomly selected class and its nearest neighboring classes so as to provide informative and challenging examples for learning. Together with an adapted competitive baseline, we improve the previous state of the art in generalizable person re-identification significantly, by up to 24% in Rank-1 and 13.8% in mAP. Besides, the proposed method also outperforms the competitive baseline by up to 6.2% in Rank-1 and 5.3% in mAP. Meanwhile, the training time is significantly reduced by up to five times, e.g. from 12.2 hours to 2.3 hours when training on a large-scale dataset with 8,000 identities. Code is available at https://github.com/ShengcaiLiao/QAConv.

updated: Tue Dec 07 2021 18:44:31 GMT+0000 (UTC)

published: Sun Apr 04 2021 06:44:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト