Efficient Person Search: An Anchor-Free Approach

Yichao Yan; Jinpeng Li; Jie Qin; Shengcai Liao; Xiaokang Yang

Person search aims to simultaneously localize and identify a query person in realistic, uncropped images. To achieve this goal, state-of-the-art models typically add a re-id branch on top of a two-stage detector such as Faster R-CNN. Owing to the ROI-Align operation, this pipeline yields promising accuracy because re-id features are explicitly aligned with the corresponding object regions, but it also incurs high computational overhead due to dense object anchors. In this work, we present an anchor-free approach that tackles this challenging task efficiently by introducing the following dedicated designs. First, we select an anchor-free detector (i.e., FCOS) as the prototype of our framework. Because it does not rely on dense object anchors, it is significantly more efficient than existing person search models. Second, directly adapting this anchor-free detector to person search raises several major challenges in learning robust re-id features, which we summarize as misalignment issues at different levels (i.e., scale, region, and task). To address these issues, we propose an aligned feature aggregation module that generates more discriminative and robust feature embeddings. Accordingly, we name our model the Feature-Aligned Person Search Network (AlignPS). Third, by investigating the advantages of both anchor-based and anchor-free models, we further augment AlignPS with an ROI-Align head, which significantly improves the robustness of re-id features while keeping the model highly efficient. Extensive experiments on two challenging benchmarks (i.e., CUHK-SYSU and PRW) demonstrate that our framework achieves state-of-the-art or competitive performance while being more efficient. All source code, data, and trained models are available at: https://github.com/daodaofr/alignps.
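
To make the task definition concrete, the following is a minimal sketch of the retrieval step that person search implies: each gallery detection carries a box, a detection score, and a re-id embedding, and detections are ranked by similarity to the query embedding. This is not the AlignPS implementation. The function names, the dictionary layout, the use of cosine similarity, and the `min_det_score` threshold are all assumptions made for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_gallery(query_emb, detections, min_det_score=0.5):
    """Rank gallery detections by re-id similarity to the query.

    `detections` is a hypothetical list of dicts with keys
    "image", "box", "score" (detector confidence) and "emb" (re-id embedding).
    Detections below `min_det_score` are dropped before ranking.
    """
    kept = [d for d in detections if d["score"] >= min_det_score]
    scored = [(cosine(query_emb, d["emb"]), d["image"], d["box"]) for d in kept]
    # Highest similarity first.
    return sorted(scored, key=lambda r: r[0], reverse=True)


# Toy data: two confident detections and one low-confidence detection.
gallery = [
    {"image": "a.jpg", "box": (10, 20, 50, 120), "score": 0.9, "emb": [0.9, 0.1]},
    {"image": "b.jpg", "box": (30, 40, 70, 150), "score": 0.8, "emb": [0.0, 1.0]},
    {"image": "c.jpg", "box": (5, 5, 40, 100), "score": 0.2, "emb": [1.0, 0.0]},
]
ranking = rank_gallery([1.0, 0.0], gallery)
```

In this toy example the low-confidence detection in `c.jpg` is filtered out even though its embedding matches the query exactly, which illustrates why detection and re-id quality both matter for the final ranking.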

updated: Wed Sep 01 2021 07:01:33 GMT+0000 (UTC)

published: Wed Sep 01 2021 07:01:33 GMT+0000 (UTC)

arXiv

