Generalizable Pedestrian Detection: The Elephant In The Room

Irtiza Hasan; Shengcai Liao; Jinpeng Li; Saad Ullah Akram; Ling Shao

一般化可能な歩行者の検出：部屋の中の象

歩行者検知は、ビデオ監視から自動運転に至るまで、多くの視覚ベースのアプリケーションで使用されています。高性能を達成しているにもかかわらず、既存の検出器が目に見えないデータにどれだけうまく一般化されているかはまだほとんどわかっていません。実用的な検出器は、アプリケーションのさまざまなシナリオですぐに使用できる必要があるため、これは重要です。この目的のために、直接クロスデータセット評価の一般原則を使用して、このペーパーで包括的な調査を実施します。この調査を通じて、既存の最先端の歩行者検出器は、同じデータセットでトレーニングおよびテストした場合は非常に優れたパフォーマンスを発揮しますが、クロスデータセットの評価では一般化が不十分であることがわかりました。この傾向には2つの理由があることを示しています。まず、それらの設計（アンカー設定など）は、従来の単一データセットのトレーニングおよびテストパイプラインで一般的なベンチマークに偏っている可能性がありますが、その結果、一般化機能が大幅に制限されます。第二に、トレーニングソースは一般的に歩行者が密集しておらず、シナリオが多様です。直接のクロスデータセット評価の下で、驚くべきことに、設計に歩行者に合わせた適応がない汎用オブジェクト検出器は、既存の最先端の歩行者検出器と比較してはるかに一般化されていることがわかります。さらに、Webをクロールすることによって収集された多様で高密度のデータセットが、歩行者検出のための事前トレーニングの効率的なソースとして機能することを示します。したがって、プログレッシブトレーニングパイプラインを提案し、自動運転指向の歩行者検出に適していることを確認します。したがって、この論文で実施された研究は、一般化可能な歩行者検出器の将来の設計のために、データセット間の評価にさらに重点を置く必要があることを示唆しています。コードとモデルには、https：//github.com/hasanirtiza/Pedestronからアクセスできます。

Pedestrian detection is used in many vision based applications ranging from video surveillance to autonomous driving. Despite achieving high performance, it is still largely unknown how well existing detectors generalize to unseen data. This is important because a practical detector should be ready to use in various scenarios in applications. To this end, we conduct a comprehensive study in this paper, using a general principle of direct cross-dataset evaluation. Through this study, we find that existing state-of-the-art pedestrian detectors, though perform quite well when trained and tested on the same dataset, generalize poorly in cross dataset evaluation. We demonstrate that there are two reasons for this trend. Firstly, their designs (e.g. anchor settings) may be biased towards popular benchmarks in the traditional single-dataset training and test pipeline, but as a result largely limit their generalization capability. Secondly, the training source is generally not dense in pedestrians and diverse in scenarios. Under direct cross-dataset evaluation, surprisingly, we find that a general purpose object detector, without pedestrian-tailored adaptation in design, generalizes much better compared to existing state-of-the-art pedestrian detectors. Furthermore, we illustrate that diverse and dense datasets, collected by crawling the web, serve to be an efficient source of pre-training for pedestrian detection. Accordingly, we propose a progressive training pipeline and find that it works well for autonomous-driving oriented pedestrian detection. Consequently, the study conducted in this paper suggests that more emphasis should be put on cross-dataset evaluation for the future design of generalizable pedestrian detectors. Code and models can be accessed at https://github.com/hasanirtiza/Pedestron.

updated: Wed Dec 09 2020 08:56:09 GMT+0000 (UTC)

published: Thu Mar 19 2020 14:14:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト