Center and Scale Prediction: A Box-free Approach for Pedestrian and Face Detection

Wei Liu; Irtiza Hasan; Shengcai Liao

中心およびスケール予測：歩行者および顔検出のためのボックスなしアプローチ

一般に、オブジェクト検出には、伝統的なスライディングウィンドウ分類子、または現代の深層学習アプローチにおけるアンカーボックスベースの予測が必要です。ただし、これらのアプローチのいずれも、ボックス内の退屈な構成を必要とします。このホワイトペーパーでは、オブジェクトの検出が高レベルのセマンティックフィーチャ検出タスクとして動機付けられているという新しい視点を提供します。エッジ、コーナー、ブロブ、その他の特徴検出器のように、提案された検出器は、画像全体にわたって特徴点をスキャンします。畳み込みは自然に適しています。ただし、これらの従来の低レベルの機能とは異なり、提案されている検出器は高レベルの抽象化を目指しています。つまり、オブジェクトがある中心点を探しており、現代の深いモデルはすでにそのような高レベルのセマンティック抽象化が可能です。また、ブロブ検出と同様に、中心点のスケールも予測しますが、これも簡単な畳み込みです。したがって、このペーパーでは、歩行者と顔の検出は、畳み込みによる単純な中心およびスケール予測タスクとして単純化されます。このようにして、提案された方法は、ボックスなしの設定を楽しんでいます。構造的にはシンプルですが、歩行者の検出や顔の検出など、いくつかの困難なベンチマークで競争力のある精度を提供します。さらに、クロスデータセット評価が実行され、提案された方法の優れた一般化能力が実証されます。

Object detection generally requires sliding-window classifiers in tradition or anchor box based predictions in modern deep learning approaches. However, either of these approaches requires tedious configurations in boxes. In this paper, we provide a new perspective where detecting objects is motivated as a high-level semantic feature detection task. Like edges, corners, blobs and other feature detectors, the proposed detector scans for feature points all over the image, for which the convolution is naturally suited. However, unlike these traditional low-level features, the proposed detector goes for a higher-level abstraction, that is, we are looking for central points where there are objects, and modern deep models are already capable of such a high-level semantic abstraction. Besides, like blob detection, we also predict the scales of the central points, which is also a straightforward convolution. Therefore, in this paper, pedestrian and face detection is simplified as a straightforward center and scale prediction task through convolutions. This way, the proposed method enjoys a box-free setting. Though structurally simple, it presents competitive accuracy on several challenging benchmarks, including pedestrian detection and face detection. Furthermore, a cross-dataset evaluation is performed, demonstrating a superior generalization ability of the proposed method

updated: Sun Feb 09 2020 13:32:58 GMT+0000 (UTC)

published: Fri Apr 05 2019 09:14:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト