ThunderNet: Towards Real-time Generic Object Detection

Zheng Qin; Zeming Li; Zhaoning Zhang; Yiping Bao; Gang Yu; Yuxing Peng; Jian Sun

ThunderNet：リアルタイムの汎用オブジェクト検出に向けて

モバイルプラットフォームでのリアルタイムの汎用オブジェクト検出は、重要ですが困難なコンピュータビジョンタスクです。ただし、以前のCNNベースの検出器は、莫大な計算コストに悩まされており、計算に制約のあるシナリオでのリアルタイムの推論を妨げています。本論文では、リアルタイムの一般的な検出における2段検出器の有効性を調査し、ThunderNetという名前の軽量2段検出器を提案します。バックボーンの部分では、以前の軽量バックボーンの欠点を分析し、オブジェクト検出用に設計された軽量バックボーンを紹介します。検出部分では、非常に効率的なRPNと検出ヘッドの設計を活用します。より識別力のある特徴表現を生成するために、2つの効率的なアーキテクチャブロック、コンテキスト拡張モジュールと空間注意モジュールを設計します。最後に、入力解像度、バックボーン、および検出ヘッドの間のバランスを調査します。 ThunderNetは、軽量の1ステージ検出器と比較して、PASCAL VOCおよびCOCOベンチマークの計算コストのわずか40％で優れたパフォーマンスを実現します。ベルやホイッスルがない場合、モデルはARMベースのデバイスで24.1fpsで動作します。私たちの知る限り、これはARMプラットフォームで報告された最初のリアルタイム検出器です。コードとモデルはhttps://github.com/qinzheng93/ThunderNetで入手できます。

Real-time generic object detection on mobile platforms is a crucial but challenging computer vision task. However, previous CNN-based detectors suffer from enormous computational cost, which hinders them from real-time inference in computation-constrained scenarios. In this paper, we investigate the effectiveness of two-stage detectors in real-time generic detection and propose a lightweight two-stage detector named ThunderNet. In the backbone part, we analyze the drawbacks in previous lightweight backbones and present a lightweight backbone designed for object detection. In the detection part, we exploit an extremely efficient RPN and detection head design. To generate more discriminative feature representation, we design two efficient architecture blocks, Context Enhancement Module and Spatial Attention Module. At last, we investigate the balance between the input resolution, the backbone, and the detection head. Compared with lightweight one-stage detectors, ThunderNet achieves superior performance with only 40% of the computational cost on PASCAL VOC and COCO benchmarks. Without bells and whistles, our model runs at 24.1 fps on an ARM-based device. To the best of our knowledge, this is the first real-time detector reported on ARM platforms. Our code and models are available at https://github.com/qinzheng93/ThunderNet.

updated: Mon Mar 28 2022 16:36:17 GMT+0000 (UTC)

published: Thu Mar 28 2019 01:48:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト