Real-Time Anchor-Free Single-Stage 3D Detection with IoU-Awareness

Runzhou Ge; Zhuangzhuang Ding; Yihan Hu; Wenxin Shao; Li Huang; Kun Li; Qiang Liu

IoU対応のリアルタイムアンカーフリーシングルステージ3D検出

このレポートでは、リアルタイム3D検出の受賞ソリューションと、CVPR2021でのWaymoOpen Dataset Challengesの「最も効率的なモデル」を紹介します。昨年の受賞歴のあるモデルAFDetから拡張され、いくつかのモデルを作成しました。ベースモデルに変更を加えて、精度を向上させると同時に、レイテンシーを大幅に削減します。 AFDetV2という名前の変更されたモデルは、ライト3D特徴抽出器、拡張された受容野を備えた改良されたRPN、およびIoU対応の信頼スコアを生成する追加のサブヘッドを備えています。これらのモデルの機能強化は、強化されたデータ拡張、確率的重みの平均化、およびボクセル化のGPUベースの実装とともに、60.06ミリ秒の遅延と72.57 mAPH /の精度でAFDetV2の73.12mAPH / L2の勝利精度をもたらします。 AFDetV2ベースのL2。チャレンジスポンサーから「最も効率的なモデル」と題され、55.86ミリ秒の遅延が発生します。

In this report, we introduce our winning solution to the Real-time 3D Detection and also the "Most Efficient Model" in the Waymo Open Dataset Challenges at CVPR 2021. Extended from our last year's award-winning model AFDet, we have made a handful of modifications to the base model, to improve the accuracy and at the same time to greatly reduce the latency. The modified model, named as AFDetV2, is featured with a lite 3D Feature Extractor, an improved RPN with extended receptive field and an added sub-head that produces an IoU-aware confidence score. These model enhancements, together with enriched data augmentation, stochastic weights averaging, and a GPU-based implementation of voxelization, lead to a winning accuracy of 73.12 mAPH/L2 for our AFDetV2 with a latency of 60.06 ms, and an accuracy of 72.57 mAPH/L2 for our AFDetV2-base, entitled as the "Most Efficient Model" by the challenge sponsor, with a winning latency of 55.86 ms.

updated: Tue Aug 03 2021 20:56:54 GMT+0000 (UTC)

published: Thu Jul 29 2021 21:47:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト