Exploring Active 3D Object Detection from a Generalization Perspective

Yadan Luo; Zhuoxiao Chen; Zijian Wang; Xin Yu; Zi Huang; Mahsa Baktashmotlagh

一般化の観点からのアクティブ 3D オブジェクト検出の調査

LiDAR ベースの 3D オブジェクト検出における高いアノテーションコストを軽減するために、アクティブラーニングは、モデルのパフォーマンスを損なうことなく、ラベル付けされていないデータのごく一部のみを選択してアノテーションを付けることを学習する有望なソリューションです。ただし、私たちの実証研究では、主流の不確実性ベースおよび多様性ベースのアクティブラーニングポリシーは、3D 検出タスクに適用すると効果的ではないことが示唆されています。これは、点群の情報量とボックスレベルの注釈コストの間のトレードオフのバランスを取ることができないためです。この制限を克服するために、ポイントクラウド取得のためのフレームワーク Crb の 3 つの新しい基準 (ラベルの簡潔さ)、特徴の代表性、幾何学的バランスを共同で調査します。たとえば、点群密度) をラベル付けされていないサンプルプールから収集し、注釈を付けるオブジェクトが少なく、有益なものを貪欲に選択します。私たちの理論的分析は、提案された基準が選択されたサブセットの限界分布と見えないテストセットの事前分布を整列させ、一般化誤差の上限を最小限に抑えることを示しています。 Crb の有効性と適用性を検証するために、KITTI と Waymo の 2 つのベンチマーク 3D オブジェクト検出データセットで広範な実験を行い、1 段階 (つまり、Second) と 2 段階の 3D 検出器 (つまり、Pv-rcnn) の両方を調べます。実験は、提案されたアプローチが既存の能動学習戦略よりも優れており、境界ボックスと点群のそれぞれ 1% と 8% の注釈を必要とする完全に教師ありのパフォーマンスを達成することを証明しています。ソースコード: https://github.com/Luoyadan/CRB-active-3Ddet.

To alleviate the high annotation cost in LiDAR-based 3D object detection, active learning is a promising solution that learns to select only a small portion of unlabeled data to annotate, without compromising model performance. Our empirical study, however, suggests that mainstream uncertainty-based and diversity-based active learning policies are not effective when applied in the 3D detection task, as they fail to balance the trade-off between point cloud informativeness and box-level annotation costs. To overcome this limitation, we jointly investigate three novel criteria in our framework Crb for point cloud acquisition - label conciseness}, feature representativeness and geometric balance, which hierarchically filters out the point clouds of redundant 3D bounding box labels, latent features and geometric characteristics (e.g., point cloud density) from the unlabeled sample pool and greedily selects informative ones with fewer objects to annotate. Our theoretical analysis demonstrates that the proposed criteria align the marginal distributions of the selected subset and the prior distributions of the unseen test set, and minimizes the upper bound of the generalization error. To validate the effectiveness and applicability of Crb, we conduct extensive experiments on the two benchmark 3D object detection datasets of KITTI and Waymo and examine both one-stage (i.e., Second) and two-stage 3D detectors (i.e., Pv-rcnn). Experiments evidence that the proposed approach outperforms existing active learning strategies and achieves fully supervised performance requiring 1% and 8% annotations of bounding boxes and point clouds, respectively. Source code: https://github.com/Luoyadan/CRB-active-3Ddet.

updated: Mon Jan 23 2023 02:43:03 GMT+0000 (UTC)

published: Mon Jan 23 2023 02:43:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト