Boosting Semi-Supervised 3D Object Detection with Semi-Sampling

Xiaopei Wu; Yang Zhao; Liang Peng; Hua Chen; Xiaoshui Huang; Binbin Lin; Haifeng Liu; Deng Cai; Wanli Ouyang

Current 3D object detection methods rely heavily on large amounts of annotated data. Semi-supervised learning can alleviate this issue. Previous semi-supervised 3D object detection methods directly follow the practice of fully-supervised methods when augmenting labeled and unlabeled data, which is sub-optimal. In this paper, we design a data augmentation method for semi-supervised learning, which we call Semi-Sampling. Specifically, we use ground truth labels and pseudo labels to crop gt samples and pseudo samples from labeled frames and unlabeled frames, respectively, and use them to build a gt sample database and a pseudo sample database. When training a teacher-student semi-supervised framework, we randomly select gt samples and pseudo samples and add them to both labeled frames and unlabeled frames, which acts as a strong data augmentation for them. Semi-Sampling can be regarded as an extension of gt-sampling to semi-supervised learning. Our method is simple but effective. We consistently improve state-of-the-art methods on the ScanNet, SUN-RGBD, and KITTI benchmarks by large margins. For example, when training with only 10% labeled data on ScanNet, we improve upon 3DIoUMatch by 3.1 mAP and 6.4 mAP in terms of mAP@0.25 and mAP@0.5, respectively. When training with only 1% labeled data on KITTI, we boost 3DIoUMatch by 3.5 mAP, 6.7 mAP, and 14.1 mAP on the car, pedestrian, and cyclist classes, respectively. Code will be made publicly available at https://github.com/LittlePey/Semi-Sampling.
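The crop, database, and sampling steps described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names (`crop_samples`, `paste_samples`), the axis-aligned box format `(cx, cy, cz, dx, dy, dz)`, and the overlap check that skips colliding samples are all assumptions made for the example. In particular, the abstract does not state how collisions are handled; the check here is borrowed from common gt-sampling practice.

```python
import numpy as np


def points_in_box(points, box):
    """Boolean mask of points inside an axis-aligned box (cx, cy, cz, dx, dy, dz)."""
    center, size = box[:3], box[3:6]
    return np.all(np.abs(points[:, :3] - center) <= size / 2.0, axis=1)


def crop_samples(points, boxes):
    """Crop one sample (points plus its box) per box from a single frame.

    `boxes` may be ground truth boxes (labeled frame) or pseudo labels
    predicted by the teacher (unlabeled frame).
    """
    return [
        {"points": points[points_in_box(points, box)].copy(), "box": box.copy()}
        for box in boxes
    ]


def boxes_overlap(a, b):
    """True if two axis-aligned boxes intersect on every axis."""
    return bool(np.all(np.abs(a[:3] - b[:3]) < (a[3:6] + b[3:6]) / 2.0))


def paste_samples(points, boxes, database, num_samples, rng):
    """Randomly pick samples from the database and add them to a frame.

    Samples whose box overlaps a box already in the frame are skipped
    (an assumption of this sketch, not taken from the abstract).
    """
    kept_boxes = list(boxes)
    chunks = [points]
    if not database:
        return points, np.array(kept_boxes).reshape(-1, 6)
    count = min(num_samples, len(database))
    for idx in rng.choice(len(database), size=count, replace=False):
        sample = database[idx]
        if any(boxes_overlap(sample["box"], box) for box in kept_boxes):
            continue
        chunks.append(sample["points"])
        kept_boxes.append(sample["box"])
    return np.concatenate(chunks, axis=0), np.array(kept_boxes).reshape(-1, 6)
```

In a teacher-student training loop, one would presumably call `crop_samples` on labeled frames with ground truth boxes and on unlabeled frames with the teacher's pseudo labels, concatenate the two resulting lists into one database, and then call `paste_samples` on every labeled and unlabeled frame before it is fed to the student.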

updated: Mon Nov 14 2022 03:22:03 GMT+0000 (UTC)

published: Mon Nov 14 2022 03:22:03 GMT+0000 (UTC)

arXiv
