Data Efficient 3D Learner via Knowledge Transferred from 2D Model

Ping-Chung Yu; Cheng Sun; Min Sun

2Dモデルから転送された知識によるデータ効率の高い3D学習者

登録された3Dポイントクラウドの収集とラベル付けにはコストがかかります。その結果、トレーニング用の3Dリソースは、通常、対応する2D画像と比較して量が制限されます。この作業では、RGB-D画像を介して強力な2Dモデルから知識を転送することにより、3Dタスクのデータ不足の課題に対処します。具体的には、2D画像用の強力で十分に訓練されたセマンティックセグメンテーションモデルを利用して、疑似ラベルでRGB-D画像を補強します。次に、拡張されたデータセットを使用して、3Dモデルを事前トレーニングできます。最後に、いくつかのラベル付き3Dインスタンスを微調整するだけで、私たちの方法は、3Dラベルの効率に合わせて調整された既存の最先端技術をすでに上回っています。また、平均教師とエントロピーの最小化の結果は、事前トレーニングによって改善できることを示しています。これは、転送された知識が半教師あり設定で役立つことを示唆しています。 2つの人気のある3Dモデルと3つの異なるタスクに対するアプローチの有効性を検証します。 ScanNetの公式評価では、データ効率の高いトラックで新しい最先端のセマンティックセグメンテーション結果を確立します。

Collecting and labeling the registered 3D point cloud is costly. As a result, 3D resources for training are typically limited in quantity compared to the 2D images counterpart. In this work, we deal with the data scarcity challenge of 3D tasks by transferring knowledge from strong 2D models via RGB-D images. Specifically, we utilize a strong and well-trained semantic segmentation model for 2D images to augment RGB-D images with pseudo-label. The augmented dataset can then be used to pre-train 3D models. Finally, by simply fine-tuning on a few labeled 3D instances, our method already outperforms existing state-of-the-art that is tailored for 3D label efficiency. We also show that the results of mean-teacher and entropy minimization can be improved by our pre-training, suggesting that the transferred knowledge is helpful in semi-supervised setting. We verify the effectiveness of our approach on two popular 3D models and three different tasks. On ScanNet official evaluation, we establish new state-of-the-art semantic segmentation results on the data-efficient track.

updated: Wed Mar 16 2022 09:14:44 GMT+0000 (UTC)

published: Wed Mar 16 2022 09:14:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト