A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation

Hui Tang; Kui Jia

新しいベンチマーク: ベア教師あり学習とダウンストリームドメイン適応のための Blender を使用した合成データの有用性について

コンピュータービジョンのディープラーニングは、ラベル付けされた大規模なトレーニングデータの価格で大きな成功を収めました。ただし、人件費が高く、ラベリングの精度が保証されていないため、関心のあるすべてのドメインの各タスクに対して徹底的なデータ注釈を行うことは現実的ではありません。さらに、制御不能なデータ収集プロセスにより、IID 以外のトレーニングデータとテストデータが生成され、望ましくない重複が存在する可能性があります。これらの煩わしさはすべて、典型的な理論の検証と新しい発見への露出を妨げる可能性があります.それらを回避するための代替手段は、ドメインのランダム化を使用して 3D レンダリングを介して合成データを生成することです。この作業では、裸の教師あり学習と下流のドメイン適応に関する深遠で広範な研究を行うことにより、この方針に沿って前進します。具体的には、3D レンダリングによって適切に制御された IID データ設定の下で、ショートカット学習などの典型的で重要な学習の洞察を体系的に検証し、一般化におけるさまざまなデータ体制とネットワークアーキテクチャの新しい法則を発見します。さらに、オブジェクトのスケール、マテリアルテクスチャ、照明、カメラの視点、3D シーンの背景など、一般化に対する画像形成要因の影響を調査します。さらに、事前トレーニングに使用した場合の合成データと実際のデータ間の転送可能性を比較するためのダウンストリームタスクとして、シミュレーションから現実への適応を使用します。これは、合成データの事前トレーニングが実際のテスト結果の改善にも有望であることを示しています。最後に、将来の研究を促進するために、S2RDA と呼ばれる、画像分類のための新しい大規模な合成から現実へのベンチマークを開発します。これは、シミュレーションから現実への移行により重要な課題を提供します。コードとデータセットは、https://github.com/huitangtang/On_the_Utility_of_Synthetic_Data で入手できます。

Deep learning in computer vision has achieved great success with the price of large-scale labeled training data. However, exhaustive data annotation is impracticable for each task of all domains of interest, due to high labor costs and unguaranteed labeling accuracy. Besides, the uncontrollable data collection process produces non-IID training and test data, where undesired duplication may exist. All these nuisances may hinder the verification of typical theories and exposure to new findings. To circumvent them, an alternative is to generate synthetic data via 3D rendering with domain randomization. We in this work push forward along this line by doing profound and extensive research on bare supervised learning and downstream domain adaptation. Specifically, under the well-controlled, IID data setting enabled by 3D rendering, we systematically verify the typical, important learning insights, e.g., shortcut learning, and discover the new laws of various data regimes and network architectures in generalization. We further investigate the effect of image formation factors on generalization, e.g., object scale, material texture, illumination, camera viewpoint, and background in a 3D scene. Moreover, we use the simulation-to-reality adaptation as a downstream task for comparing the transferability between synthetic and real data when used for pre-training, which demonstrates that synthetic data pre-training is also promising to improve real test results. Lastly, to promote future research, we develop a new large-scale synthetic-to-real benchmark for image classification, termed S2RDA, which provides more significant challenges for transfer from simulation to reality. The code and datasets are available at https://github.com/huitangtang/On_the_Utility_of_Synthetic_Data.

updated: Thu Mar 23 2023 09:02:33 GMT+0000 (UTC)

published: Thu Mar 16 2023 09:03:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト