3D Common Corruptions and Data Augmentation

Oğuzhan Fatih Kar; Teresa Yeo; Andrei Atanov; Amir Zamir

3Dの一般的な破損とデータ拡張

モデルのロバスト性を評価するための「破損」として使用できる一連の画像変換と、ニューラルネットワークをトレーニングするための「データ拡張」メカニズムを紹介します。提案された変換の主な違いは、Common Corruptionsなどの既存のアプローチとは異なり、シーンのジオメトリが変換に組み込まれているため、現実の世界で発生する可能性が高い破損につながることです。これらの変換は「効率的」（オンザフライで計算可能）、「拡張可能」（実際の画像のほとんどのデータセットに適用可能）であり、既存のモデルの脆弱性を明らかにし、「3Dデータ拡張」メカニズム。いくつかのタスクとデータセットで実行された評価は、3D情報を堅牢性のベンチマークとトレーニングに組み込むことで、堅牢性の研究に有望な方向性を開くことを示唆しています。

We introduce a set of image transformations that can be used as `corruptions' to evaluate the robustness of models as well as `data augmentation' mechanisms for training neural networks. The primary distinction of the proposed transformations is that, unlike existing approaches such as Common Corruptions, the geometry of the scene is incorporated in the transformations -- thus leading to corruptions that are more likely to occur in the real world. We show these transformations are `efficient' (can be computed on-the-fly), `extendable' (can be applied on most datasets of real images), expose vulnerability of existing models, and can effectively make models more robust when employed as `3D data augmentation' mechanisms. Our evaluations performed on several tasks and datasets suggest incorporating 3D information into robustness benchmarking and training opens up a promising direction for robustness research.

updated: Wed Mar 02 2022 22:31:16 GMT+0000 (UTC)

published: Wed Mar 02 2022 22:31:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト