A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective

Chanwoo Park; Sangdoo Yun; Sanghyuk Chun

混合サンプルデータ拡張の統合分析: 損失関数の観点

Mixup や CutMix などの混合サンプルデータ拡張 (MSDA) の最初の統一された理論的分析を提案します。理論的な結果は、ミキシング戦略の選択に関係なく、MSDA が基礎となるトレーニング損失のピクセルレベルの正則化および最初のレイヤーパラメーターの正則化として動作することを示しています。同様に、私たちの理論的結果は、MSDA トレーニング戦略がバニラトレーニング戦略と比較して敵対的ロバスト性と一般化を改善できることを裏付けています。理論的な結果を使用して、MSDA のさまざまな設計選択がどのように異なる動作をするかについての高レベルの理解を提供します。たとえば、最も一般的な MSDA メソッドである Mixup と CutMix の動作が異なることを示します。たとえば、CutMix はピクセル距離によって入力勾配を正則化しますが、Mixup はピクセル距離に関係なく入力勾配を正則化します。理論的な結果は、最適な MSDA 戦略がタスク、データセット、またはモデルパラメーターに依存することも示しています。これらの観察から、Mixup と CutMix (HMix) のハイブリッドバージョンである一般化された MSDA と、Mixup と CutMix の単純な拡張である Gaussian Mixup (GMix) を提案します。私たちの実装は Mixup と CutMix の利点を活用できますが、実装は非常に効率的であり、計算コストは Mixup と CutMix のようにほとんど無視できます。私たちの経験的研究は、HMix と GMix が CIFAR-100 と ImageNet 分類タスクで以前の最先端の MSDA メソッドよりも優れていることを示しています。ソースコードは https://github.com/naver-ai/hmix-gmix で入手できます

We propose the first unified theoretical analysis of mixed sample data augmentation (MSDA), such as Mixup and CutMix. Our theoretical results show that regardless of the choice of the mixing strategy, MSDA behaves as a pixel-level regularization of the underlying training loss and a regularization of the first layer parameters. Similarly, our theoretical results support that the MSDA training strategy can improve adversarial robustness and generalization compared to the vanilla training strategy. Using the theoretical results, we provide a high-level understanding of how different design choices of MSDA work differently. For example, we show that the most popular MSDA methods, Mixup and CutMix, behave differently, e.g., CutMix regularizes the input gradients by pixel distances, while Mixup regularizes the input gradients regardless of pixel distances. Our theoretical results also show that the optimal MSDA strategy depends on tasks, datasets, or model parameters. From these observations, we propose generalized MSDAs, a Hybrid version of Mixup and CutMix (HMix) and Gaussian Mixup (GMix), simple extensions of Mixup and CutMix. Our implementation can leverage the advantages of Mixup and CutMix, while our implementation is very efficient, and the computation cost is almost neglectable as Mixup and CutMix. Our empirical study shows that our HMix and GMix outperform the previous state-of-the-art MSDA methods in CIFAR-100 and ImageNet classification tasks. Source code is available at https://github.com/naver-ai/hmix-gmix

updated: Sun Aug 21 2022 15:54:25 GMT+0000 (UTC)

published: Sun Aug 21 2022 15:54:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト