Observations on K-image Expansion of Image-Mixing Augmentation for Classification

Joonhyun Jeong; Sungmin Cha; Youngjoon Yoo; Sangdoo Yun; Taesup Moon; Jongwon Choi

分類のための画像混合拡張の K 画像展開に関する観察

通常、2 つの画像を混合することを含む画像混合拡張 (Mixup や CutMix など) は、画像分類の事実上のトレーニング手法になっています。画像分類で大きな成功を収めたにもかかわらず、混合する画像の数は文献では解明されていません。単純な K 画像展開のみがパフォーマンスの低下につながることが示されています。この研究では、ディリクレ事前分布の下でのスティック破壊プロセスに基づいて、新しい K 画像混合拡張を導出します。広範な実験と分析を通じて、従来の2画像混合拡張方法に対するK画像拡張拡張の優位性を示します。（1）より堅牢で一般化された分類子。 (2) より望ましい損失景観形状。 (3) より優れた敵対的ロバスト性。さらに、確率モデルがサンプル単位の不確実性を測定し、検索時間を 7 分の 1 に短縮することでネットワークアーキテクチャ検索の効率を高めることができることを示します。コードは https://github.com/yjyoo3312/DCutMix-PyTorch.git で入手できます。

Image-mixing augmentations (e.g., Mixup and CutMix), which typically involve mixing two images, have become the de-facto training techniques for image classification. Despite their huge success in image classification, the number of images to be mixed has not been elucidated in the literature: only the naive K-image expansion has been shown to lead to performance degradation. This study derives a new K-image mixing augmentation based on the stick-breaking process under Dirichlet prior distribution. We demonstrate the superiority of our K-image expansion augmentation over conventional two-image mixing augmentation methods through extensive experiments and analyses: (1) more robust and generalized classifiers; (2) a more desirable loss landscape shape; (3) better adversarial robustness. Moreover, we show that our probabilistic model can measure the sample-wise uncertainty and boost the efficiency for network architecture search by achieving a 7-fold reduction in the search time. Code will be available at https://github.com/yjyoo3312/DCutMix-PyTorch.git.

updated: Fri Mar 17 2023 11:07:16 GMT+0000 (UTC)

published: Fri Oct 08 2021 16:58:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト