Learning Data Augmentation with Online Bilevel Optimization for Image Classification

Saypraseuth Mounsaveng; Issam Laradji; Ismail Ben Ayed; David Vazquez; Marco Pedersoli

画像分類のためのオンラインバイレベル最適化による学習データ増強

データの拡張は、一般化のパフォーマンスを向上させるための機械学習の重要なプラクティスです。ただし、最適なデータ拡張ハイパーパラメータを見つけるには、ドメイン知識または計算量の多い検索が必要です。この問題に対処するために、変換の効果的な分散を学習して一般化を改善するネットワークを自動的にトレーニングする効率的なアプローチを提案します。バイレベル最適化を使用して、検証セットを使用してデータ拡張パラメーターを直接最適化します。このフレームワークは、分類器のようなエンドタスクモデルと共同で最適なデータ拡張を学習するための一般的なソリューションとして使用できます。結果は、私たちの共同トレーニング方法が、注意深く手作りされたデータ拡張に匹敵するか、それよりも優れた画像分類精度を生み出すことを示しています。それでも、データ拡張ハイパーパラメータに高価な外部検証ループは必要ありません。

Data augmentation is a key practice in machine learning for improving generalization performance. However, finding the best data augmentation hyperparameters requires domain knowledge or a computationally demanding search. We address this issue by proposing an efficient approach to automatically train a network that learns an effective distribution of transformations to improve its generalization. Using bilevel optimization, we directly optimize the data augmentation parameters using a validation set. This framework can be used as a general solution to learn the optimal data augmentation jointly with an end task model like a classifier. Results show that our joint training method produces an image classification accuracy that is comparable to or better than carefully hand-crafted data augmentation. Yet, it does not need an expensive external validation loop on the data augmentation hyperparameters.

updated: Tue Nov 10 2020 16:11:57 GMT+0000 (UTC)

published: Thu Jun 25 2020 21:01:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト