AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Zicheng Liu; Siyuan Li; Di Wu; Zihan Liu; Zhiyuan Chen; Lirong Wu; Stan Z. Li

AutoMix: より強力な分類器のためのミックスアップの力を明らかにする

データ混合の拡張は、ディープニューラルネットワークの一般化能力を向上させるのに効果的であることが証明されています。初期の方法では手動のポリシー (線形補間など) によってサンプルを混合しますが、最近の方法では顕著性情報を利用して、複雑なオフライン最適化によって混合サンプルとラベルを照合します。ただし、正確な混合ポリシーと最適化の複雑さの間にはトレードオフが生じます。この課題に対処するために、混合ポリシーがパラメーター化され、最終的な分類目標を直接提供する、新しい自動混合 (AutoMix) フレームワークを提案します。具体的には、AutoMix はミックスアップ分類を、対応するサブネットワークを使用して 2 つのサブタスク (つまり、混合サンプル生成とミックスアップ分類) に再定式化し、2 レベルの最適化フレームワークでそれらを解決します。生成のために、学習可能な軽量ミックスアップジェネレーターである Mix Block は、対応する混合ラベルの直接監督下でパッチごとの関係をモデル化することにより、混合サンプルを生成するように設計されています。バイレベル最適化の劣化と不安定性を防ぐために、AutoMix をエンドツーエンドでトレーニングするためのモーメンタムパイプラインをさらに導入します。 9 つの画像ベンチマークに関する大規模な実験により、さまざまな分類シナリオおよびダウンストリームタスクにおける最先端技術と比較した AutoMix の優位性が証明されています。

Data mixing augmentation have proved to be effective in improving the generalization ability of deep neural networks. While early methods mix samples by hand-crafted policies (e.g., linear interpolation), recent methods utilize saliency information to match the mixed samples and labels via complex offline optimization. However, there arises a trade-off between precise mixing policies and optimization complexity. To address this challenge, we propose a novel automatic mixup (AutoMix) framework, where the mixup policy is parameterized and serves the ultimate classification goal directly. Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i.e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework. For the generation, a learnable lightweight mixup generator, Mix Block, is designed to generate mixed samples by modeling patch-wise relationships under the direct supervision of the corresponding mixed labels. To prevent the degradation and instability of bi-level optimization, we further introduce a momentum pipeline to train AutoMix in an end-to-end manner. Extensive experiments on nine image benchmarks prove the superiority of AutoMix compared with state-of-the-art in various classification scenarios and downstream tasks.

updated: Wed Sep 21 2022 21:08:24 GMT+0000 (UTC)

published: Wed Mar 24 2021 07:21:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト