Two Heads are Better than One: Robust Learning Meets Multi-branch Models

Dong Huang; Qingwen Bu; Yuhao Qing; Haowen Pi; Sen Wang; Heming Cui

2 つのヘッドは 1 つよりも優れています: 堅牢な学習とマルチブランチモデルの融合

ディープニューラルネットワーク (DNN) は敵対的な例に対して脆弱であり、DNN は知覚できない摂動を含む入力が原因で誤った出力に誘導されます。信頼できる効果的な防御方法である敵対的トレーニングは、ニューラルネットワークの脆弱性を大幅に軽減する可能性があり、堅牢な学習のデファクトスタンダードになる可能性があります。最近の多くの研究では、より良い敵対的例を生成する方法や、生成モデルを使用して追加のトレーニングデータを生成する方法など、データ中心の哲学を実践していますが、モデル自体を振り返り、深層特徴分布の観点から敵対的ロバスト性を再検討します。洞察に満ちた補完性。この論文では、敵対的トレーニング用の元のデータセットのみを使用して最先端のパフォーマンスを得るために、Branch Orthogonality adveRsarial Training (BORT) を提案します。複数の直交ソリューション空間を統合するという設計思想を実践するために、推論時間を増加させることなく敵対的攻撃を凌駕するシンプルでわかりやすいマルチブランチニューラルネットワークを活用します。マルチブランチモデルの各解空間を直交させるために、対応する損失関数、ブランチ直交損失をヒューリスティックに提案します。サイズ ϵ= 8/255 の ℓ_∞ ノルム有界摂動に対して、CIFAR-10、CIFAR-100、および SVHN に対するアプローチをそれぞれ評価します。徹底的な実験を行って、私たちの方法がすべての最先端の方法をトリックなしで超えていることを示します。トレーニングに追加データを使用しないすべての方法と比較して、当社のモデルは CIFAR-10 および CIFAR-100 で 67.3% および 41.5% のロバスト精度を達成します (最新技術より +7.23% および +9.07% 向上) ）。また、私たちのものよりもはるかに大きなスケールのトレーニングセットを使用する方法よりも優れています。すべてのモデルとコードは、https://github.com/huangd1999/BORT でオンラインで入手できます。

Deep neural networks (DNNs) are vulnerable to adversarial examples, in which DNNs are misled to false outputs due to inputs containing imperceptible perturbations. Adversarial training, a reliable and effective method of defense, may significantly reduce the vulnerability of neural networks and becomes the de facto standard for robust learning. While many recent works practice the data-centric philosophy, such as how to generate better adversarial examples or use generative models to produce additional training data, we look back to the models themselves and revisit the adversarial robustness from the perspective of deep feature distribution as an insightful complementarity. In this paper, we propose Branch Orthogonality adveRsarial Training (BORT) to obtain state-of-the-art performance with solely the original dataset for adversarial training. To practice our design idea of integrating multiple orthogonal solution spaces, we leverage a simple and straightforward multi-branch neural network that eclipses adversarial attacks with no increase in inference time. We heuristically propose a corresponding loss function, branch-orthogonal loss, to make each solution space of the multi-branch model orthogonal. We evaluate our approach on CIFAR-10, CIFAR-100, and SVHN against ℓ_∞ norm-bounded perturbations of size ϵ= 8/255, respectively. Exhaustive experiments are conducted to show that our method goes beyond all state-of-the-art methods without any tricks. Compared to all methods that do not use additional data for training, our models achieve 67.3% and 41.5% robust accuracy on CIFAR-10 and CIFAR-100 (improving upon the state-of-the-art by +7.23% and +9.07%). We also outperform methods using a training set with a far larger scale than ours. All our models and codes are available online at https://github.com/huangd1999/BORT.

updated: Wed Aug 17 2022 05:42:59 GMT+0000 (UTC)

published: Wed Aug 17 2022 05:42:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト