An Orthogonal Classifier for Improving the Adversarial Robustness of Neural Networks

Cong Xu; Xiang Li; Min Yang

ニューラルネットワークの敵対的ロバスト性を改善するための直交分類器

ニューラルネットワークは、人工的に設計された敵対的摂動の影響を受けやすくなっています。最近の取り組みでは、分類層に特定の変更を加えることで、ニューラルネットワークの堅牢性を向上させることができることが示されています。この論文では、エントリが同じ大きさである密な直交重み行列を明示的に構築し、それによって新しいロバストな分類器を導きます。提案された分類器は、前の作業での望ましくない構造的冗長性の問題を回避します。クリーンなデータの標準トレーニングでこの分類器を適用することで、モデルの高精度と優れた堅牢性を確保できます。さらに、追加の敵対的なサンプルが使用される場合、特別な最悪の場合の損失の助けを借りて、より良いロバスト性をさらに得ることができます。実験結果は、私たちの方法が効率的であり、多くの最先端の防御的アプローチに対して競争力があることを示しています。私たちのコードはhttps://github.com/MTandHJ/robocで入手できます。

Neural networks are susceptible to artificially designed adversarial perturbations. Recent efforts have shown that imposing certain modifications on classification layer can improve the robustness of the neural networks. In this paper, we explicitly construct a dense orthogonal weight matrix whose entries have the same magnitude, thereby leading to a novel robust classifier. The proposed classifier avoids the undesired structural redundancy issue in previous work. Applying this classifier in standard training on clean data is sufficient to ensure the high accuracy and good robustness of the model. Moreover, when extra adversarial samples are used, better robustness can be further obtained with the help of a special worst-case loss. Experimental results show that our method is efficient and competitive to many state-of-the-art defensive approaches. Our code is available at https://github.com/MTandHJ/roboc.

updated: Wed May 19 2021 13:12:14 GMT+0000 (UTC)

published: Wed May 19 2021 13:12:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト