Adversarial Robustness by Design through Analog Computing and Synthetic Gradients

Alessandro Cappelli; Ruben Ohana; Julien Launay; Laurent Meunier; Iacopo Poli; Florent Krzakala

アナログ計算と合成勾配による設計による敵対的ロバスト性

ホワイトボックスとブラックボックスの両方の設定で自然な精度を損なうことなく堅牢性を提供する、光コプロセッサに触発された敵対的な攻撃に対する新しい防御メカニズムを提案します。このハードウェアコプロセッサは、非線形の固定ランダム変換を実行します。この場合、パラメータは不明であり、十分な大きさの次元に対して十分な精度で取得することは不可能です。ホワイトボックス設定では、ランダム投影のパラメーターを難読化することで防御が機能します。難読化された勾配に依存する他の防御とは異なり、難読化されたパラメーターの信頼できる後方微分可能な近似を構築できないことがわかります。さらに、私たちのモデルはハイブリッドバックプロパゲーション-合成勾配法で良好な自然精度に達しますが、敵対的例を生成するために採用された場合、同じアプローチは最適ではありません。光学システムでのランダム投影と2値化の組み合わせにより、さまざまなタイプのブラックボックス攻撃に対する堅牢性も向上することがわかりました。最後に、私たちのハイブリッドトレーニング方法は、転送攻撃に対して堅牢な機能を構築します。 CIFAR-10およびCIFAR-100で、畳み込み機能の上に防御を配置する、VGGのようなアーキテクチャでのアプローチを示します。コードはhttps://github.com/lightonai/adversarial-robustness-by-designで入手できます。

We propose a new defense mechanism against adversarial attacks inspired by an optical co-processor, providing robustness without compromising natural accuracy in both white-box and black-box settings. This hardware co-processor performs a nonlinear fixed random transformation, where the parameters are unknown and impossible to retrieve with sufficient precision for large enough dimensions. In the white-box setting, our defense works by obfuscating the parameters of the random projection. Unlike other defenses relying on obfuscated gradients, we find we are unable to build a reliable backward differentiable approximation for obfuscated parameters. Moreover, while our model reaches a good natural accuracy with a hybrid backpropagation - synthetic gradient method, the same approach is suboptimal if employed to generate adversarial examples. We find the combination of a random projection and binarization in the optical system also improves robustness against various types of black-box attacks. Finally, our hybrid training method builds robust features against transfer attacks. We demonstrate our approach on a VGG-like architecture, placing the defense on top of the convolutional features, on CIFAR-10 and CIFAR-100. Code is available at https://github.com/lightonai/adversarial-robustness-by-design.

updated: Wed Jan 06 2021 16:15:29 GMT+0000 (UTC)

published: Wed Jan 06 2021 16:15:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト