Detecting Adversarial Examples with Bayesian Neural Network

Yao Li; Tongyi Tang; Cho-Jui Hsieh; Thomas C. M. Lee

ベイジアンニューラルネットワークによる敵対的な例の検出

この論文では、ランダムなコンポーネントが予測子の滑らかさを改善し、ディープニューラルネットワークの出力分布のシミュレーションをより容易にするという観察に動機付けられた敵対的な例を検出するための新しいフレームワークを提案します。これらの観察により、敵対例検出のパフォーマンスを向上させるために、BATer の略である新しいベイジアン敵対例検出器を提案します。具体的には、自然例と敵対例の間の隠れ層出力の分布の違いを研究し、ベイジアンニューラルネットワーク (BNN) のランダム性を使用して隠れ層出力分布をシミュレートし、分布分散を活用して敵対例を検出することを提案します。 BNN の利点は、出力が確率的であるのに対し、ランダムコンポーネントのないニューラルネットワークにはそのような特性がないことです。一般的な攻撃に対するいくつかのベンチマークデータセットの経験的結果は、提案された BATer が敵対例の検出において最先端の検出器よりも優れていることを示しています。

In this paper, we propose a new framework to detect adversarial examples motivated by the observations that random components can improve the smoothness of predictors and make it easier to simulate output distribution of deep neural network. With these observations, we propose a novel Bayesian adversarial example detector, short for BATer, to improve the performance of adversarial example detection. In specific, we study the distributional difference of hidden layer output between natural and adversarial examples, and propose to use the randomness of Bayesian neural network (BNN) to simulate hidden layer output distribution and leverage the distribution dispersion to detect adversarial examples. The advantage of BNN is that the output is stochastic while neural networks without random components do not have such characteristics. Empirical results on several benchmark datasets against popular attacks show that the proposed BATer outperforms the state-of-the-art detectors in adversarial example detection.

updated: Fri May 28 2021 20:04:48 GMT+0000 (UTC)

published: Tue May 18 2021 15:51:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト