From a Fourier-Domain Perspective on Adversarial Examples to a Wiener Filter Defense for Semantic Segmentation

Nikhil Kapoor; Andreas Bär; Serin Varghese; Jan David Schneider; Fabian Hüger; Peter Schlicht; Tim Fingscheidt

Despite recent advancements, deep neural networks are not robust against adversarial perturbations. Many of the proposed adversarial defense approaches use computationally expensive training mechanisms that do not scale to complex real-world tasks such as semantic segmentation, and offer only marginal improvements. In addition, fundamental questions on the nature of adversarial perturbations and their relation to the network architecture are largely understudied. In this work, we study the adversarial problem from a frequency domain perspective. More specifically, we analyze discrete Fourier transform (DFT) spectra of several adversarial images and report two major findings: First, there exists a strong connection between a model architecture and the nature of adversarial perturbations that can be observed and addressed in the frequency domain. Second, the observed frequency patterns are largely image- and attack-type independent, which is important for the practical impact of any defense making use of such patterns. Motivated by these findings, we additionally propose an adversarial defense method based on the well-known Wiener filters that captures and suppresses adversarial frequencies in a data-driven manner. Our proposed method not only generalizes across unseen attacks but also beats five existing state-of-the-art methods across two models in a variety of attack settings.
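
As a rough illustration of the idea in the abstract, the following minimal sketch estimates a frequency-domain Wiener filter from pairs of clean and adversarial images and applies it to a new input. It uses the textbook Wiener gain (clean power divided by clean power plus perturbation power, averaged over the data) and is not necessarily the authors' exact estimator. The function names `estimate_wiener_filter` and `apply_wiener_filter`, the single-channel `(N, H, W)` layout, and the `eps` stabilizer are assumptions made for this example.

```python
import numpy as np


def estimate_wiener_filter(clean, adversarial, eps=1e-12):
    """Estimate a per-frequency Wiener gain from image pairs.

    clean, adversarial: float arrays of shape (N, H, W) (assumed layout).
    Returns an (H, W) array of gains in [0, 1].
    """
    # The perturbation is treated as additive noise on the clean image.
    perturbation = adversarial - clean
    # Average power spectra over the N training pairs (data-driven estimate).
    clean_power = np.mean(np.abs(np.fft.fft2(clean, axes=(-2, -1))) ** 2, axis=0)
    noise_power = np.mean(np.abs(np.fft.fft2(perturbation, axes=(-2, -1))) ** 2, axis=0)
    # Textbook Wiener gain: close to 1 where the clean signal dominates,
    # close to 0 where the perturbation dominates.
    return clean_power / (clean_power + noise_power + eps)


def apply_wiener_filter(image, gain):
    """Suppress perturbation-dominated frequencies of one (H, W) image."""
    spectrum = np.fft.fft2(image)
    # The gain is real and symmetric for real-valued training images, so the
    # imaginary part of the inverse transform is only numerical round-off.
    return np.real(np.fft.ifft2(gain * spectrum))
```

In a defense setting of this kind, the filter would be estimated once from a set of attacked training images and then applied as a preprocessing step before the segmentation network; for color images the same procedure could be run per channel.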

updated: Wed Apr 21 2021 15:44:10 GMT+0000 (UTC)

published: Wed Dec 02 2020 22:06:04 GMT+0000 (UTC)

arXiv
