Medical Aegis: Robust adversarial protectors for medical images

Qingsong Yao; Zecheng He; S. Kevin Zhou

Medical Aegis：医療画像用の堅牢な敵対的プロテクター

ディープニューラルネットワークベースの医用画像システムは、敵対的な例に対して脆弱です。多くの防御メカニズムが文献で提案されていますが、既存の防御は、防御システムについてほとんど知らず、防御に従って攻撃戦略を変更しない受動的な攻撃者を想定しています。最近の研究によると、攻撃者が防御システムについて完全な知識を持っていると想定される強力な適応攻撃は、既存の防御を簡単に回避できることが示されています。この論文では、MedicalAegisと呼ばれる新しい敵対的な防御システムの例を提案します。私たちの知る限りでは、Medical Aegisは、医療画像に対する強力な適応型の敵対的な例の攻撃に首尾よく対処した文献の最初の防御です。 Medical Aegisは、2層のプロテクターを誇っています。クッションの最初の層は、高周波成分を除去することで攻撃の敵対的な操作能力を弱めますが、元の画像の分類パフォーマンスへの影響は最小限に抑えられます。 Shieldの第2層は、保護されたモデルのロジットを予測するために、クラスごとのDNNモデルのセットを学習します。シールドの予測からの逸脱は、敵対的な例を示しています。 Shieldは、DNNモデルの浅い層に堅牢なトレイルが存在するというストレステストの観察に触発されており、適応攻撃はほとんど破壊できません。実験結果は、提案された防御が適応的攻撃を正確に検出し、モデル推論のオーバーヘッドはごくわずかであることを示しています。

Deep neural network based medical image systems are vulnerable to adversarial examples. Many defense mechanisms have been proposed in the literature, however, the existing defenses assume a passive attacker who knows little about the defense system and does not change the attack strategy according to the defense. Recent works have shown that a strong adaptive attack, where an attacker is assumed to have full knowledge about the defense system, can easily bypass the existing defenses. In this paper, we propose a novel adversarial example defense system called Medical Aegis. To the best of our knowledge, Medical Aegis is the first defense in the literature that successfully addresses the strong adaptive adversarial example attacks to medical images. Medical Aegis boasts two-tier protectors: The first tier of Cushion weakens the adversarial manipulation capability of an attack by removing its high-frequency components, yet posing a minimal effect on classification performance of the original image; the second tier of Shield learns a set of per-class DNN models to predict the logits of the protected model. Deviation from the Shield's prediction indicates adversarial examples. Shield is inspired by the observations in our stress tests that there exist robust trails in the shallow layers of a DNN model, which the adaptive attacks can hardly destruct. Experimental results show that the proposed defense accurately detects adaptive attacks, with negligible overhead for model inference.

updated: Mon Nov 22 2021 03:17:07 GMT+0000 (UTC)

published: Mon Nov 22 2021 03:17:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト