Explainability Matters: Backdoor Attacks on Medical Imaging

Munachiso Nwadike; Takumi Miyawaki; Esha Sarkar; Michail Maniatakos; Farah Shamout

説明性の問題：医用画像に対するバックドア攻撃

ディープニューラルネットワークはバックドア攻撃に対して脆弱であることが示されています。バックドア攻撃は、モデルトレーニングの前にトレーニングセットに簡単に導入される可能性があります。最近の研究は、自然画像やおもちゃのデータセットに対するバックドア攻撃の調査に焦点を合わせています。その結果、バックドアの正確な影響は、誤診が非常にコストがかかる可能性がある医療画像などの複雑な実際のアプリケーションではまだ完全には理解されていません。このホワイトペーパーでは、攻撃者がトレーニングデータセットを操作して攻撃を実行できることを前提として、胸部X線撮影を使用してマルチラベル疾患分類タスクに対するバックドア攻撃の影響を調査します。最先端のアーキテクチャの広範な評価は、数ピクセルの摂動を伴う画像をトレーニングセットに導入することにより、攻撃者がトレーニング手順に関与することなくバックドアを正常に実行できることを示しています。単純な3×3ピクセルのトリガーは、感染した画像のセットで最大1.00の受信者動作特性（AUROC）曲線の下の領域を達成できます。クリーンな画像のセットでは、バックドアニューラルネットワークは最大0.85 AUROCを達成でき、攻撃のステルス性を強調しています。ディープラーニングベースの診断システムの使用が臨床診療で急増するにつれて、推論時間内に空間的にローカライズされたバックドアを識別できるため、このコンテキストで説明可能性がどのように不可欠であるかも示します。

Deep neural networks have been shown to be vulnerable to backdoor attacks, which could be easily introduced to the training set prior to model training. Recent work has focused on investigating backdoor attacks on natural images or toy datasets. Consequently, the exact impact of backdoors is not yet fully understood in complex real-world applications, such as in medical imaging where misdiagnosis can be very costly. In this paper, we explore the impact of backdoor attacks on a multi-label disease classification task using chest radiography, with the assumption that the attacker can manipulate the training dataset to execute the attack. Extensive evaluation of a state-of-the-art architecture demonstrates that by introducing images with few-pixel perturbations into the training set, an attacker can execute the backdoor successfully without having to be involved with the training procedure. A simple 3×3 pixel trigger can achieve up to 1.00 Area Under the Receiver Operating Characteristic (AUROC) curve on the set of infected images. In the set of clean images, the backdoored neural network could still achieve up to 0.85 AUROC, highlighting the stealthiness of the attack. As the use of deep learning based diagnostic systems proliferates in clinical practice, we also show how explainability is indispensable in this context, as it can identify spatially localized backdoors in inference time.

updated: Wed Dec 30 2020 09:41:19 GMT+0000 (UTC)

published: Wed Dec 30 2020 09:41:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト