Simple black-box universal adversarial attacks on medical image classification based on deep neural networks

Kazuki Koga; Kazuhiro Takemoto

Universal adversarial attacks, which hinder most deep neural network (DNN) tasks using only a single small perturbation called a universal adversarial perturbation (UAP), are a realistic security threat to the practical application of DNNs. In particular, such attacks cause serious problems in medical imaging. Computer-based systems generally operate under a black-box condition, in which an adversary can only query the model with inputs and observe its outputs. The impact of UAPs therefore seems limited, because the widely used algorithms for generating UAPs require a white-box condition in which adversaries can access the model weights and loss gradients. Nevertheless, we demonstrate that UAPs can be easily generated from a relatively small dataset under black-box conditions. Specifically, we propose a method for generating UAPs using a simple hill-climbing search based only on DNN outputs and demonstrate the validity of the proposed method using representative DNN-based medical image classifications. Black-box UAPs can be used to conduct both non-targeted and targeted attacks. Overall, the black-box UAPs showed high attack success rates (40% to 90%), although some of them had relatively low success rates because the method uses only limited information to generate UAPs. Vulnerability to black-box UAPs was observed in several model architectures. The results indicate that adversaries can generate UAPs through a simple procedure even under the black-box condition to foil or control DNN-based medical image diagnoses, and that UAPs are therefore a more realistic security threat than previously assumed.
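The hill-climbing idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the proposal distribution, the objective, or the norm constraint, so the random signed steps, the mean true-class confidence score, the L-infinity bound `eps`, and the toy linear softmax model below are all assumptions chosen for illustration. The names `hill_climb_uap` and `make_toy_model` are hypothetical.

```python
import numpy as np


def make_toy_model(n_features, n_classes, seed=1):
    """Hypothetical stand-in for a black-box classifier: a random linear softmax model."""
    rng = np.random.default_rng(seed)
    weights = rng.normal(size=(n_features, n_classes))

    def predict_proba(x):
        # Only class probabilities are exposed; weights and gradients stay hidden.
        logits = x.reshape(len(x), -1) @ weights
        logits = logits - logits.max(axis=1, keepdims=True)
        exp = np.exp(logits)
        return exp / exp.sum(axis=1, keepdims=True)

    return predict_proba


def hill_climb_uap(predict_proba, images, labels, eps=0.1, step=0.02,
                   n_iters=200, seed=0):
    """Non-targeted UAP search that queries model outputs only (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    uap = np.zeros_like(images[0])

    def score(delta):
        # Mean confidence in the true class on perturbed inputs; lower favors the attacker.
        probs = predict_proba(np.clip(images + delta, 0.0, 1.0))
        return probs[np.arange(len(labels)), labels].mean()

    best = score(uap)
    for _ in range(n_iters):
        # Assumed proposal: a random signed step, projected back into the L-infinity ball.
        proposal = step * rng.choice([-1.0, 1.0], size=uap.shape)
        candidate = np.clip(uap + proposal, -eps, eps)
        candidate_score = score(candidate)
        if candidate_score < best:
            # Hill climbing: keep the candidate only if it improves the objective.
            uap, best = candidate, candidate_score
    return uap, best


# Toy usage: 32 random 8x8 "images" in [0, 1], labeled with the model's own predictions.
eps = 0.1
data_rng = np.random.default_rng(42)
images = data_rng.random((32, 8, 8))
predict_proba = make_toy_model(n_features=64, n_classes=3)
labels = predict_proba(images).argmax(axis=1)
clean_conf = predict_proba(images)[np.arange(len(labels)), labels].mean()
uap, best = hill_climb_uap(predict_proba, images, labels, eps=eps)
```

Because a candidate is accepted only when it lowers the score, the mean true-class confidence never increases over the run, and the clipping keeps the single shared perturbation within `eps` for every image. A targeted variant, under the same assumptions, would instead accept candidates that raise the mean confidence in a chosen target class.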

updated: Wed Aug 11 2021 00:59:34 GMT+0000 (UTC)

published: Wed Aug 11 2021 00:59:34 GMT+0000 (UTC)

arXiv
