Adversarial Heart Attack: Neural Networks Fooled to Segment Heart Symbols in Chest X-Ray Images

Gerda Bortsova; Florian Dubost; Laurens Hogeweg; Ioannis Katramados; Marleen de Bruijne

敵対的な心臓発作：胸部X線画像で心臓のシンボルをセグメント化するためにだまされたニューラルネットワーク

敵対的攻撃は、入力データを悪意を持って変更して自動決定システムの予測を誤解させることで構成され、自動医療画像分析にとって深刻な脅威となる可能性があります。以前の研究では、ホワイトボックス攻撃の設定で、ニューラルネットワークによって生成された自動セグメンテーションをターゲットを絞った方法で敵対的に操作できることが示されています。この記事では、胸部X線の解剖学的構造のセグメンテーションのターゲットを絞った変更における敵対的攻撃の有効性を研究しました。まず、敵対的操作のターゲットとして、解剖学的に信じがたい形状を使用して実験しました。画像にほとんど知覚できないノイズを追加することで、最先端のニューラルネットワークに心臓を実際の解剖学的形状ではなく心臓のシンボルとして確実にセグメント化できることを示しました。さらに、そのような心臓形成攻撃は、同じ攻撃方法に基づく非標的攻撃よりも高い敵対的ノイズレベルを必要としないようでした。次に、セグメンテーションの敵対的操作の限界を探求しようとしました。そのために、3つの解剖学的構造のセグメンテーション輪郭を縮小および拡大することの有効性を評価しました。構造のセグメンテーションを、それらに特徴のない強度とテクスチャを備えた領域に敵対的に拡張することは、攻撃への挑戦を提示し、場合によっては、ターゲットネットワークによって学習されたクラス隣接の事前情報と競合する方法でセグメンテーションを変更することを観察しました。さらに、画像の異なるサブセットでトレーニングされた代理ネットワークを使用して、ブラックボックス攻撃シナリオでの非標的攻撃と標的心臓発作のパフォーマンスを評価しました。どちらの場合も、攻撃の効果は大幅に低下しました。これらの調査結果は、セマンティックセグメンテーションに対する敵対的攻撃の現在の機能と限界に新たな洞察をもたらすと信じています。

Adversarial attacks consist in maliciously changing the input data to mislead the predictions of automated decision systems and are potentially a serious threat for automated medical image analysis. Previous studies have shown that it is possible to adversarially manipulate automated segmentations produced by neural networks in a targeted manner in the white-box attack setting. In this article, we studied the effectiveness of adversarial attacks in targeted modification of segmentations of anatomical structures in chest X-rays. Firstly, we experimented with using anatomically implausible shapes as targets for adversarial manipulation. We showed that, by adding almost imperceptible noise to the image, we can reliably force state-of-the-art neural networks to segment the heart as a heart symbol instead of its real anatomical shape. Moreover, such heart-shaping attack did not appear to require higher adversarial noise level than an untargeted attack based the same attack method. Secondly, we attempted to explore the limits of adversarial manipulation of segmentations. For that, we assessed the effectiveness of shrinking and enlarging segmentation contours for the three anatomical structures. We observed that adversarially extending segmentations of structures into regions with intensity and texture uncharacteristic for them presented a challenge to our attacks, as well as, in some cases, changing segmentations in ways that conflict with class adjacency priors learned by the target network. Additionally, we evaluated performances of the untargeted attacks and targeted heart attacks in the black-box attack scenario, using a surrogate network trained on a different subset of images. In both cases, the attacks were substantially less effective. We believe these findings bring novel insights into the current capabilities and limits of adversarial attacks for semantic segmentation.

updated: Wed Apr 07 2021 21:20:07 GMT+0000 (UTC)

published: Wed Mar 31 2021 22:20:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト