Making DeepFakes more spurious: evading deep face forgery detection via trace removal attack

Chi Liu; Huajie Chen; Tianqing Zhu; Jun Zhang; Wanlei Zhou

DeepFakesをよりスプリアスにする：トレース除去攻撃による深層偽造の検出を回避する

DeepFakesは重大な社会的懸念を引き起こしています。フォレンジック対策としてさまざまなDeepFake検出器が開発されていますが、これらの検出器は依然として攻撃に対して脆弱です。最近、いくつかの攻撃、主に敵対的攻撃が、検出を回避するためにDeepFake画像をクローキングすることに成功しました。ただし、これらの攻撃には典型的な検出器固有の設計があり、検出器に関する事前の知識が必要であるため、転送性が低下します。さらに、これらの攻撃は単純なセキュリティシナリオのみを考慮します。検出器または攻撃者の知識のいずれかが変化する高レベルのシナリオでそれらがどれほど効果的であるかについてはあまり知られていません。このホワイトペーパーでは、DeepFakeアンチフォレンジック向けの、検出器にとらわれない新しいトレース除去攻撃を提示することで、上記の課題を解決します。検出器側を調査する代わりに、攻撃は元のDeepFake作成パイプラインを調べ、検出可能なすべての自然なDeepFakeトレースを削除して、偽の画像をより「本物」にしようとします。この攻撃を実装するには、まず、DeepFakeトレース検出を実行して、3つの識別可能なトレースを識別します。次に、1つのジェネレータと複数のディスクリミネータが関与する敵対的な学習フレームワークに基づいて、トレース除去ネットワーク（TR-Net）が提案されます。各ディスクリミネーターは、クロストレース干渉を回避するために1つの個別のトレース表現を担当します。これらの弁別器は並列に配置されているため、ジェネレータはさまざまなトレースを同時に削除するように求められます。攻撃の有効性を評価するために、検出器にさまざまなレベルの防御が組み込まれ、攻撃者のデータに関する背景知識が異なる、異種のセキュリティシナリオを作成しました。実験結果は、提案された攻撃が6つの最先端のDeepFake検出器の検出精度を大幅に損なう可能性がある一方で、元のDeepFakeサンプルの視覚的品質の低下はごくわずかであることを示しています。

DeepFakes are raising significant social concerns. Although various DeepFake detectors have been developed as forensic countermeasures, these detectors are still vulnerable to attacks. Recently, a few attacks, principally adversarial attacks, have succeeded in cloaking DeepFake images to evade detection. However, these attacks have typical detector-specific designs, which require prior knowledge about the detector, leading to poor transferability. Moreover, these attacks only consider simple security scenarios. Less is known about how effective they are in high-level scenarios where either the detectors or the attacker's knowledge varies. In this paper, we solve the above challenges with presenting a novel detector-agnostic trace removal attack for DeepFake anti-forensics. Instead of investigating the detector side, our attack looks into the original DeepFake creation pipeline, attempting to remove all detectable natural DeepFake traces to render the fake images more "authentic". To implement this attack, first, we perform a DeepFake trace discovery, identifying three discernible traces. Then a trace removal network (TR-Net) is proposed based on an adversarial learning framework involving one generator and multiple discriminators. Each discriminator is responsible for one individual trace representation to avoid cross-trace interference. These discriminators are arranged in parallel, which prompts the generator to remove various traces simultaneously. To evaluate the attack efficacy, we crafted heterogeneous security scenarios where the detectors were embedded with different levels of defense and the attackers' background knowledge of data varies. The experimental results show that the proposed attack can significantly compromise the detection accuracy of six state-of-the-art DeepFake detectors while causing only a negligible loss in visual quality to the original DeepFake samples.

updated: Tue Mar 22 2022 03:13:33 GMT+0000 (UTC)

published: Tue Mar 22 2022 03:13:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト