Mist: Towards Improved Adversarial Examples for Diffusion Models

Chumeng Liang; Xiaoyu Wu

Mist: 拡散モデルの敵対的サンプルの改善に向けて

拡散モデル (DM) は、人工知能が生成するコンテンツ、特にアートワークの作成において大きな成功を収めてきましたが、知的財産と著作権に関して新たな懸念を引き起こしています。たとえば、侵害者は、DM で人間が作成した非許可の絵画を模倣することで利益を得ることができます。最近の研究では、拡散モデルに対するさまざまな敵対的な例が、これらの著作権侵害に対する効果的なツールとなり得ることが示唆されています。しかし、現在の敵対的な例では、絵画を模倣するさまざまな方法に対する移行性の弱さと、ノイズ除去などの単純な敵対的防御に対する堅牢性が示されています。驚くべきことに、一貫したパラメータの下で融合および修正された敵対的損失項を利用することにより、敵対的例の移転可能性が大幅に向上できることがわかりました。この研究では、敵対的な例のクロスメソッド移行可能性を包括的に評価します。実験的観察は、私たちの方法が、単純な敵対的防御に対してさらに強力な堅牢性を備えた、より転送可能な敵対的例を生成することを示しています。

Diffusion Models (DMs) have empowered great success in artificial-intelligence-generated content, especially in artwork creation, yet raising new concerns in intellectual properties and copyright. For example, infringers can make profits by imitating non-authorized human-created paintings with DMs. Recent researches suggest that various adversarial examples for diffusion models can be effective tools against these copyright infringements. However, current adversarial examples show weakness in transferability over different painting-imitating methods and robustness under straightforward adversarial defense, for example, noise purification. We surprisingly find that the transferability of adversarial examples can be significantly enhanced by exploiting a fused and modified adversarial loss term under consistent parameters. In this work, we comprehensively evaluate the cross-method transferability of adversarial examples. The experimental observation shows that our method generates more transferable adversarial examples with even stronger robustness against the simple adversarial defense.

updated: Mon May 22 2023 03:43:34 GMT+0000 (UTC)

published: Mon May 22 2023 03:43:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト