DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models

Zeyang Sha; Zheng Li; Ning Yu; Yang Zhang

DE-FAKE: テキストから画像への生成モデルによって生成された偽画像の検出と帰属

プロンプトの説明に基づいて画像を生成するテキストから画像への生成モデルは、過去数か月間でますます注目を集めています。有望なパフォーマンスにもかかわらず、これらのモデルは、生成された偽の画像の悪用に関する懸念を引き起こします。この問題に取り組むために、テキストから画像への生成モデルによって生成された偽の画像の検出と帰属に関する体系的な研究を開拓しました。具体的には、まず、さまざまなテキストから画像への生成モデルによって生成された偽の画像を検出するための機械学習分類器を構築します。次に、モデルの所有者がモデルの誤用に対して責任を負うことができるように、これらの偽の画像をソースモデルに帰属させます。さらに、偽の画像を生成するプロンプトが検出と帰属にどのように影響するかを調査します。 DALL∙E 2、Stable Diffusion、GLIDE、Latent Diffusion を含む 4 つの一般的なテキストから画像への生成モデルと、2 つのベンチマークプロンプト画像データセットについて広範な実験を行っています。経験的結果は、(1) 異なるモデルからの偽の画像によって共有される共通のアーティファクトが存在するため、さまざまなモデルによって生成された偽の画像を実際の画像と区別できることを示しています。 (2) 異なるモデルは生成された画像に固有のフィンガープリントを残すため、偽の画像は事実上ソースモデルに起因する可能性があります。 (3) 「人物」トピックまたは 25 から 75 の間の長さのプロンプトにより、モデルはより信頼性の高い偽の画像を生成できます。すべての調査結果は、テキストから画像への生成モデルによって引き起こされる脅威に対するコミュニティの洞察に貢献します。私たちは、急速に進化する偽画像の生成に対して、私たちのような対応するソリューションをコミュニティが検討するよう呼びかけます。

Text-to-image generation models that generate images based on prompt descriptions have attracted an increasing amount of attention during the past few months. Despite their encouraging performance, these models raise concerns about the misuse of their generated fake images. To tackle this problem, we pioneer a systematic study on the detection and attribution of fake images generated by text-to-image generation models. Concretely, we first build a machine learning classifier to detect the fake images generated by various text-to-image generation models. We then attribute these fake images to their source models, such that model owners can be held responsible for their models' misuse. We further investigate how prompts that generate fake images affect detection and attribution. We conduct extensive experiments on four popular text-to-image generation models, including DALL∙E 2, Stable Diffusion, GLIDE, and Latent Diffusion, and two benchmark prompt-image datasets. Empirical results show that (1) fake images generated by various models can be distinguished from real ones, as there exists a common artifact shared by fake images from different models; (2) fake images can be effectively attributed to their source models, as different models leave unique fingerprints in their generated images; (3) prompts with the ``person'' topic or a length between 25 and 75 enable models to generate fake images with higher authenticity. All findings contribute to the community's insight into the threats caused by text-to-image generation models. We appeal to the community's consideration of the counterpart solutions, like ours, against the rapidly-evolving fake image generation.

updated: Mon Jan 09 2023 16:33:43 GMT+0000 (UTC)

published: Thu Oct 13 2022 13:08:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト