Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models

Patrick Schramowski; Manuel Brack; Björn Deiseroth; Kristian Kersting

Text-conditioned image generation models have recently achieved astonishing results in image quality and text alignment and are consequently employed in a fast-growing number of applications. Since they are highly data-driven, relying on billion-sized datasets randomly scraped from the internet, they also suffer, as we demonstrate, from degenerated and biased human behavior. In turn, they may even reinforce such biases. To help combat these undesired side effects, we present safe latent diffusion (SLD). Specifically, to measure the inappropriate degeneration due to unfiltered and imbalanced training sets, we establish a novel image generation test bed, inappropriate image prompts (I2P), containing dedicated, real-world image-to-text prompts covering concepts such as nudity and violence. As our exhaustive empirical evaluation demonstrates, the introduced SLD removes and suppresses inappropriate image parts during the diffusion process, with no additional training required and no adverse effect on overall image quality or text alignment.

updated: Sat Nov 19 2022 16:10:46 GMT+0000 (UTC)

published: Wed Nov 09 2022 18:54:25 GMT+0000 (UTC)

arXiv
