CLIPMasterPrints: Fooling Contrastive Language-Image Pre-training Using Latent Variable Evolution

Matthias Freiberger; Peter Kun; Anders Sundnes Løvlie; Sebastian Risi

CLIPMasterPrints: 潜在変数進化を使用した欺瞞的な対照的言語イメージの事前トレーニング

Contrastive Language-Image Pre-training (CLIP) などの視覚データとテキストデータの両方を活用するモデルの重要性がますます高まっています。この研究では、そのようなモデルが多用途性にもかかわらず、私たちがマスターイメージを騙すものに対して脆弱であることを示します。マスターイメージを騙すことは、人間には認識できない一方で、多数の多種多様なプロンプトに対して CLIP モデルの信頼スコアを最大化することができます。我々は、進化戦略または確率的勾配降下法を用いて生成モデルの潜在空間を探索することによって、だまされているマスターイメージをどのようにマイニングできるかを実証します。私たちは、マイニングされた騙しマスター画像の特性を調査し、少数の画像キャプションでトレーニングされた画像が、意味的に関連するはるかに多数のキャプションに潜在的に一般化する可能性があることを発見しました。さらに、2 つの考えられる緩和戦略を評価し、マスターサンプルをだますことに対する脆弱性が、対照的な事前トレーニング済みマルチモーダルネットワークのモダリティギャップと密接に関連していることを発見しました。したがって、オフマニホールド攻撃に対する脆弱性の観点から、CLIP および関連するマルチモーダルアプローチにおけるモダリティギャップの緩和を主張します。ソースコードとマイニングされた CLIPMasterPrint は、https://github.com/matfrei/CLIPMasterPrints で入手できます。

Models leveraging both visual and textual data such as Contrastive Language-Image Pre-training (CLIP), are increasingly gaining importance. In this work, we show that despite their versatility, such models are vulnerable to what we refer to as fooling master images. Fooling master images are capable of maximizing the confidence score of a CLIP model for a significant number of widely varying prompts, while being unrecognizable for humans. We demonstrate how fooling master images can be mined by searching the latent space of generative models by means of an evolution strategy or stochastic gradient descent. We investigate the properties of the mined fooling master images, and find that images trained on a small number of image captions potentially generalize to a much larger number of semantically related captions. Further, we evaluate two possible mitigation strategies and find that vulnerability to fooling master examples is closely related to a modality gap in contrastive pre-trained multi-modal networks. From the perspective of vulnerability to off-manifold attacks, we therefore argue for the mitigation of modality gaps in CLIP and related multi-modal approaches. Source code and mined CLIPMasterPrints are available at https://github.com/matfrei/CLIPMasterPrints.

updated: Fri Jul 07 2023 18:54:11 GMT+0000 (UTC)

published: Fri Jul 07 2023 18:54:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト