Evaluating a Synthetic Image Dataset Generated with Stable Diffusion

Andreas Stöckl

We generate synthetic images with the Stable Diffusion image generation model, using the WordNet taxonomy and the definitions of the concepts it contains. The resulting synthetic image database can serve as training data for data augmentation in machine learning applications, and we use it to investigate the capabilities of the Stable Diffusion model. Our analyses show that Stable Diffusion can produce correct images for a large number of concepts, but that it also produces a wide variety of different representations. The results differ depending on the test concepts considered, and problems arise with very specific concepts. These evaluations were performed using a vision transformer model for image classification.
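
The workflow described above (build a text prompt from a concept name and its definition, generate images, then score them with an image classifier) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the prompt format, the helper names `build_prompt` and `per_concept_accuracy`, the two hard-coded concept entries, and the example classifier outputs are all hypothetical, and the image generation and classification steps themselves are omitted so that the sketch runs with the standard library alone.

```python
from collections import defaultdict

# Hypothetical stand-in for taxonomy entries: concept name -> definition.
# Hard-coded for illustration so the sketch needs no external data.
CONCEPTS = {
    "tabby": "a cat with a grey or tawny coat mottled with black",
    "schooner": "sailing vessel used in former times",
}


def build_prompt(name, definition):
    """Combine a concept name with its definition (assumed prompt format)."""
    return f"{name}, {definition}"


def per_concept_accuracy(records):
    """Compute, per target concept, the fraction of correct predictions.

    records: iterable of (target_concept, predicted_concept) pairs, one per
    generated image, where the prediction would come from an image classifier.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for target, predicted in records:
        totals[target] += 1
        hits[target] += int(predicted == target)
    return {concept: hits[concept] / totals[concept] for concept in totals}


# One prompt per concept; each would be passed to the image generator.
prompts = {name: build_prompt(name, text) for name, text in CONCEPTS.items()}

# Invented classifier outputs, used only to exercise the scoring function.
example_records = [
    ("tabby", "tabby"),
    ("tabby", "tiger cat"),
    ("schooner", "schooner"),
]
accuracy = per_concept_accuracy(example_records)
```

Scoring per concept rather than overall makes it easy to see which concepts are generated reliably and which are not, which is the kind of difference the abstract reports.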

updated: Fri Nov 04 2022 09:28:00 GMT+0000 (UTC)

published: Thu Nov 03 2022 13:02:08 GMT+0000 (UTC)

arXiv
