The Infinite Index: Information Retrieval on Generative Text-To-Image Models

Niklas Deckers; Maik Fröbe; Johannes Kiesel; Gianluca Pandolfo; Christopher Schröder; Benno Stein; Martin Potthast

Conditional generative models such as DALL-E and Stable Diffusion generate images based on a user-defined text, the prompt. Finding and refining prompts that produce a desired image has become the art of prompt engineering. Generative models do not provide a built-in retrieval model for a user's information need expressed through prompts. In light of an extensive literature review, we reframe prompt engineering for generative models as interactive text-based retrieval on a novel kind of "infinite index". We apply these insights for the first time in a case study on image generation for game design with an expert. Finally, we envision how active learning may help to guide the retrieval of generated images.

updated: Sat Jan 21 2023 18:16:14 GMT+0000 (UTC)

published: Wed Dec 14 2022 19:50:35 GMT+0000 (UTC)

arXiv
