Deep Image Style Transfer from Freeform Text

Tejas Santanam; Mengyang Liu; Jiangyue Yu; Zhaodong Yang

フリーフォームテキストからのディープイメージスタイルの転送

この論文では、フリーフォームのユーザーテキスト入力からスタイル画像を生成することにより、ディープニューラルスタイル転送の新しい方法を作成します。言語モデルとスタイル転送モデルはシームレスなパイプラインを形成し、ベースラインのスタイル転送方法と比較して、同様の損失と改善された品質を持つ出力画像を作成できます。言語モデルは、スタイルテキストと説明の入力が与えられると、厳密に一致する画像を返します。この画像は、入力コンテンツ画像と共にスタイル転送モデルに渡され、最終的な出力が作成されます。モデルを統合し、フリーフォームテキストからのディープイメージスタイル転送の有効性を実証するための概念実証ツールも開発されています。

This paper creates a novel method of deep neural style transfer by generating style images from freeform user text input. The language model and style transfer model form a seamless pipeline that can create output images with similar losses and improved quality when compared to baseline style transfer methods. The language model returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output. A proof-of-concept tool is also developed to integrate the models and demonstrate the effectiveness of deep image style transfer from freeform text.

updated: Tue Dec 13 2022 19:24:08 GMT+0000 (UTC)

published: Tue Dec 13 2022 19:24:08 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト