SEM-CS: Semantic CLIPStyler for Text-Based Image Style Transfer

Chanda G Kamra; Indra Deep Mastan; Debayan Gupta

CLIPStyler demonstrated image style transfer with realistic textures using only a style text description, without requiring a reference style image. However, the ground semantics of objects in the stylized output are often lost, either through style spillover onto salient and background objects (content mismatch) or through over-stylization. To address this, we propose Semantic CLIPStyler (Sem-CS), which performs semantic style transfer. Sem-CS first segments the content image into salient and non-salient objects and then transfers artistic style based on a given style text description. The semantic style transfer is achieved using a global foreground loss (for salient objects) and a global background loss (for non-salient objects). Our empirical results, including DISTS, NIMA, and user study scores, show that the proposed framework yields superior qualitative and quantitative performance.
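To make the two-loss idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each loss is a CLIP-style directional loss (one minus the cosine similarity between an image-embedding change and a text-embedding direction) evaluated on a masked copy of the image. The abstract does not confirm this form. Every name here (`toy_encode`, `apply_mask`, `directional_loss`, `semantic_style_loss`, and the weights) is hypothetical, and `toy_encode` is a stand-in for a real image encoder such as CLIP's.

```python
import math
from typing import Callable, List

Vector = List[float]
Image = List[List[float]]  # grayscale H x W image with values in [0, 1]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def apply_mask(image: Image, mask: Image, keep_salient: bool) -> Image:
    """Keep salient pixels (mask == 1) or non-salient pixels (mask == 0)."""
    return [
        [p * (m if keep_salient else 1.0 - m) for p, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]


def directional_loss(encode: Callable[[Image], Vector], stylized: Image,
                     content: Image, text_direction: Vector) -> float:
    """1 - cos(embedding change of the image, target text direction)."""
    image_direction = [s - c for s, c in zip(encode(stylized), encode(content))]
    return 1.0 - cosine(image_direction, text_direction)


def semantic_style_loss(encode: Callable[[Image], Vector], stylized: Image,
                        content: Image, mask: Image,
                        fg_text_direction: Vector, bg_text_direction: Vector,
                        fg_weight: float = 1.0, bg_weight: float = 1.0) -> float:
    """Weighted sum of a foreground loss and a background loss (assumed form)."""
    fg_loss = directional_loss(
        encode, apply_mask(stylized, mask, True),
        apply_mask(content, mask, True), fg_text_direction)
    bg_loss = directional_loss(
        encode, apply_mask(stylized, mask, False),
        apply_mask(content, mask, False), bg_text_direction)
    return fg_weight * fg_loss + bg_weight * bg_loss


def toy_encode(image: Image) -> Vector:
    """Hypothetical stand-in encoder: [sum of left half, sum of right half]."""
    half = len(image[0]) // 2
    return [sum(sum(row[:half]) for row in image),
            sum(sum(row[half:]) for row in image)]
```

In this sketch the saliency mask is supplied by the caller; in practice it would come from a segmentation step, and the two text directions may be the same when a single style description is used for the whole image.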

updated: Sat Mar 11 2023 07:33:06 GMT+0000 (UTC)

published: Sat Mar 11 2023 07:33:06 GMT+0000 (UTC)

arXiv
