Domain Enhanced Arbitrary Image Style Transfer via Contrastive Learning

Yuxin Zhang; Fan Tang; Weiming Dong; Haibin Huang; Chongyang Ma; Tong-Yee Lee; Changsheng Xu

対照学習によるドメイン拡張任意画像スタイル転送

この作業では、新しいスタイルの特徴表現学習方法を使用して、任意の画像スタイルの転送という困難な問題に取り組みます。満足のいく結果を得るには、画像のスタイル設定タスクの重要な要素として、適切なスタイル表現が不可欠です。既存のディープニューラルネットワークベースのアプローチは、コンテンツ機能のグラム行列などの2次統計からのガイダンスで妥当な結果を達成します。ただし、十分なスタイル情報を活用していないため、局所的な歪みやスタイルの不整合などのアーティファクトが発生します。これらの問題に対処するために、複数のスタイル間の類似点と相違点を分析し、スタイルの分布を考慮することにより、2次統計ではなく、画像の特徴から直接スタイル表現を学習することを提案します。具体的には、対照学習による新しいスタイル表現学習とスタイル転送方法である対照任意スタイル転送（CAST）を紹介します。私たちのフレームワークは、3つの主要なコンポーネントで構成されています。つまり、スタイルコードエンコーディング用のマルチレイヤースタイルプロジェクター、スタイル分布を効果的に学習するためのドメイン拡張モジュール、および画像スタイル転送用の生成ネットワークです。私たちは定性的および定量的評価を包括的に実施し、私たちのアプローチが最先端の方法で得られたものと比較して大幅に優れた結果を達成することを実証します。コードとモデルはhttps://github.com/zyxElsa/CAST_pytorchで入手できます。

In this work, we tackle the challenging problem of arbitrary image style transfer using a novel style feature representation learning method. A suitable style representation, as a key component in image stylization tasks, is essential to achieve satisfactory results. Existing deep neural network based approaches achieve reasonable results with the guidance from second-order statistics such as Gram matrix of content features. However, they do not leverage sufficient style information, which results in artifacts such as local distortions and style inconsistency. To address these issues, we propose to learn style representation directly from image features instead of their second-order statistics, by analyzing the similarities and differences between multiple styles and considering the style distribution. Specifically, we present Contrastive Arbitrary Style Transfer (CAST), which is a new style representation learning and style transfer method via contrastive learning. Our framework consists of three key components, i.e., a multi-layer style projector for style code encoding, a domain enhancement module for effective learning of style distribution, and a generative network for image style transfer. We conduct qualitative and quantitative evaluations comprehensively to demonstrate that our approach achieves significantly better results compared to those obtained via state-of-the-art methods. Code and models are available at https://github.com/zyxElsa/CAST_pytorch

updated: Thu May 19 2022 13:11:24 GMT+0000 (UTC)

published: Thu May 19 2022 13:11:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト