Each Attribute Matters: Contrastive Attention for Sentence-based Image Editing

Liuqing Zhao; Fan Lyu; Fuyuan Hu; Kaizhu Huang; Fenglei Xu; Linyan Li

各属性の問題：文ベースの画像編集に対する対照的な注意

文ベースの画像編集（SIE）は、自然言語を使用して画像を編集することを目的としています。高価な手動編集を削減する可能性を提供するSIEは、最近多くの関心を集めています。ただし、既存のメソッドでは正確な編集を行うことはほとんどできず、クエリ文に複数の編集可能な属性がある場合、属性の編集に失敗することさえあります。この問題に対処するために、本論文では、属性間の差異を強調することに焦点を当てることにより、対照トレーニングから着想を得た、対照注意生成的敵対的ネットワーク（CA-GAN）と呼ばれる新しいモデルを提案します。具体的には、最初に、トレーニング中に形成される属性のランダムな組み合わせ間の編集の違いを拡大するために、新しい対照的な注意モジュールを設計します。次に、属性識別子を作成して、各属性を効果的に編集できるようにします。一連の実験は、私たちの方法が、CUBおよびCOCOデータセットの複数の属性を使用した文ベースの画像編集で非常に有望な結果を生成できることを示しています。私たちのコードはhttps://github.com/Zlq2021/CA-GANで入手できます

Sentence-based Image Editing (SIE) aims to deploy natural language to edit an image. Offering potentials to reduce expensive manual editing, SIE has attracted much interest recently. However, existing methods can hardly produce accurate editing and even lead to failures in attribute editing when the query sentence is with multiple editable attributes. To cope with this problem, by focusing on enhancing the difference between attributes, this paper proposes a novel model called Contrastive Attention Generative Adversarial Network (CA-GAN), which is inspired from contrastive training. Specifically, we first design a novel contrastive attention module to enlarge the editing difference between random combinations of attributes which are formed during training. We then construct an attribute discriminator to ensure effective editing on each attribute. A series of experiments show that our method can generate very encouraging results in sentence-based image editing with multiple attributes on CUB and COCO dataset. Our code is available at https://github.com/Zlq2021/CA-GAN

updated: Thu Oct 21 2021 14:06:20 GMT+0000 (UTC)

published: Thu Oct 21 2021 14:06:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト