Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs

Min Zhou; Chenchen Xu; Ye Ma; Tiezheng Ge; Yuning Jiang; Weiwei Xu

In this paper, we study the graphic layout generation problem of producing high-quality visual-textual presentation designs for given images. We note that image compositions, which contain not only global semantics but also spatial information, largely affect layout results. Hence, we propose a deep generative model, dubbed composition-aware graphic layout GAN (CGL-GAN), to synthesize layouts based on the global and spatial visual contents of input images. To obtain training images from images that already contain manually designed graphic layout data, previous work suggests masking the design elements (e.g., texts and embellishments) and using the masked images as model inputs, which inevitably leaves hints of the ground truth. We study the misalignment between the training inputs (with hint masks) and the test inputs (without masks), and design a novel domain alignment module (DAM) to narrow this gap. For training, we built a large-scale layout dataset that consists of 60,548 advertising posters with annotated layout information. To evaluate the generated layouts, we propose three novel metrics based on aesthetic intuitions. Through both quantitative and qualitative evaluations, we demonstrate that the proposed model can synthesize high-quality graphic layouts according to image compositions.

updated: Sun Jul 10 2022 10:20:20 GMT+0000 (UTC)

published: Sat Apr 30 2022 16:42:13 GMT+0000 (UTC)

arXiv
