Content-Aware GAN Compression

Yuchen Liu; Zhixin Shu; Yijun Li; Zhe Lin; Federico Perazzi; S. Y. Kung

Generative adversarial networks (GANs), e.g., StyleGAN2, play a vital role in various image generation and synthesis tasks, yet their notoriously high computational cost hinders their efficient deployment on edge devices. Directly applying generic compression approaches yields poor results on GANs, which motivates a number of recent GAN compression works. While prior works mainly accelerate conditional GANs, e.g., pix2pix and CycleGAN, compressing state-of-the-art unconditional GANs has rarely been explored and is more challenging. In this paper, we propose novel approaches for unconditional GAN compression. We first introduce effective channel pruning and knowledge distillation schemes specialized for unconditional GANs. We then propose a novel content-aware method to guide the processes of both pruning and distillation. With content-awareness, we can effectively prune channels that are unimportant to the contents of interest, e.g., human faces, and focus our distillation on these regions, which significantly enhances the distillation quality. On StyleGAN2 and SN-GAN, we achieve a substantial improvement over the state-of-the-art compression method. Notably, we reduce the FLOPs of StyleGAN2 by 11x with visually negligible image quality loss compared to the full-size model. More interestingly, when applied to various image manipulation tasks, our compressed model forms a smoother and better disentangled latent manifold, making it more effective for image editing.
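
The two content-aware ideas in the abstract (restricting distillation to the regions of interest, and pruning channels that matter least to that content) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names `masked_l1` and `select_channels`, the binary mask, the plain L1 distance, and the precomputed importance scores are all assumptions made for illustration; the abstract does not say how the content mask or the channel importance is obtained.

```python
# Illustrative sketch only: names, the L1 distance, and the importance
# scores are assumptions, not details taken from the paper.

def masked_l1(teacher, student, mask):
    """Mean absolute difference over pixels where mask is 1.

    teacher, student: 2-D lists of pixel values with the same shape.
    mask: 2-D list of 0/1 flags marking the content of interest.
    """
    total = 0.0
    count = 0
    for t_row, s_row, m_row in zip(teacher, student, mask):
        for t, s, m in zip(t_row, s_row, m_row):
            if m:
                total += abs(t - s)
                count += 1
    # An empty mask contributes no distillation signal.
    return total / count if count else 0.0


def select_channels(importance, keep):
    """Return the sorted indices of the `keep` highest-scoring channels."""
    ranked = sorted(range(len(importance)),
                    key=lambda i: importance[i], reverse=True)
    return sorted(ranked[:keep])


# Toy 2x2 "images": only the top row is marked as content of interest.
teacher = [[0.0, 1.0], [1.0, 0.0]]
student = [[0.5, 1.0], [0.0, 0.0]]
mask = [[1, 1], [0, 0]]

loss = masked_l1(teacher, student, mask)
kept = select_channels([0.1, 0.9, 0.4, 0.7], keep=2)
```

In this toy example the student's error in the bottom row is ignored by the loss because those pixels lie outside the mask, which is the sense in which distillation is "focused" on the content of interest.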

updated: Tue Apr 06 2021 02:23:56 GMT+0000 (UTC)

published: Tue Apr 06 2021 02:23:56 GMT+0000 (UTC)

arXiv
