BIGRoC: Boosting Image Generation via a Robust Classifier

Roy Ganz; Michael Elad

BIGRoC：堅牢な分類器による画像生成の強化

画像合成における機械学習コミュニティの関心は、幅広い深層生成モデルとそれらをトレーニングする手段の導入により、近年大幅に高まっています。この作業では、任意の生成モデルによって取得された、生成された画像の画質と分布の忠実度を向上させるための一般的なモデルにとらわれない手法を提案します。 BIGRoC（ロバスト分類器による画像生成のブースト）と呼ばれる私たちの方法は、生成モデルの追加のトレーニングを必要とせずに、特定のロバスト分類器のガイダンスによる後処理手順に基づいています。合成された画像を前提として、その認識を改善するために、ロバストな分類器上に投影された勾配ステップを介して画像を更新することを提案します。この後処理アルゴリズムをさまざまな画像合成方法で示し、CIFAR-10とImageNetで生成された画像が定量的および定性的に大幅に改善されていることを示します。具体的には、BIGRoCは、ImageNet 128x128で14.81％、256x256で13.29％のFIDスコアを達成し、3.98のFIDを達成して、最先端の画像合成を改善します。

The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them. In this work, we propose a general model-agnostic technique for improving the image quality and the distribution fidelity of generated images, obtained by any generative model. Our method, termed BIGRoC (Boosting Image Generation via a Robust Classifier), is based on a post-processing procedure via the guidance of a given robust classifier and without a need for additional training of the generative model. Given a synthesized image, we propose to update it through projected gradient steps over the robust classifier, in an attempt to refine its recognition. We demonstrate this post-processing algorithm on various image synthesis methods and show a significant improvement of the generated images, both quantitatively and qualitatively, on CIFAR-10 and ImageNet. Specifically, BIGRoC improves the image synthesis state of the art on ImageNet 128x128 by 14.81%, attaining an FID score of 2.53 and on 256x256 by 13.29%, achieving an FID of 3.98.

updated: Sat Jan 29 2022 15:42:06 GMT+0000 (UTC)

published: Sun Aug 08 2021 18:05:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト