BIGRoC: Boosting Image Generation via a Robust Classifier

Roy Ganz; Michael Elad

BIGRoC: 堅牢な分類器による画像生成のブースト

画像合成に対する機械学習コミュニティの関心は、幅広い深層生成モデルとそれらをトレーニングするための手段が導入されたことで、近年大幅に高まっています。この作業では、生成モデルによって得られた生成画像の画質と分布忠実度を改善するための一般的なモデルに依存しない手法を提案します。 BIGRoC (Boosting Image Generation via a Robust Classifier) と呼ばれる私たちの方法は、特定の堅牢な分類器のガイダンスによる後処理手順に基づいており、生成モデルの追加のトレーニングは必要ありません。合成された画像が与えられた場合、ロバストな分類器で投影された勾配ステップを介して画像を更新し、その認識を改善することを提案します。さまざまな画像合成方法でこの後処理アルゴリズムを実証し、CIFAR-10 と ImageNet で定量的および定性的な大幅な改善を示します。驚くべきことに、BIGRoC は改良アプローチの中で最初のモデルにとらわれず、必要な情報がはるかに少ないにもかかわらず、競合する方法よりも優れています。具体的には、BIGRoC は、ImageNet 128x128 で 14.81% 向上し、2.53 の FID スコアを達成し、256x256 で 7.87% 向上し、3.63 の FID を達成します。さらに、私たちは意見調査を実施し、それによると、人間は私たちの方法の出力を大幅に好みます。

The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them. In this work, we propose a general model-agnostic technique for improving the image quality and the distribution fidelity of generated images obtained by any generative model. Our method, termed BIGRoC (Boosting Image Generation via a Robust Classifier), is based on a post-processing procedure via the guidance of a given robust classifier and without a need for additional training of the generative model. Given a synthesized image, we propose to update it through projected gradient steps over the robust classifier to refine its recognition. We demonstrate this post-processing algorithm on various image synthesis methods and show a significant quantitative and qualitative improvement on CIFAR-10 and ImageNet. Surprisingly, although BIGRoC is the first model agnostic among refinement approaches and requires much less information, it outperforms competitive methods. Specifically, BIGRoC improves the image synthesis best performing diffusion model on ImageNet 128x128 by 14.81%, attaining an FID score of 2.53, and on 256x256 by 7.87%, achieving an FID of 3.63. Moreover, we conduct an opinion survey, according to which humans significantly prefer our method's outputs.

updated: Wed Feb 01 2023 11:07:15 GMT+0000 (UTC)

published: Sun Aug 08 2021 18:05:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト