Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

Weijian Luo; Tianyang Hu; Shifeng Zhang; Jiacheng Sun; Zhenguo Li; Zhihua Zhang

Diff-Instruct: 事前トレーニングされた拡散モデルから知識を転送するための普遍的なアプローチ

トレーニングの容易さ、拡張性、およびサンプル品質の高さにより、拡散モデル (DM) は生成モデリングの推奨オプションとなっており、さまざまなデータセットに多数の事前トレーニング済みモデルが利用可能です。データ配布に関する複雑な情報を含む事前トレーニング済み DM は、ダウンストリームアプリケーションにとって貴重な資産です。この研究では、事前トレーニングされた DM から学習し、その知識をデータフリーの方法で他の生成モデルに転送することを検討します。具体的には、生成されたサンプルがモデルパラメーターに関して微分可能である限り、任意の生成モデルのトレーニングを指示する Diff-Instruct と呼ばれる一般的なフレームワークを提案します。私たちが提案する Diff-Instruct は、厳密な数学的基礎に基づいて構築されており、その指示プロセスは、積分カルバックライブラー (IKL) 発散と呼ばれる新しい発散を最小限に抑えることに直接対応しています。 IKL は、拡散プロセスに沿った KL 発散の積分を計算することによって DM 向けに調整されており、位置ずれしたサポートを含む分布との比較においてより堅牢であることが示されています。また、DreamFusion や生成敵対的トレーニングなどの既存の作品と私たちのメソッドの重要な関連性も明らかにします。 Diff-Instruct の有効性と普遍性を実証するために、事前トレーニングされた拡散モデルの抽出と既存の GAN モデルの改良という 2 つのシナリオを検討します。事前トレーニング済みの拡散モデルを抽出する実験では、Diff-Instruct によって最先端の単一ステップの拡散ベースのモデルが得られることが示されています。 GAN モデルの改良に関する実験では、Diff-Instruct がさまざまな設定にわたって GAN モデルの事前トレーニングされたジェネレーターを一貫して改善できることが示されています。

Due to the ease of training, ability to scale, and high sample quality, diffusion models (DMs) have become the preferred option for generative modeling, with numerous pre-trained models available for a wide variety of datasets. Containing intricate information about data distributions, pre-trained DMs are valuable assets for downstream applications. In this work, we consider learning from pre-trained DMs and transferring their knowledge to other generative models in a data-free fashion. Specifically, we propose a general framework called Diff-Instruct to instruct the training of arbitrary generative models as long as the generated samples are differentiable with respect to the model parameters. Our proposed Diff-Instruct is built on a rigorous mathematical foundation where the instruction process directly corresponds to minimizing a novel divergence we call Integral Kullback-Leibler (IKL) divergence. IKL is tailored for DMs by calculating the integral of the KL divergence along a diffusion process, which we show to be more robust in comparing distributions with misaligned supports. We also reveal non-trivial connections of our method to existing works such as DreamFusion, and generative adversarial training. To demonstrate the effectiveness and universality of Diff-Instruct, we consider two scenarios: distilling pre-trained diffusion models and refining existing GAN models. The experiments on distilling pre-trained diffusion models show that Diff-Instruct results in state-of-the-art single-step diffusion-based models. The experiments on refining GAN models show that the Diff-Instruct can consistently improve the pre-trained generators of GAN models across various settings.

updated: Mon Jan 15 2024 07:51:23 GMT+0000 (UTC)

published: Mon May 29 2023 04:22:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト