Noise Estimation for Generative Diffusion Models

Robin San-Roman; Eliya Nachmani; Lior Wolf

生成拡散モデルのノイズ推定

生成拡散モデルは、音声および画像生成の主要なモデルとして登場しました。ただし、少数のノイズ除去ステップで適切に実行するには、ノイズパラメータのセットのコストのかかる調整が必要です。この作業では、前の作業で各数値を個別に再調整する必要がある一方で、任意のステップ数に対してこれらのノイズパラメータを段階的に調整できるシンプルで用途の広い学習スキームを紹介します。さらに、拡散モデルの重みを変更することなく、少数のステップで合成結果を大幅に改善することができます。私たちのアプローチは、ごくわずかな計算コストで実現します。

Generative diffusion models have emerged as leading models in speech and image generation. However, in order to perform well with a small number of denoising steps, a costly tuning of the set of noise parameters is needed. In this work, we present a simple and versatile learning scheme that can step-by-step adjust those noise parameters, for any given number of steps, while the previous work needs to retune for each number separately. Furthermore, without modifying the weights of the diffusion model, we are able to significantly improve the synthesis results, for a small number of steps. Our approach comes at a negligible computation cost.

updated: Sun Sep 12 2021 07:49:25 GMT+0000 (UTC)

published: Tue Apr 06 2021 15:46:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト