Improving Diffusion Model Efficiency Through Patching

Troy Luhman; Eric Luhman

Diffusion models are a powerful class of generative models that iteratively denoise samples to produce data. While many works have focused on the number of iterations in this sampling procedure, few have focused on the cost of each iteration. We find that adding a simple ViT-style patching transformation can considerably reduce a diffusion model's sampling time and memory usage. We justify our approach both through an analysis of the diffusion model objective and through empirical experiments on LSUN Church, ImageNet 256, and FFHQ 1024. We provide implementations in TensorFlow and PyTorch.
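
As a rough illustration of what a ViT-style patching transformation can look like, the sketch below rearranges each non-overlapping P x P block of pixels into the channel dimension, so that a network would operate on a grid with P^2 times fewer spatial positions. This is a minimal NumPy sketch under our own assumptions, not the authors' released code: the function names `patchify` and `unpatchify`, the (H, W, C) layout, and the patch size used in the example are all hypothetical.

```python
import numpy as np


def patchify(image: np.ndarray, patch_size: int) -> np.ndarray:
    """Fold each patch_size x patch_size block of pixels into the channel axis.

    Maps an (H, W, C) array to (H / P, W / P, P * P * C). Hypothetical helper.
    """
    height, width, channels = image.shape
    if height % patch_size or width % patch_size:
        raise ValueError("height and width must be divisible by patch_size")
    # Split both spatial axes into (block index, offset within block).
    blocks = image.reshape(
        height // patch_size, patch_size, width // patch_size, patch_size, channels
    )
    # Bring the two block indices to the front, then flatten each block.
    blocks = blocks.transpose(0, 2, 1, 3, 4)
    return blocks.reshape(
        height // patch_size, width // patch_size, patch_size * patch_size * channels
    )


def unpatchify(patches: np.ndarray, patch_size: int) -> np.ndarray:
    """Invert patchify: map (H / P, W / P, P * P * C) back to (H, W, C)."""
    grid_h, grid_w, depth = patches.shape
    channels = depth // (patch_size * patch_size)
    blocks = patches.reshape(grid_h, grid_w, patch_size, patch_size, channels)
    # Interleave block indices with within-block offsets again.
    blocks = blocks.transpose(0, 2, 1, 3, 4)
    return blocks.reshape(grid_h * patch_size, grid_w * patch_size, channels)


# Example: an 8 x 8 RGB array with patch size 4 becomes a 2 x 2 grid of 48-dim vectors.
image = np.arange(8 * 8 * 3, dtype=np.float32).reshape(8, 8, 3)
patches = patchify(image, 4)
restored = unpatchify(patches, 4)
```

The rearrangement is lossless, so `unpatchify` recovers the original array exactly. In this sketch, any saving in time and memory would come from the denoising network processing the smaller spatial grid at each sampling iteration.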

updated: Sat Jul 09 2022 18:21:32 GMT+0000 (UTC)

published: Sat Jul 09 2022 18:21:32 GMT+0000 (UTC)

arXiv
