Trust the Critics: Generatorless and Multipurpose WGANs with Initial Convergence Guarantees

Tristan Milne; Étienne Bilocq; Adrian Nachman

批評家を信頼する：初期収束保証付きのジェネレータレスおよび多目的WGAN

最適な輸送理論からのアイデアに触発されて、生成モデリングの新しいアルゴリズムであるTrust the Critics（TTC）を紹介します。このアルゴリズムは、WassersteinGANからトレーニング可能なジェネレーターを排除します。代わりに、一連の訓練された評論家ネットワークで最急降下法を使用してソースデータを繰り返し変更します。これは、批評家の勾配によって提供される最適な輸送方向と、トレーニング可能なジェネレーターによってパラメーター化されたときにデータポイントが実際に移動する方向との間に観察された不整合によって部分的に動機付けられています。以前の研究は異なる視点から同様のアイデアに到達しましたが、最適輸送理論の基礎は、一定のステップサイズと比較して収束を大幅に加速する適応ステップサイズの選択を動機付けます。このステップサイズルールを使用して、密度のあるソース分布の場合の初期の幾何学的収束率を証明します。これらの収束率は、無視できない生成データのセットが実際のデータと本質的に区別できない場合にのみ適用されなくなります。ミスアライメントの問題を解決すると、パフォーマンスが向上します。これは、トレーニングエポックの数が固定されている場合、メモリ要件が増加しても、TTCが同等のWGANよりも高品質の画像を生成することを示す実験で実証されています。さらに、TTCは、従来のWGANでは提供されていない変換密度の反復式を提供します。最後に、TTCを適用して、任意のソース分布を任意のターゲットにマッピングできます。実験を通じて、TTCは専用のアルゴリズムなしで画像の生成、変換、ノイズ除去で競争力のあるパフォーマンスを得ることができることを示しています。

Inspired by ideas from optimal transport theory we present Trust the Critics (TTC), a new algorithm for generative modelling. This algorithm eliminates the trainable generator from a Wasserstein GAN; instead, it iteratively modifies the source data using gradient descent on a sequence of trained critic networks. This is motivated in part by the misalignment which we observed between the optimal transport directions provided by the gradients of the critic and the directions in which data points actually move when parametrized by a trainable generator. Previous work has arrived at similar ideas from different viewpoints, but our basis in optimal transport theory motivates the choice of an adaptive step size which greatly accelerates convergence compared to a constant step size. Using this step size rule, we prove an initial geometric convergence rate in the case of source distributions with densities. These convergence rates cease to apply only when a non-negligible set of generated data is essentially indistinguishable from real data. Resolving the misalignment issue improves performance, which we demonstrate in experiments that show that given a fixed number of training epochs, TTC produces higher quality images than a comparable WGAN, albeit at increased memory requirements. In addition, TTC provides an iterative formula for the transformed density, which traditional WGANs do not. Finally, TTC can be applied to map any source distribution onto any target; we demonstrate through experiments that TTC can obtain competitive performance in image generation, translation, and denoising without dedicated algorithms.

updated: Tue Nov 30 2021 03:36:44 GMT+0000 (UTC)

published: Tue Nov 30 2021 03:36:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト