Relaxed-Responsibility Hierarchical Discrete VAEs

Matthew Willetts; Xenia Miscouridou; Stephen Roberts; Chris Holmes

Successfully training Variational Autoencoders (VAEs) with a hierarchy of discrete latent variables remains an area of active research. Vector-Quantised VAEs are a powerful approach to discrete VAEs, but naive hierarchical extensions can be unstable during training. Leveraging insights from classical methods of inference, we introduce Relaxed-Responsibility Vector-Quantisation, a novel way to parameterise discrete latent variables and a refinement of relaxed Vector-Quantisation that gives better performance and more stable training. This enables a novel approach to hierarchical discrete variational autoencoders with numerous layers of latent variables (here up to 32) that we train end-to-end. Among hierarchical probabilistic deep generative models with discrete latent variables trained end-to-end, we achieve state-of-the-art bits-per-dim results on various standard datasets. Unlike discrete VAEs with a single layer of latent variables, we can produce samples by ancestral sampling: it is not essential to train a second autoregressive generative model over the learnt latent representations, sample from it, and then decode. Moreover, that latter approach would require thousands of forward passes to generate a single sample in these deep hierarchical models. Further, we observe that different layers of our model become associated with different aspects of the data.
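The abstract does not spell out the parameterisation, so the following is only a rough sketch of the general idea of a relaxed, responsibility-style vector quantisation, not the authors' implementation. It assumes the categorical probabilities over codebook entries are a softmax of negative squared distances, the same form as mixture-of-Gaussians responsibilities with equal weights and a shared isotropic covariance. The names `responsibilities`, `nats_to_bits_per_dim`, `codebook`, and `temperature` are hypothetical and chosen for illustration.

```python
import math


def responsibilities(z, codebook, temperature=1.0):
    """Soft assignment of an encoder output z to each codebook vector.

    Assumption (not taken from the paper): the logits are negative squared
    Euclidean distances divided by a temperature, normalised by a softmax.
    """
    logits = [
        -sum((zi - ei) ** 2 for zi, ei in zip(z, e)) / temperature
        for e in codebook
    ]
    # Subtract the maximum logit before exponentiating for numerical stability.
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [x / total for x in exps]


def nats_to_bits_per_dim(nll_nats, num_dims):
    """Convert a negative log-likelihood in nats to bits per dimension."""
    return nll_nats / (num_dims * math.log(2))


# Hypothetical 2-D codebook with three entries and one encoder output.
codebook = [[0.0, 0.0], [1.0, 1.0], [4.0, 4.0]]
z = [0.9, 1.1]

soft = responsibilities(z, codebook, temperature=1.0)   # relaxed assignment
hard = responsibilities(z, codebook, temperature=0.01)  # close to one-hot
```

Lowering `temperature` pushes the assignment towards the hard nearest-neighbour lookup of a standard Vector-Quantised VAE, while higher values keep it soft. The second helper shows only the unit conversion behind a bits-per-dim figure; it says nothing about how the reported results were computed.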

updated: Thu Feb 04 2021 18:59:59 GMT+0000 (UTC)

published: Tue Jul 14 2020 19:10:05 GMT+0000 (UTC)

arXiv
