Local Disentanglement in Variational Auto-Encoders Using Jacobian L_1 Regularization

Travers Rhodes; Daniel D. Lee

There have been many recent advances in representation learning; however, unsupervised representation learning can still struggle with model identification issues related to rotations of the latent space. Variational Auto-Encoders (VAEs) and their extensions such as β-VAEs have been shown to improve local alignment of latent variables with PCA directions, which can help to improve model disentanglement under some conditions. Borrowing inspiration from Independent Component Analysis (ICA) and sparse coding, we propose applying an L_1 loss to the VAE's generative Jacobian during training to encourage local alignment of the latent variables with independent factors of variation in images of multiple objects or images with multiple parts. We demonstrate our approach on a variety of datasets, giving qualitative results as well as quantitative results based on information-theoretic and modularity measures; these show that the added L_1 cost encourages local axis alignment of the latent representation with individual factors of variation.
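To make the idea of an L_1 cost on the generative Jacobian concrete, the following is a minimal NumPy sketch, not the authors' implementation. The toy one-hidden-layer decoder, its dimensions, and all function names (`decoder`, `decoder_jacobian`, `jacobian_l1`, `numerical_jacobian`) are assumptions made for illustration; the abstract does not specify the architecture or how the Jacobian is computed during training. The second half illustrates the rotation ambiguity mentioned above: rotating the latent space leaves the Frobenius (L_2) norm of a Jacobian unchanged but increases its L_1 norm when the unrotated Jacobian is axis-aligned.

```python
import numpy as np


def decoder(z, W1, b1, W2, b2):
    """Toy decoder g(z) with one tanh hidden layer (hypothetical architecture)."""
    return W2 @ np.tanh(W1 @ z + b1) + b2


def decoder_jacobian(z, W1, b1, W2, b2):
    """Analytic Jacobian dg/dz of the toy decoder, shape (output_dim, latent_dim)."""
    h = np.tanh(W1 @ z + b1)
    # Chain rule: W2 @ diag(1 - h^2) @ W1
    return W2 @ ((1.0 - h**2)[:, None] * W1)


def jacobian_l1(J):
    """L_1 penalty: sum of absolute values of all Jacobian entries."""
    return np.abs(J).sum()


def numerical_jacobian(f, z, eps=1e-6):
    """Central finite-difference Jacobian, used only to check the analytic one."""
    cols = []
    for i in range(z.size):
        dz = np.zeros_like(z)
        dz[i] = eps
        cols.append((f(z + dz) - f(z - dz)) / (2.0 * eps))
    return np.stack(cols, axis=1)


# Random toy decoder parameters (illustrative sizes only).
rng = np.random.default_rng(0)
latent_dim, hidden_dim, output_dim = 2, 8, 5
W1 = rng.normal(size=(hidden_dim, latent_dim))
b1 = rng.normal(size=hidden_dim)
W2 = rng.normal(size=(output_dim, hidden_dim))
b2 = rng.normal(size=output_dim)
z = rng.normal(size=latent_dim)

J = decoder_jacobian(z, W1, b1, W2, b2)
J_num = numerical_jacobian(lambda v: decoder(v, W1, b1, W2, b2), z)
penalty = jacobian_l1(J)  # the quantity an L_1 Jacobian cost would penalize at z

# Rotation ambiguity: an axis-aligned Jacobian versus the same map after
# rotating the latent coordinates by 45 degrees.
theta = np.pi / 4
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta), np.cos(theta)]])
J_aligned = np.eye(2)        # each latent moves exactly one output coordinate
J_rotated = J_aligned @ R    # same decoder expressed in rotated latents

l1_aligned = jacobian_l1(J_aligned)
l1_rotated = jacobian_l1(J_rotated)
fro_aligned = np.linalg.norm(J_aligned)  # Frobenius norm
fro_rotated = np.linalg.norm(J_rotated)
```

In this sketch the Frobenius norms of `J_aligned` and `J_rotated` coincide, while the L_1 norm is smaller for the axis-aligned Jacobian, which is the intuition behind using an L_1 cost to favor latent axes that each affect few outputs. In a training loop such a penalty would presumably be added to the usual VAE objective with some weight; that weight and the exact form of the combined loss are not given in this abstract.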

updated: Wed Oct 27 2021 21:07:52 GMT+0000 (UTC)

published: Sat Jun 05 2021 15:40:55 GMT+0000 (UTC)

arXiv
