Towards Composable Distributions of Latent Space Augmentations

Omead Pooladzandi; Jeffrey Jiang; Sunay Bhat; Gregory Pottie

潜在空間拡張の構成可能な分布に向けて

複数の増強を簡単に組み合わせることを可能にする潜在空間画像増強のための構成可能なフレームワークを提案します。画像拡張は、さまざまな画像分類および生成タスクのパフォーマンスを向上させる効果的な手法であることが示されています。私たちのフレームワークは、Variational Autoencoder アーキテクチャに基づいており、潜在空間自体内での線形変換による増強のための新しいアプローチを使用しています。損失と拡張の潜在的なジオメトリを調査して、変換を構成可能で非自発的にすることで、変換を容易に結合または反転できるようにします。最後に、これらのプロパティは特定の拡張のペアでより優れたパフォーマンスを発揮することを示しますが、潜在空間を他の拡張のセットに転送してパフォーマンスを変更し、VAE のボトルネックを効果的に制約して、特定の拡張と画像の特徴の分散を維持することができます。気にする。標準 VAE と条件付き VAE の両方に対する MNIST データセットの初期結果を使用して、アプローチの有効性を示します。この潜在増強法により、潜在空間のより優れた制御と幾何学的解釈が可能になり、この分野の研究者や実務家にとって貴重なツールになります。

We propose a composable framework for latent space image augmentation that allows for easy combination of multiple augmentations. Image augmentation has been shown to be an effective technique for improving the performance of a wide variety of image classification and generation tasks. Our framework is based on the Variational Autoencoder architecture and uses a novel approach for augmentation via linear transformation within the latent space itself. We explore losses and augmentation latent geometry to enforce the transformations to be composable and involuntary, thus allowing the transformations to be readily combined or inverted. Finally, we show these properties are better performing with certain pairs of augmentations, but we can transfer the latent space to other sets of augmentations to modify performance, effectively constraining the VAE's bottleneck to preserve the variance of specific augmentations and features of the image which we care about. We demonstrate the effectiveness of our approach with initial results on the MNIST dataset against both a standard VAE and a Conditional VAE. This latent augmentation method allows for much greater control and geometric interpretability of the latent space, making it a valuable tool for researchers and practitioners in the field.

updated: Mon Mar 06 2023 19:37:01 GMT+0000 (UTC)

published: Mon Mar 06 2023 19:37:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト