Learning Generalized Transformation Equivariant Representations via Autoencoding Transformations

Guo-Jun Qi; Liheng Zhang; Xiao Wang

自動エンコード変換を介した一般化された変換等価表現の学習

Transformation Equivariant Representations（TER）は、Convolutional Neural Networks（CNN）の成功の根底にある翻訳の等分散の概念を拡張することにより、さまざまな変換に対応する固有の視覚構造をキャプチャすることを目的としています。この目的のために、決定論的自動エンコード変換（AET）と確率的自動エンコード変分変換（AVT）モデルの両方を提示して、変換の一般的なグループから視覚的表現を学習します。 AETは学習された表現から変換を直接デコードすることによりトレーニングされますが、AVTは学習された表現と変換の間の相互情報を最大化することによりトレーニングされます。これにより、変換グループの下で従来の線形等分散を超える視覚構造の複雑なパターンをキャプチャすることにより、変換に対して一般化されたTER（GTER）がより一般的な方法で等価になります。提示されたアプローチは、ラベルと変換の両方で学習された表現の相互情報を共同で最大化することにより、（半）監視モデルに拡張できます。実験は、提案されたモデルが教師なしタスクと（半）教師付きタスクの両方で最先端のモデルよりも優れていることを示しています。

Transformation Equivariant Representations (TERs) aim to capture the intrinsic visual structures that equivary to various transformations by expanding the notion of translation equivariance underlying the success of Convolutional Neural Networks (CNNs). For this purpose, we present both deterministic AutoEncoding Transformations (AET) and probabilistic AutoEncoding Variational Transformations (AVT) models to learn visual representations from generic groups of transformations. While the AET is trained by directly decoding the transformations from the learned representations, the AVT is trained by maximizing the joint mutual information between the learned representation and transformations. This results in Generalized TERs (GTERs) equivariant against transformations in a more general fashion by capturing complex patterns of visual structures beyond the conventional linear equivariance under a transformation group. The presented approach can be extended to (semi-)supervised models by jointly maximizing the mutual information of the learned representation with both labels and transformations. Experiments demonstrate the proposed models outperform the state-of-the-art models in both unsupervised and (semi-)supervised tasks.

updated: Sun Nov 17 2019 16:27:59 GMT+0000 (UTC)

published: Wed Jun 19 2019 06:17:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト