Spectral Tensor Train Parameterization of Deep Learning Layers

Anton Obukhov; Maxim Rakhuba; Alexander Liniger; Zhiwu Huang; Stamatios Georgoulis; Dengxin Dai; Luc Van Gool

深層学習層のスペクトルテンソルトレインのパラメーター化

ディープラーニングのコンテキストで、スペクトルプロパティが埋め込まれた重み行列の低ランクのパラメーター化を研究します。低ランクのプロパティはパラメータの効率につながり、マッピングを計算するときに計算のショートカットを取ることができます。スペクトル特性は、最適化問題の制約を受けることが多く、より良いモデルと最適化の安定性につながります。まず、重み行列のコンパクトなSVDパラメーター化を確認し、パラメーター化の冗長性ソースを特定します。さらに、テンソル列（TT）分解をコンパクトなSVDコンポーネントに適用し、スペクトルテンソル列パラメーター化（STTP）と呼ばれる固定TTランクテンソル多様体の非冗長で微分可能なパラメーター化を提案します。画像分類設定でのニューラルネットワーク圧縮の効果と、生成的敵対的トレーニング設定での圧縮と改善されたトレーニング安定性の両方の効果を示します。

We study low-rank parameterizations of weight matrices with embedded spectral properties in the Deep Learning context. The low-rank property leads to parameter efficiency and permits taking computational shortcuts when computing mappings. Spectral properties are often subject to constraints in optimization problems, leading to better models and stability of optimization. We start by looking at the compact SVD parameterization of weight matrices and identifying redundancy sources in the parameterization. We further apply the Tensor Train (TT) decomposition to the compact SVD components, and propose a non-redundant differentiable parameterization of fixed TT-rank tensor manifolds, termed the Spectral Tensor Train Parameterization (STTP). We demonstrate the effects of neural network compression in the image classification setting and both compression and improved training stability in the generative adversarial training setting.

updated: Tue Jul 13 2021 18:43:07 GMT+0000 (UTC)

published: Sun Mar 07 2021 00:15:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト