No Routing Needed Between Capsules

Adam Byerly; Tatiana Kalganova; Ian Dear

カプセル間のルーティングは必要ありません

ほとんどのカプセルネットワーク設計は、カプセルレイヤー間の従来の行列乗算と、計算コストの高いルーティングメカニズムに依存して、行列乗算によって発生するカプセルの次元エンタングルメントを処理します。行列の乗算ではなく要素ごとの乗算を使用する同次ベクトルカプセル（HVC）を使用することにより、カプセルの寸法は絡み合わないままになります。この作業では、ジェフリーヒントンらのカプセル研究の方向性と直接比較するために、高度に構造化されたMNISTデータセットに適用されるHVCを研究します。私たちの研究では、HVCを使用した単純な畳み込みニューラルネットワークが、5.5分の1のパラメーター、4分の1のトレーニングエポック、再構築サブネットワークを使用せず、ルーティングメカニズムを必要としない、MNISTで以前の最高のパフォーマンスのカプセルネットワークと同様に機能することを示します。ネットワークに複数の分類ブランチを追加すると、これらのモデルのアンサンブルに対して99.87％の精度でMNISTデータセットの新しい最先端が確立され、単一のモデル（99.83％）に対して新しい最先端が確立されます。正確）。

Most capsule network designs rely on traditional matrix multiplication between capsule layers and computationally expensive routing mechanisms to deal with the capsule dimensional entanglement that the matrix multiplication introduces. By using Homogeneous Vector Capsules (HVCs), which use element-wise multiplication rather than matrix multiplication, the dimensions of the capsules remain unentangled. In this work, we study HVCs as applied to the highly structured MNIST dataset in order to produce a direct comparison to the capsule research direction of Geoffrey Hinton, et al. In our study, we show that a simple convolutional neural network using HVCs performs as well as the prior best performing capsule network on MNIST using 5.5x fewer parameters, 4x fewer training epochs, no reconstruction sub-network, and requiring no routing mechanism. The addition of multiple classification branches to the network establishes a new state of the art for the MNIST dataset with an accuracy of 99.87% for an ensemble of these models, as well as establishing a new state of the art for a single model (99.83% accurate).

updated: Thu Jun 17 2021 20:14:13 GMT+0000 (UTC)

published: Fri Jan 24 2020 18:37:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト