Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

Isaac J. Sledge; Jose C. Principe

より深い表現を学習するための深い予測コーディングネットワークにおけるより速い収束

ディーププレディクティブコーディングネットワーク（DPCN）は、階層的で生成的なモデルです。それらは、フィードフォワード接続とフィードバック接続に依存して、動的で状況に応じた方法で刺激の潜在的な特徴表現を調整します。 DPCNの重要な要素は、スパースで不変の機能を明らかにするための前後の推論手順です。ただし、この推論は主要な計算上のボトルネックです。学習の停滞により、ネットワークの深さが大幅に制限されます。ここでは、このボトルネックが発生する理由を証明します。次に、加速された近位勾配に基づく新しい前方推論戦略を提案します。この戦略は、DPCNに使用されるものよりも速い理論的収束保証を持っています。それは学習の停滞を克服します。また、深くて広い予測コーディングネットワークの構築が可能であることも示しています。このような畳み込みネットワークは、ネットワークがトレーニングされるオブジェクトのクラス全体を適切にキャプチャする受容野を実装します。これにより、ラボの以前の非畳み込みおよび畳み込みDPCNと比較して、特徴表現が改善されます。畳み込みオートエンコーダを超え、教師ありの方法でトレーニングされた畳み込みネットワークと同等の教師なしオブジェクト認識を生成します。

Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse, invariant features. However, this inference is a major computational bottleneck. It severely limits the network depth due to learning stagnation. Here, we prove why this bottleneck occurs. We then propose a new forward-inference strategy based on accelerated proximal gradients. This strategy has faster theoretical convergence guarantees than the one used for DPCNs. It overcomes learning stagnation. We also demonstrate that it permits constructing deep and wide predictive-coding networks. Such convolutional networks implement receptive fields that capture well the entire classes of objects on which the networks are trained. This improves the feature representations compared with our lab's previous non-convolutional and convolutional DPCNs. It yields unsupervised object recognition that surpass convolutional autoencoders and are on par with convolutional networks trained in a supervised manner.

updated: Sat May 15 2021 21:52:47 GMT+0000 (UTC)

published: Mon Jan 18 2021 02:30:13 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト