FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

David W. Romero; Robert-Jan Bruintjes; Jakub M. Tomczak; Erik J. Bekkers; Mark Hoogendoorn; Jan C. van Gemert

FlexConv：微分可能なカーネルサイズの連続カーネル畳み込み

畳み込みニューラルネットワーク（CNN）を設計するときは、トレーニングの前に畳み込みカーネルのサイズを選択する必要があります。最近の研究では、CNNはさまざまなレイヤーでさまざまなカーネルサイズの恩恵を受けていることが示されていますが、考えられるすべての組み合わせを調査することは実際には実行不可能です。より効率的なアプローチは、トレーニング中にカーネルサイズを学習することです。ただし、カーネルサイズを学習する既存の作業では、帯域幅が制限されています。これらのアプローチは、拡張によってカーネルをスケーリングするため、説明できる詳細は限られています。この作業では、FlexConvを提案します。これは、学習可能なカーネルサイズの高帯域幅畳み込みカーネルを固定パラメーターコストで学習できる新しい畳み込み演算です。 FlexNetsは、プーリングを使用せずに長期的な依存関係をモデル化し、いくつかのシーケンシャルデータセットで最先端のパフォーマンスを実現し、学習したカーネルサイズで最近の作業を上回り、画像ベンチマークデータセットではるかに深いResNetと競合します。さらに、FlexNetは、トレーニング中に見られる解像度よりも高い解像度で展開できます。エイリアシングを回避するために、カーネルの頻度を分析的に制御できる新しいカーネルパラメータ化を提案します。私たちの新しいカーネルパラメータ化は、既存のパラメータ化よりも高い記述力と速い収束速度を示しています。これにより、分類精度が大幅に向上します。

When designing Convolutional Neural Networks (CNNs), one must select the size of the convolutional kernels before training. Recent works show CNNs benefit from different kernel sizes at different layers, but exploring all possible combinations is unfeasible in practice. A more efficient approach is to learn the kernel size during training. However, existing works that learn the kernel size have a limited bandwidth. These approaches scale kernels by dilation, and thus the detail they can describe is limited. In this work, we propose FlexConv, a novel convolutional operation with which high bandwidth convolutional kernels of learnable kernel size can be learned at a fixed parameter cost. FlexNets model long-term dependencies without the use of pooling, achieve state-of-the-art performance on several sequential datasets, outperform recent works with learned kernel sizes, and are competitive with much deeper ResNets on image benchmark datasets. Additionally, FlexNets can be deployed at higher resolutions than those seen during training. To avoid aliasing, we propose a novel kernel parameterization with which the frequency of the kernels can be analytically controlled. Our novel kernel parameterization shows higher descriptive power and faster convergence speed than existing parameterizations. This leads to important improvements in classification accuracy.

updated: Fri Oct 15 2021 12:35:49 GMT+0000 (UTC)

published: Fri Oct 15 2021 12:35:49 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト