Spectral Leakage and Rethinking the Kernel Size in CNNs

Nergis Tomen; Jan van Gemert

CNNにおけるスペクトル漏れとカーネルサイズの再考

CNNの畳み込み層は、入力をさまざまな周波数帯域に分解する線形フィルターを実装します。ただし、最新のアーキテクチャのほとんどは、畳み込みカーネルのサイズと形状に関するモデルの選択を最適化するときに、フィルター設計の標準的な原則を無視しています。この作業では、CNNのコンテキストでのフィルタリング操作でウィンドウアーティファクトによって引き起こされるスペクトル漏れのよく知られた問題を検討します。 CNNカーネルのサイズが小さいと、スペクトル漏れの影響を受けやすくなり、パフォーマンスを低下させるアーティファクトが発生する可能性があることを示します。この問題に対処するために、ハミングウィンドウ関数とともにより大きなカーネルサイズを使用して、CNNアーキテクチャのリークを軽減することを提案します。畳み込み層で標準のウィンドウ関数を使用するだけで、Fashion-MNIST、CIFAR-10、CIFAR-100、ImageNetなどの複数のベンチマークデータセットで分類精度が向上することを示します。最後に、ハミングウィンドウを使用するCNNが、さまざまな敵対的攻撃に対する堅牢性を向上させたことを示します。

Convolutional layers in CNNs implement linear filters which decompose the input into different frequency bands. However, most modern architectures neglect standard principles of filter design when optimizing their model choices regarding the size and shape of the convolutional kernel. In this work, we consider the well-known problem of spectral leakage caused by windowing artifacts in filtering operations in the context of CNNs. We show that the small size of CNN kernels make them susceptible to spectral leakage, which may induce performance-degrading artifacts. To address this issue, we propose the use of larger kernel sizes along with the Hamming window function to alleviate leakage in CNN architectures. We demonstrate improved classification accuracy on multiple benchmark datasets including Fashion-MNIST, CIFAR-10, CIFAR-100 and ImageNet with the simple use of a standard window function in convolutional layers. Finally, we show that CNNs employing the Hamming window display increased robustness against various adversarial attacks.

updated: Thu Jul 29 2021 10:30:21 GMT+0000 (UTC)

published: Mon Jan 25 2021 14:49:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト