WaveCNet: Wavelet Integrated CNNs to Suppress Aliasing Effect for Noise-Robust Image Classification

Qiufu Li; Linlin Shen; Sheng Guo; Zhihui Lai

WaveCNet：ノイズに強い画像分類のためのエイリアシング効果を抑制するウェーブレット統合CNN

画像分類で広く使用されていますが、畳み込みニューラルネットワーク（CNN）はノイズが途切れる傾向があります。つまり、CNN出力は小さな画像ノイズによって大幅に変化する可能性があります。ノイズのロバスト性を向上させるために、一般的なダウンサンプリング（最大プーリング、ストライド畳み込み、平均プーリング）を離散ウェーブレット変換（DWT）に置き換えることにより、CNNをウェーブレットと統合しようとしています。まず、Haar、Daubechies、Cohenなどのさまざまな直交および双直交離散ウェーブレットに適用可能な一般的なDWTおよび逆DWT（IDWT）層を提案し、次にDWTを一般的に使用されるCNN（VGG）に統合することによってウェーブレット統合CNN（WaveCNets）を設計します。、ResNets、およびDenseNet）。ダウンサンプリング中に、WaveCNetsはDWTを適用して、フィーチャマップを低周波数成分と高周波数成分に分解します。基本的なオブジェクト構造を含む主要な情報を含む低周波成分は、次のレイヤーに送信され、堅牢な高レベルの機能を生成します。高周波成分は、ほとんどのデータノイズを除去するためにドロップされます。実験結果は、％waveletがCNNトレーニングを加速し、WaveCNetsがさまざまなバニラCNNよりもImageNetで高い精度を達成することを示しています。また、ノイズの多いバージョンのImageNet、ImageNet-C、および6つの敵対的攻撃に対するWaveCNetsのパフォーマンスをテストしました。結果は、提案されたDWT / IDWTレイヤーがより優れたノイズロバスト性と敵対的ロバスト性を提供できることを示唆しています。 WaveCNetsをバックボーンとして適用すると、COCO検出データセットでのオブジェクト検出器（つまり、より高速なR-CNNとRetinaNet）のパフォーマンスが一貫して向上します。エイリアシング効果の抑制、つまり低周波数情報と高周波数情報の分離が、私たちのアプローチの主な利点であると信じています。 DWT / IDWTレイヤーとさまざまなWaveCNetsのコードは、https：//github.com/CVI-SZU/WaveCNetで入手できます。

Though widely used in image classification, convolutional neural networks (CNNs) are prone to noise interruptions, i.e. the CNN output can be drastically changed by small image noise. To improve the noise robustness, we try to integrate CNNs with wavelet by replacing the common down-sampling (max-pooling, strided-convolution, and average pooling) with discrete wavelet transform (DWT). We firstly propose general DWT and inverse DWT (IDWT) layers applicable to various orthogonal and biorthogonal discrete wavelets like Haar, Daubechies, and Cohen, etc., and then design wavelet integrated CNNs (WaveCNets) by integrating DWT into the commonly used CNNs (VGG, ResNets, and DenseNet). During the down-sampling, WaveCNets apply DWT to decompose the feature maps into the low-frequency and high-frequency components. Containing the main information including the basic object structures, the low-frequency component is transmitted into the following layers to generate robust high-level features. The high-frequency components are dropped to remove most of the data noises. The experimental results show that %wavelet accelerates the CNN training, and WaveCNets achieve higher accuracy on ImageNet than various vanilla CNNs. We have also tested the performance of WaveCNets on the noisy version of ImageNet, ImageNet-C and six adversarial attacks, the results suggest that the proposed DWT/IDWT layers could provide better noise-robustness and adversarial robustness. When applying WaveCNets as backbones, the performance of object detectors (i.e., faster R-CNN and RetinaNet) on COCO detection dataset are consistently improved. We believe that suppression of aliasing effect, i.e. separation of low frequency and high frequency information, is the main advantages of our approach. The code of our DWT/IDWT layer and different WaveCNets are available at https://github.com/CVI-SZU/WaveCNet.

updated: Wed Jul 28 2021 12:59:15 GMT+0000 (UTC)

published: Wed Jul 28 2021 12:59:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト