MALICE: Manipulation Attacks on Learned Image ComprEssion

Kang Liu; Di Wu; Yiru Wang; Dan Feng; Benjamin Tan; Siddharth Garg

MALICE: 学習した画像圧縮に対する操作攻撃

深層学習技術は、画像圧縮において有望な結果を示しており、競争力のあるビットレートと圧縮された潜在画像からの画像再構成品質を備えています。ただし、画像圧縮は、より高いピーク信号対ノイズ比 (PSNR) とより少ないピクセルあたりのビット数 (bpp) に向かって進歩してきましたが、敵対的な画像に対する堅牢性は検討されたことはありません。この作業では、入力画像のわずかな摂動が圧縮された潜在的なビットレートの大幅な増加を引き起こす可能性がある画像圧縮システムの堅牢性を初めて調査します。最先端の学習済み画像圧縮の堅牢性を特徴付けるために、ホワイトボックス攻撃とブラックボックス攻撃をマウントします。私たちのホワイトボックス攻撃は、ビットストリームのエントロピー推定に高速勾配符号法を採用し、ビットレートの近似値として使用します。ブラックボックス攻撃の代替として、アーキテクチャのシンプルさと軽量のトレーニングでJPEG圧縮をシミュレートするDCT-Netを提案し、高速な敵対的転送可能性を可能にします。それぞれが 6 つの異なるビットレート品質を持つ 6 つの画像圧縮モデル (合計 36 モデル) の結果は、ホワイトボックス攻撃が最大 56.326x、ブラックボックス攻撃が 1.947x bpp の変化を達成する驚くほど壊れやすいことを示しています。堅牢性を向上させるために、アテンションモジュールと基本的な因数分解されたエントロピーモデルを組み込んだ新しい圧縮アーキテクチャ factorAtn を提案します。これにより、レート歪み性能と、既存の学習済み画像コンプレッサーを超える敵対的攻撃に対する堅牢性との間の有望なトレードオフが実現します。

Deep learning techniques have shown promising results in image compression, with competitive bitrate and image reconstruction quality from compressed latent. However, while image compression has progressed towards a higher peak signal-to-noise ratio (PSNR) and fewer bits per pixel (bpp), their robustness to adversarial images has never received deliberation. In this work, we, for the first time, investigate the robustness of image compression systems where imperceptible perturbation of input images can precipitate a significant increase in the bitrate of their compressed latent. To characterize the robustness of state-of-the-art learned image compression, we mount white-box and black-box attacks. Our white-box attack employs fast gradient sign method on the entropy estimation of the bitstream as its bitrate approximation. We propose DCT-Net simulating JPEG compression with architectural simplicity and lightweight training as the substitute in the black-box attack and enable fast adversarial transferability. Our results on six image compression models, each with six different bitrate qualities (thirty-six models in total), show that they are surprisingly fragile, where the white-box attack achieves up to 56.326x and black-box 1.947x bpp change. To improve robustness, we propose a novel compression architecture factorAtn which incorporates attention modules and a basic factorized entropy model, resulting in a promising trade-off between the rate-distortion performance and robustness to adversarial attacks that surpasses existing learned image compressors.

updated: Tue Aug 23 2022 06:20:19 GMT+0000 (UTC)

published: Thu May 26 2022 09:46:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト