Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks

Yunshan Zhong; Mingbao Lin; Xunchao Li; Ke Li; Yunhang Shen; Fei Chao; Yongjian Wu; Rongrong Ji

Lightweight super-resolution (SR) models have received considerable attention for their serviceability on mobile devices. Many efforts employ network quantization to compress SR models. However, these methods suffer severe performance degradation when the SR models are quantized to ultra-low precision (e.g., 2-bit and 3-bit) with a low-cost layer-wise quantizer. In this paper, we identify that the performance drop comes from the mismatch between the layer-wise symmetric quantizer and the highly asymmetric activation distribution in SR models. This mismatch leads to either wasted quantization levels or loss of detail in the reconstructed images. Therefore, we propose a novel activation quantizer, referred to as Dynamic Dual Trainable Bounds (DDTB), to accommodate the asymmetry of the activations. Specifically, DDTB introduces two components: 1) a layer-wise quantizer with trainable upper and lower bounds to handle the highly asymmetric activations; 2) a dynamic gate controller that adaptively adjusts the upper and lower bounds at runtime to cope with activation ranges that vary drastically across samples. To reduce the extra overhead, the dynamic gate controller is quantized to 2-bit and applied to only part of the SR network, according to the introduced dynamic intensity. Extensive experiments demonstrate that DDTB yields significant performance improvements at ultra-low precision. For example, DDTB achieves a 0.70 dB PSNR increase on the Urban100 benchmark when quantizing EDSR to 2-bit and upscaling output images by ×4. Code is at https://github.com/zysxmu/DDTB.
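The mismatch between a symmetric quantizer and asymmetric activations can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the sample activations, and the bound values are assumptions chosen for illustration, the bounds are fixed rather than trained, and the dynamic gate controller is omitted. It only shows why separate lower and upper bounds make better use of the four levels available at 2-bit.

```python
import numpy as np

def quantize_asymmetric(x, lower, upper, bits):
    """Uniformly quantize x onto 2**bits levels spanning [lower, upper].

    Illustrative only: in DDTB the two bounds are trainable; here they
    are plain fixed numbers.
    """
    steps = 2 ** bits - 1                  # number of intervals between levels
    scale = (upper - lower) / steps        # width of one quantization step
    clipped = np.clip(x, lower, upper)     # values outside the bounds saturate
    codes = np.round((clipped - lower) / scale)
    return codes * scale + lower           # map integer codes back to real values

def quantize_symmetric(x, bound, bits):
    """Symmetric special case: the range is forced to [-bound, +bound]."""
    return quantize_asymmetric(x, -bound, bound, bits)

# Hypothetical, highly asymmetric activations: a short negative tail
# and a long positive tail.
acts = np.array([-0.2, -0.1, 0.0, 0.5, 1.0, 2.0, 3.0])

sym = quantize_symmetric(acts, bound=3.0, bits=2)
asym = quantize_asymmetric(acts, lower=-0.2, upper=3.0, bits=2)

sym_err = np.mean(np.abs(acts - sym))
asym_err = np.mean(np.abs(acts - asym))

print("symmetric levels used: ", np.unique(np.round(sym, 6)))
print("asymmetric levels used:", np.unique(np.round(asym, 6)))
print("mean abs error (symmetric, asymmetric):", sym_err, asym_err)
```

With the symmetric range, one of the four levels sits deep in the negative region where no activation falls, so it is never used; the asymmetric range spends all four levels on values that actually occur.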

updated: Thu Mar 10 2022 06:58:24 GMT+0000 (UTC)

published: Tue Mar 08 2022 04:26:18 GMT+0000 (UTC)

arXiv
