Learning True Rate-Distortion-Optimization for End-To-End Image Compression

Fabian Brand; Kristian Fischer; Alexander Kopte; André Kaup

真のレート歪みの学習-エンドツーエンドの画像圧縮のための最適化

レート歪み最適化は従来の画像およびビデオ圧縮の重要な部分ですが、この概念をエンドツーエンドでトレーニングされた画像圧縮に移行するアプローチは多くありません。ほとんどのフレームワークには、トレーニング後に修正される静的圧縮および解凍モデルが含まれているため、効率的なレート歪み最適化は不可能です。以前の研究では、HEVCの適応ブロック分割に匹敵するRDOアプローチを可能にするRDONetを提案しました。この論文では、RDO結果の複雑度の低い推定をトレーニングに導入することにより、トレーニングを強化します。さらに、高速および非常に高速なRDO推論モードを提案します。新しいトレーニング方法を使用すると、MS-SSIMで以前のRDONetモデルに比べて19.6％の平均レート節約を達成できます。これは、同等の従来のディープイメージコーダーに比べて27.3％のレート節約に相当します。

Even though rate-distortion optimization is a crucial part of traditional image and video compression, not many approaches exist which transfer this concept to end-to-end-trained image compression. Most frameworks contain static compression and decompression models which are fixed after training, so efficient rate-distortion optimization is not possible. In a previous work, we proposed RDONet, which enables an RDO approach comparable to adaptive block partitioning in HEVC. In this paper, we enhance the training by introducing low-complexity estimations of the RDO result into the training. Additionally, we propose fast and very fast RDO inference modes. With our novel training method, we achieve average rate savings of 19.6% in MS-SSIM over the previous RDONet model, which equals rate savings of 27.3% over a comparable conventional deep image coder.

updated: Wed Jan 05 2022 13:02:00 GMT+0000 (UTC)

published: Wed Jan 05 2022 13:02:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト