Enhanced Invertible Encoding for Learned Image Compression

Yueqi Xie; Ka Leong Cheng; Qifeng Chen

学習した画像圧縮のための強化された可逆エンコーディング

ディープラーニングベースの画像圧縮方法は、最近有望な進歩を遂げていますが、これらの方法のパフォーマンスは、最新の圧縮標準であるVersatile Video Coding（VVC）に匹敵するものではありません。最近の開発のほとんどは、潜在的な特徴の分布をより適切にパラメーター化できる、より正確で柔軟なエントロピーモデルの設計に焦点を合わせています。ただし、画像空間と潜在的な特徴空間の間のより良い変換を構築することに専念する努力はほとんどありません。このホワイトペーパーでは、以前のオートエンコーダスタイルのネットワークを使用してこの変換を構築する代わりに、情報損失の問題を大幅に軽減して圧縮を向上させる、可逆ニューラルネットワーク（INN）を備えた拡張可逆エンコーディングネットワークを提案します。 Kodak、CLIC、およびTecnickデータセットの実験結果は、特に高解像度画像の場合、私たちの方法がVVC（VTM 12.1）を含む既存の学習済み画像圧縮方法および圧縮標準よりも優れていることを示しています。ソースコードはhttps://github.com/xyq7/InvCompressで入手できます。

Although deep learning based image compression methods have achieved promising progress these days, the performance of these methods still cannot match the latest compression standard Versatile Video Coding (VVC). Most of the recent developments focus on designing a more accurate and flexible entropy model that can better parameterize the distributions of the latent features. However, few efforts are devoted to structuring a better transformation between the image space and the latent feature space. In this paper, instead of employing previous autoencoder style networks to build this transformation, we propose an enhanced Invertible Encoding Network with invertible neural networks (INNs) to largely mitigate the information loss problem for better compression. Experimental results on the Kodak, CLIC, and Tecnick datasets show that our method outperforms the existing learned image compression methods and compression standards, including VVC (VTM 12.1), especially for high-resolution images. Our source code is available at https://github.com/xyq7/InvCompress.

updated: Sun Aug 08 2021 17:32:10 GMT+0000 (UTC)

published: Sun Aug 08 2021 17:32:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト