Universal Deep Image Compression via Content-Adaptive Optimization with Adapters

Koki Tsubota; Hiroaki Akutsu; Kiyoharu Aizawa

アダプターを使用したコンテンツ適応最適化による普遍的な深層画像圧縮

深層画像圧縮は、自然画像に対して、JPEG などの従来のコーデックよりも優れたパフォーマンスを発揮します。ただし、深層画像圧縮は学習ベースであり、問題が発生します。ドメイン外の画像では、圧縮パフォーマンスが大幅に低下します。この研究では、この問題に焦点を当て、新しいタスクである普遍的な深層画像圧縮に取り組みます。このタスクは、自然画、線画、漫画など、任意の領域に属する画像を圧縮することを目的としています。この問題に対処するために、コンテンツ適応最適化フレームワークを提案します。このフレームワークは、事前にトレーニングされた圧縮モデルを使用し、圧縮中にモデルをターゲットイメージに適応させます。アダプターは、モデルのデコーダーに挿入されます。入力画像ごとに、フレームワークは、エンコーダーによって抽出された潜在的な表現と、レート歪みの観点からのアダプターパラメーターを最適化します。アダプターのパラメーターは、画像ごとに追加で送信されます。実験では、4 つのドメイン (自然画像、線画、漫画、ベクターアート) の非圧縮画像を含むベンチマークデータセットが構築され、提案された普遍的な深層圧縮が評価されます。最後に、提案されたモデルは、非適応および既存の適応圧縮モデルと比較されます。比較すると、提案されたモデルがこれらよりも優れていることがわかります。コードとデータセットは、https://github.com/kktsubota/universal-dic で公開されています。

Deep image compression performs better than conventional codecs, such as JPEG, on natural images. However, deep image compression is learning-based and encounters a problem: the compression performance deteriorates significantly for out-of-domain images. In this study, we highlight this problem and address a novel task: universal deep image compression. This task aims to compress images belonging to arbitrary domains, such as natural images, line drawings, and comics. To address this problem, we propose a content-adaptive optimization framework; this framework uses a pre-trained compression model and adapts the model to a target image during compression. Adapters are inserted into the decoder of the model. For each input image, our framework optimizes the latent representation extracted by the encoder and the adapter parameters in terms of rate-distortion. The adapter parameters are additionally transmitted per image. For the experiments, a benchmark dataset containing uncompressed images of four domains (natural images, line drawings, comics, and vector arts) is constructed and the proposed universal deep compression is evaluated. Finally, the proposed model is compared with non-adaptive and existing adaptive compression models. The comparison reveals that the proposed model outperforms these. The code and dataset are publicly available at https://github.com/kktsubota/universal-dic.

updated: Wed Nov 02 2022 07:01:30 GMT+0000 (UTC)

published: Wed Nov 02 2022 07:01:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト