Learning End-to-End Lossy Image Compression: A Benchmark

Yueyu Hu; Wenhan Yang; Zhan Ma; Jiaying Liu

Image compression is one of the most fundamental techniques and commonly used applications in the image and video processing field. Earlier methods built a well-designed pipeline, and efforts were made to improve all modules of the pipeline by handcrafted tuning. Later, tremendous contributions were made, especially when data-driven methods revitalized the domain with their excellent modeling capacities and flexibility in incorporating newly designed modules and constraints. Despite great progress, a systematic benchmark and comprehensive analysis of end-to-end learned image compression methods are lacking. In this paper, we first conduct a comprehensive literature survey of learned image compression methods. The literature is organized based on several aspects to jointly optimize the rate-distortion performance with a neural network, i.e., network architecture, entropy model and rate control. We describe milestones in cutting-edge learned image-compression methods, review a broad range of existing works, and provide insights into their historical development routes. With this survey, the main challenges of image compression methods are revealed, along with opportunities to address the related issues with recent advanced learning methods. This analysis provides an opportunity to take a further step towards higher-efficiency image compression. By introducing a coarse-to-fine hyperprior model for entropy estimation and signal reconstruction, we achieve improved rate-distortion performance, especially on high-resolution images. Extensive benchmark experiments demonstrate the superiority of our model in rate-distortion performance and time complexity on multi-core CPUs and GPUs.

updated: Tue Mar 09 2021 02:21:07 GMT+0000 (UTC)

published: Mon Feb 10 2020 13:13:43 GMT+0000 (UTC)

arXiv

