Microdosing: Knowledge Distillation for GAN based Compression

Leonhard Helminger; Roberto Azevedo; Abdelaziz Djelouah; Markus Gross; Christopher Schroers

マイクロドージング：GANベースの圧縮のための知識蒸留

最近、学習した画像とビデオの圧縮に大きな進歩が見られました。特に、Generative Adversarial Networksの使用は、低ビットレート体制で印象的な結果をもたらしました。ただし、モデルサイズは、現在の最先端の提案では依然として重要な問題であり、既存のソリューションでは、デコード側でかなりの計算作業が必要です。これにより、現実的なシナリオでの使用とビデオ圧縮への拡張が制限されます。このホワイトペーパーでは、知識の蒸留を活用して、元のパラメータ数の何分の1かで同等の機能を備えた画像デコーダを取得する方法を示します。画像コーディングのサイド情報を使用したシーケンスの特殊化など、ソリューションのいくつかの側面を調査します。最後に、得られた利点をビデオ圧縮の設定に転送する方法も示します。全体として、これによりモデルサイズを20分の1に縮小し、デコード時間を50％短縮することができます。

Recently, significant progress has been made in learned image and video compression. In particular the usage of Generative Adversarial Networks has lead to impressive results in the low bit rate regime. However, the model size remains an important issue in current state-of-the-art proposals and existing solutions require significant computation effort on the decoding side. This limits their usage in realistic scenarios and the extension to video compression. In this paper, we demonstrate how to leverage knowledge distillation to obtain equally capable image decoders at a fraction of the original number of parameters. We investigate several aspects of our solution including sequence specialization with side information for image coding. Finally, we also show how to transfer the obtained benefits into the setting of video compression. Overall, this allows us to reduce the model size by a factor of 20 and to achieve 50% reduction in decoding time.

updated: Fri Jan 07 2022 14:27:16 GMT+0000 (UTC)

published: Fri Jan 07 2022 14:27:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト