Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation

Zeqi Li; Ruowei Jiang; Parham Aarabi

画像から画像への翻訳のための意味論的関係保存知識蒸留

敵対的生成ネットワーク（GAN）は、特に画像から画像への変換タスクにおいて、画像データの高次元分布をモデル化する上で大きな可能性を示しています。ただし、これらのタスクは複雑であるため、最先端のモデルには膨大な量のパラメーターが含まれていることが多く、その結果、モデルのサイズが大きくなり、推論時間が長くなります。この作業では、意味関係保存行列の蒸留と一緒に知識蒸留を適用することにより、この問題に対処するための新しい方法を提案します。教師の特徴エンコーディングから導出されたこのマトリックスは、学生モデルがより良い意味関係を学習するのに役立ちます。分類タスク用に設計された既存の圧縮方法とは対照的に、提案された方法は、GANでの画像から画像への変換タスクにうまく適応します。 5つの異なるデータセットと3つの異なる教師と生徒のモデルのペアで実施された実験は、私たちの方法が定性的および定量的に印象的な結果を達成するという強力な証拠を提供します。

Generative adversarial networks (GANs) have shown significant potential in modeling high dimensional distributions of image data, especially on image-to-image translation tasks. However, due to the complexity of these tasks, state-of-the-art models often contain a tremendous amount of parameters, which results in large model size and long inference time. In this work, we propose a novel method to address this problem by applying knowledge distillation together with distillation of a semantic relation preserving matrix. This matrix, derived from the teacher's feature encoding, helps the student model learn better semantic relations. In contrast to existing compression methods designed for classification tasks, our proposed method adapts well to the image-to-image translation task on GANs. Experiments conducted on 5 different datasets and 3 different pairs of teacher and student models provide strong evidence that our methods achieve impressive results both qualitatively and quantitatively.

updated: Wed May 19 2021 01:44:41 GMT+0000 (UTC)

published: Fri Apr 30 2021 16:04:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト