Generalizing Dataset Distillation via Deep Generative Prior

George Cazenavette; Tongzhou Wang; Antonio Torralba; Alexei A. Efros; Jun-Yan Zhu

Deep Generative Prior によるデータセット蒸留の一般化

データセット蒸留は、データセット全体の知識をいくつかの合成画像に蒸留することを目的としています。少数の合成データポイントを合成し、トレーニングデータとして学習アルゴリズムに与えると、元のデータでトレーニングされたモデルに近いモデルが得られます。この分野での最近の進歩にもかかわらず、既存のデータセットの抽出方法は、新しいアーキテクチャに一般化できず、高解像度のデータセットに拡張できません。上記の問題を克服するために、事前にトレーニングされた深い生成モデルから学習した事前確率を使用して、抽出されたデータを合成することを提案します。これを達成するために、生成モデルの潜在空間で多数の画像をいくつかの中間特徴ベクトルに抽出する新しい最適化アルゴリズムを提示します。私たちの方法は既存の手法を強化し、すべての設定でクロスアーキテクチャの一般化を大幅に改善します。

Dataset Distillation aims to distill an entire dataset's knowledge into a few synthetic images. The idea is to synthesize a small number of synthetic data points that, when given to a learning algorithm as training data, result in a model approximating one trained on the original data. Despite recent progress in the field, existing dataset distillation methods fail to generalize to new architectures and scale to high-resolution datasets. To overcome the above issues, we propose to use the learned prior from pre-trained deep generative models to synthesize the distilled data. To achieve this, we present a new optimization algorithm that distills a large number of images into a few intermediate feature vectors in the generative model's latent space. Our method augments existing techniques, significantly improving cross-architecture generalization in all settings.

updated: Wed May 03 2023 20:19:13 GMT+0000 (UTC)

published: Tue May 02 2023 17:59:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト