Dataset Distillation by Matching Training Trajectories

George Cazenavette; Tongzhou Wang; Antonio Torralba; Alexei A. Efros; Jun-Yan Zhu

トレーニング軌道のマッチングによるデータセットの抽出

データセットの蒸留は、合成セットでトレーニングされたモデルが完全なデータセットでトレーニングされたモデルのテスト精度と一致するように、小さなデータセットを合成するタスクです。このホワイトペーパーでは、蒸留データを最適化して、ネットワークを多くのトレーニングステップで実際のデータでトレーニングされたものと同様の状態に導く新しい定式化を提案します。ネットワークが与えられると、蒸留データで数回の反復のためにそれをトレーニングし、合成的にトレーニングされたパラメーターと実際のデータでトレーニングされたパラメーターとの間の距離に関して蒸留データを最適化します。大規模なデータセットの初期ネットワークパラメータとターゲットネットワークパラメータを効率的に取得するために、実際のデータセットでトレーニングされたエキスパートネットワークのトレーニング軌跡を事前に計算して保存します。私たちの方法は、既存の方法よりも手軽に優れており、より高解像度の視覚データを抽出することもできます。

Dataset distillation is the task of synthesizing a small dataset such that a model trained on the synthetic set will match the test accuracy of the model trained on the full dataset. In this paper, we propose a new formulation that optimizes our distilled data to guide networks to a similar state as those trained on real data across many training steps. Given a network, we train it for several iterations on our distilled data and optimize the distilled data with respect to the distance between the synthetically trained parameters and the parameters trained on real data. To efficiently obtain the initial and target network parameters for large-scale datasets, we pre-compute and store training trajectories of expert networks trained on the real dataset. Our method handily outperforms existing methods and also allows us to distill higher-resolution visual data.

updated: Tue Mar 22 2022 17:58:59 GMT+0000 (UTC)

published: Tue Mar 22 2022 17:58:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト