Expanding Small-Scale Datasets with Guided Imagination

Yifan Zhang; Daquan Zhou; Bryan Hooi; Kai Wang; Jiashi Feng

ガイド付きの想像力で小規模データセットを拡張する

DNN の能力は、トレーニングデータの量と質に大きく依存します。ただし、大規模なデータの収集と注釈付けには多くの場合、費用と時間がかかり、DNN の適用を大きく妨げます。この問題に対処するために、新しいラベル付きサンプルを自動的に作成することにより、すぐに使用できる小さなデータセットを拡張しようとする、データセット拡張と呼ばれる新しいタスクを調査します。この目的のために、最先端の生成モデル (DALL-E2、Stable Diffusion (SD) など) を活用して「想像」し、入力シードデータから有益な新しいデータを作成する Guided Imagination Framework (GIF) を提示します。具体的には、GIF は、新しいコンテンツを含む写真のようにリアルな画像を作成するために使用される、前のモデルの意味的に意味のある空間でシードデータの潜在的な特徴を最適化することにより、データの想像を行います。モデルトレーニング用の有益なサンプルを作成するための想像力を導くために、2 つの重要な基準、つまり、クラスで維持される情報のブーストとサンプルの多様性の促進を導入します。 2 つの基準は、効果的なデータセット拡張に不可欠であることが検証されています。GIF-SD は、SD を使用したガイドなしの拡張よりも、自然画像データセットで 13.5% 高いモデル精度を取得します。これらの重要な基準により、GIF はさまざまな小規模データシナリオでデータセットを効果的に拡張し、モデルの精度を 6 つの自然画像データセットで平均 36.9%、3 つの医療データセットで平均 13.5% 向上させます。ソースコードは、https://github.com/Vanint/DatasetExpansion で公開されます。

The power of DNNs depends heavily on the quantity and quality of training data. However, collecting and annotating data on a large scale is often costly and time-consuming, which severely hinders the application of DNNs. To address this issue, we explore a new task, termed as dataset expansion, which seeks to expand a ready-to-use small dataset by automatically creating new labeled samples. To this end, we present a Guided Imagination Framework (GIF) that leverages cutting-edge generative models (e.g., DALL-E2, Stable Diffusion (SD)) to ``imagine'' and create informative new data from the input seed data. Specifically, GIF conducts data imagination by optimizing the latent features of the seed data in the semantically meaningful space of the prior model, which are used to create photo-realistic images with new content. To guide the imagination towards creating informative samples for model training, we introduce two key criteria, i.e., class-maintained information boosting and sample diversity promotion. The two criteria are verified to be essential for effective dataset expansion: GIF-SD obtains 13.5% higher model accuracy on natural image datasets than unguided expansion with SD. With these essential criteria, GIF expands datasets effectively in various small-data scenarios, boosting model accuracy by 36.9% on average over six natural image datasets and by 13.5% on average over three medical datasets. The source code will be released: https://github.com/Vanint/DatasetExpansion.

updated: Fri Mar 03 2023 12:50:12 GMT+0000 (UTC)

published: Fri Nov 25 2022 09:38:22 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト