Single Image Texture Translation for Data Augmentation

Boyi Li; Yin Cui; Tsung-Yi Lin; Serge Belongie

データ拡張のための単一画像テクスチャ変換

画像合成の最近の進歩により、ソースドメインとターゲットドメイン間のマッピングを学習することで画像を翻訳することができます。既存の方法は、さまざまなデータセットでモデルをトレーニングすることによって分布を学習する傾向があり、結果は主に主観的な方法で評価されます。ただし、この分野での研究は比較的少なく、画像認識タスクでのセマンティック画像変換方法の使用の可能性を研究しています。このホワイトペーパーでは、データ拡張のための単一画像テクスチャ変換（SITT）の使用について説明します。最初に、ソーステクスチャの単一の入力に基づいてテクスチャを画像に変換するための軽量モデルを提案します。これにより、高速なトレーニングとテストが可能になります。次に、SITTに基づいて、ロングテールおよび数ショットの画像分類タスクでの拡張データの使用を検討します。提案された方法は、入力データをターゲットドメインに変換することができ、一貫して改善された画像認識性能につながることがわかります。最後に、SITTおよび関連する画像変換方法が、モデルトレーニングに対するデータ効率の高い拡張エンジニアリングアプローチの基礎をどのように提供できるかを調べます。

Recent advances in image synthesis enables one to translate images by learning the mapping between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on a variety of datasets, with results evaluated largely in a subjective manner. Relatively few works in this area, however, study the potential use of semantic image translation methods for image recognition tasks. In this paper, we explore the use of Single Image Texture Translation (SITT) for data augmentation. We first propose a lightweight model for translating texture to images based on a single input of source texture, allowing for fast training and testing. Based on SITT, we then explore the use of augmented data in long-tailed and few-shot image classification tasks. We find the proposed method is capable of translating input data into a target domain, leading to consistent improved image recognition performance. Finally, we examine how SITT and related image translation methods can provide a basis for a data-efficient, augmentation engineering approach to model training.

updated: Fri Jun 25 2021 17:59:04 GMT+0000 (UTC)

published: Fri Jun 25 2021 17:59:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト