Use the Detection Transformer as a Data Augmenter

Luping Wang; Bin Liu

検出トランスフォーマーをデータオーグメンターとして使用する

検出トランスフォーマー (DETR) は、トランスフォーマーアーキテクチャベースのオブジェクト検出モデルです。この論文では、それがデータオーグメンターとしても使用できることを示しています。私たちはこのアプローチを、DETR アシスト CutMix、略して DeMix と呼んでいます。 DeMix は、近年人気を博しているシンプルでありながら非常に効果的なデータ拡張手法である CutMix に基づいています。 CutMix は、ある画像から別の画像にパッチをカットアンドペーストして新しい画像を作成することで、モデルのパフォーマンスを向上させます。この新しい例に対応するラベルは、元のラベルの加重平均として指定されます。重みはパッチの面積に比例します。 CutMix はカットするパッチをランダムに選択します。対照的に、DeMix は、事前にトレーニングされた DETR によって配置された、意味的に豊富なパッチを精巧に選択します。新しい画像のラベルは、CutMix と同じ方法で指定されます。画像分類のベンチマークデータセットに関する実験結果は、DeMix が CutMix を含む従来技術のデータ拡張方法よりも大幅に優れていることを示しています。

Detection Transformer (DETR) is a Transformer architecture based object detection model. In this paper, we demonstrate that it can also be used as a data augmenter. We term our approach as DETR assisted CutMix, or DeMix for short. DeMix builds on CutMix, a simple yet highly effective data augmentation technique that has gained popularity in recent years. CutMix improves model performance by cutting and pasting a patch from one image onto another, yielding a new image. The corresponding label for this new example is specified as the weighted average of the original labels, where the weight is proportional to the area of the patches. CutMix selects a random patch to be cut. In contrast, DeMix elaborately selects a semantically rich patch, located by a pre-trained DETR. The label of the new image is specified in the same way as in CutMix. Experimental results on benchmark datasets for image classification demonstrate that DeMix significantly outperforms prior art data augmentation methods including CutMix.

updated: Mon Apr 10 2023 12:50:17 GMT+0000 (UTC)

published: Mon Apr 10 2023 12:50:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト