Image-to-image Transformation with Auxiliary Condition

Robert Leer; Hessi Roma; James Amelia

補助条件による画像から画像への変換

シミュレートされた画像でトレーニングされた人間の姿勢検出のような画像認識のパフォーマンスは、通常、実際のデータとシミュレートされたデータの相違のために悪化します。シミュレートされた画像の分布を実際の画像の分布に近づけるために、GANベースの画像から画像への変換方法を適用するいくつかの作業があります（SimGANやCycleGANなど）。ただし、これらの方法は、特にトレーニングデータが不均衡な場合、たとえば、トレーニングデータ内の特定のポーズや形状が小さい場合、被験者のポーズや形状のさまざまな変化に十分に敏感ではありません。この問題を克服するために、CycleGANのトレーニングで被験者のラベル情報（ポーズやオブジェクトのタイプなど）を導入し、ラベルごとの変換モデルを取得するように導くことを提案します。 SVHNからMNISTへの数字画像変換と、シミュレーション画像から実画像への監視カメラ画像変換の実験を通じて、Label-CycleGANと呼ばれる提案手法を評価します。

The performance of image recognition like human pose detection, trained with simulated images would usually get worse due to the divergence between real and simulated data. To make the distribution of a simulated image close to that of real one, there are several works applying GAN-based image-to-image transformation methods, e.g., SimGAN and CycleGAN. However, these methods would not be sensitive enough to the various change in pose and shape of subjects, especially when the training data are imbalanced, e.g., some particular poses and shapes are minor in the training data. To overcome this problem, we propose to introduce the label information of subjects, e.g., pose and type of objects in the training of CycleGAN, and lead it to obtain label-wise transforamtion models. We evaluate our proposed method called Label-CycleGAN, through experiments on the digit image transformation from SVHN to MNIST and the surveillance camera image transformation from simulated to real images.

updated: Fri Jun 25 2021 15:33:11 GMT+0000 (UTC)

published: Fri Jun 25 2021 15:33:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト