Carton dataset synthesis method for domain shift based on foreground texture decoupling and replacement

Lijun Gou; Shengkai Wu; Jinrong Yang; Hangcheng Yu; Chenxi Lin; Xiaoping Li; Chao Deng

前景テクスチャのデカップリングと置換に基づくドメインシフトのためのカートンデータセット合成方法

産業用アプリケーション向けのオブジェクト検出モデルを迅速に展開する上での大きな障害の1つは、大きな注釈付きデータセットがないことです。現在、包括的な製薬ロジスティクス会社（CPLC）、eコマースロジスティクス会社（ECLC）、果物市場（FM）などの3つのシナリオからのカートン画像を含むSacked Carton Dataset（SCD）を提示しています。ただし、ドメインシフトのため、SCDの3つのシナリオのいずれかでトレーニングされたモデルは、残りのシナリオに適用した場合、一般化能力が低くなります。この問題を解決するために、ソースデータセットの前景テクスチャをターゲットデータセットのテクスチャに置き換える新しい画像合成方法が提案されています。私たちの方法は、前景オブジェクトと背景のコンテキスト関係を変更せずに維持し、ターゲットデータセットを大幅に増強することができます。まず、各インスタンスのテクスチャデカップリングを実現するための表面セグメンテーションアルゴリズムを提案します。次に、インスタンスのオクルージョンとトランケーションの関係を変更しないようにするために、輪郭再構成アルゴリズムが提案されます。最後に、ガウス融合アルゴリズムを使用して、ソースデータセットの前景テクスチャをターゲットデータセットのテクスチャに置き換えます。新しい画像合成方法は、ターゲットドメインのAPをRetinaNetで少なくとも4.3％〜6.5％、Faster R-CNNで3.4％〜6.8％大幅に向上させることができます。コードはhttps://github.com/hustgetlijun/RCANで入手できます。

One major impediment in rapidly deploying object detection models for industrial applications is the lack of large annotated datasets. We currently have presented the Sacked Carton Dataset(SCD) that contains carton images from three scenarios, such as comprehensive pharmaceutical logistics company(CPLC), e-commerce logistics company(ECLC), fruit market(FM). However, due to domain shift, the model trained with one of the three scenarios in SCD has poor generalization ability when applied to the rest scenarios. To solve this problem, a novel image synthesis method is proposed to replace the foreground texture of the source datasets with the texture of the target datasets. Our method can keep the context relationship of foreground objects and backgrounds unchanged and greatly augment the target datasets. We firstly propose a surface segmentation algorithm to achieve texture decoupling of each instance. Secondly, a contour reconstruction algorithm is proposed to keep the occlusion and truncation relationship of the instance unchanged. Finally, the Gaussian fusion algorithm is used to replace the foreground texture from the source datasets with the texture from the target datasets. The novel image synthesis method can largely boost AP by at least 4.3%~6.5% on RetinaNet and 3.4%~6.8% on Faster R-CNN for the target domain. Code is available at https://github.com/hustgetlijun/RCAN.

updated: Mon Apr 26 2021 11:13:11 GMT+0000 (UTC)

published: Fri Mar 19 2021 11:21:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト