Data-Efficient Instance Segmentation with a Single GPU

Pengyu Chen; Wanhua Li; Jiwen Lu

単一のGPUを使用したデータ効率の高いインスタンスセグメンテーション

誰もが何百ものGPUやTPUを所有できるほど裕福なわけではありません。したがって、私たちは抜け道を見つけなければなりません。このホワイトペーパーでは、2021年のVIPriorsインスタンスセグメンテーションチャレンジで使用したデータ効率の高いインスタンスセグメンテーション方法を紹介します。私たちのソリューションは、強力なツールボックスであるmmdetectionに基づいたSwinTransformerの修正バージョンです。データ不足の問題を解決するために、ランダムフリップやマルチスケールトレーニングなどのデータ拡張を利用してモデルをトレーニングします。推論中、パフォーマンスを向上させるためにマルチスケールフュージョンが使用されます。トレーニングとテストの全段階で、1つのGPUのみを使用します。最終的に、THU_IVG_2018という名前のチームは、テストセットでAP @ 0.50：0.95に対して0.366の結果を達成しました。これは、GPUを1つだけ使用しながら、他のトップランクの方法と競合します。その上、私たちの方法は0.592のAP@0.50：0.95（中）を達成しました。これはすべての出場者の中で2番目にランクされています。主催者が発表したように、最終的に、私たちのチームはすべての出場者の中で3位にランクされました。

Not everyone is wealthy enough to have hundreds of GPUs or TPUs. Therefore, we've got to find a way out. In this paper, we introduce a data-efficient instance segmentation method we used in the 2021 VIPriors Instance Segmentation Challenge. Our solution is a modified version of Swin Transformer, based on the mmdetection which is a powerful toolbox. To solve the problem of lack of data, we utilize data augmentation including random flip and multiscale training to train our model. During inference, multiscale fusion is used to boost the performance. We only use a single GPU during the whole training and testing stages. In the end, our team named THU_IVG_2018 achieved the result of 0.366 for AP@0.50:0.95 on the test set, which is competitive with other top-ranking methods while only one GPU is used. Besides, our method achieved the AP@0.50:0.95 (medium) of 0.592, which ranks second among all contestants. In the end, our team ranked third among all the contestants, as announced by the organizers.

updated: Fri Oct 08 2021 16:31:52 GMT+0000 (UTC)

published: Fri Oct 01 2021 07:36:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト