Improving Object Detection by Label Assignment Distillation

Chuong H. Nguyen; Thuy C. Nguyen; Tuan N. Tang; Nam L. H. Phan

Label assignment in object detection aims to assign targets, foreground or background, to sampled regions in an image. Unlike labeling for image classification, this problem is not well defined due to the object's bounding box. In this paper, we investigate the problem from the perspective of distillation, hence the name Label Assignment Distillation (LAD). Our initial motivation is very simple: we use a teacher network to generate labels for the student. This can be achieved in two ways: either by using the teacher's predictions as the direct targets (soft label), or through the hard labels dynamically assigned by the teacher (LAD). Our experiments reveal that: (i) LAD is more effective than soft label, but the two are complementary; (ii) using LAD, a smaller teacher can also improve a larger student significantly, while soft label cannot. We then introduce Co-learning LAD, in which two networks simultaneously learn from scratch and the roles of teacher and student are dynamically interchanged. Using PAA-ResNet50 as a teacher, our LAD techniques can improve the detectors PAA-ResNet101 and PAA-ResNeXt101 to 46 AP and 47.5 AP on the COCO test-dev set. With a strong teacher, PAA-SwinB, we improve PAA-ResNet50 to 43.9 AP with only 1x schedule training, and PAA-ResNet101 to 47.9 AP, significantly surpassing the current methods. Our source code and checkpoints will be released at https://github.com/cybercore-co-ltd/CoLAD_paper.
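The difference between the two kinds of teacher supervision described in the abstract can be illustrated with a small toy example. This is only a sketch under stated assumptions and not the authors' implementation: the function names (`teacher_assign`, `bce`), the cost `-log(score) + (1 - IoU)`, and the top-k selection rule are all hypothetical stand-ins chosen for brevity (PAA itself uses a probabilistic assignment rather than a fixed top-k), and the boxes and scores are made-up numbers.

```python
import math

def iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def teacher_assign(teacher_scores, teacher_boxes, gt_box, top_k=2):
    """Hard labels derived from the teacher: 1 = foreground, 0 = background.

    Assumption: an anchor's cost is -log(score) + (1 - IoU), and the top_k
    lowest-cost anchors become positives. This is a simplified stand-in for
    a learned, dynamic assignment rule.
    """
    costs = [
        -math.log(max(s, 1e-9)) + (1.0 - iou(b, gt_box))
        for s, b in zip(teacher_scores, teacher_boxes)
    ]
    order = sorted(range(len(costs)), key=costs.__getitem__)
    positives = set(order[:top_k])
    return [1 if i in positives else 0 for i in range(len(costs))]

def bce(preds, targets):
    """Mean binary cross-entropy; targets may be hard (0/1) or soft."""
    eps = 1e-9
    total = 0.0
    for p, t in zip(preds, targets):
        total += t * math.log(max(p, eps)) + (1 - t) * math.log(max(1 - p, eps))
    return -total / len(preds)

# Toy data: one ground-truth box and four anchors (hypothetical values).
gt_box = (10, 10, 50, 50)
teacher_boxes = [(12, 11, 49, 52), (0, 0, 20, 20), (11, 9, 51, 48), (60, 60, 90, 90)]
teacher_scores = [0.9, 0.2, 0.8, 0.1]   # teacher foreground probabilities
student_scores = [0.6, 0.4, 0.5, 0.3]   # student foreground probabilities

# LAD: the teacher's predictions decide which anchors are positive,
# and the student is trained against those hard labels.
hard_labels = teacher_assign(teacher_scores, teacher_boxes, gt_box)
lad_loss = bce(student_scores, hard_labels)

# Soft label: the student regresses the teacher's scores directly.
soft_loss = bce(student_scores, teacher_scores)
```

In this toy setup the two anchors that overlap the ground-truth box well and receive high teacher scores are marked as foreground. With LAD the student never sees the teacher's raw scores, only the resulting assignment, which is one plausible reading of why a smaller teacher can still be useful to a larger student.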

updated: Tue Aug 24 2021 04:37:48 GMT+0000 (UTC)

published: Tue Aug 24 2021 04:37:48 GMT+0000 (UTC)

arXiv
