Localization Distillation for Object Detection

Zhaohui Zheng; Rongguang Ye; Qibin Hou; Dongwei Ren; Ping Wang; Wangmeng Zuo; Ming-Ming Cheng

オブジェクト検出のためのローカリゼーション蒸留

オブジェクト検出のための以前の知識蒸留（KD）手法は、ローカリゼーション情報の抽出が非効率的であるため、分類ロジットを模倣するのではなく、主に特徴の模倣に焦点を合わせています。この論文では、ロジット模倣が常に機能模倣に遅れをとっているかどうかを調査します。この目標に向けて、まず、ローカリゼーションの知識を教師から生徒に効率的に伝達できる新しいローカリゼーション蒸留（LD）メソッドを紹介します。次に、特定の領域の分類とローカリゼーションの知識を選択的に抽出するのに役立つ、貴重なローカリゼーション領域の概念を紹介します。これらの2つの新しいコンポーネントを組み合わせることで、ロジット模倣が機能の模倣よりも優れていることを初めて示します。ローカリゼーション蒸留がないことが、ロジット模倣が何年にもわたってパフォーマンスを低下させる重要な理由です。徹底的な研究は、ローカリゼーションのあいまいさを大幅に軽減し、堅牢な機能表現を学習し、初期段階でトレーニングの難しさを軽減できるロジット模倣の大きな可能性を示しています。また、提案されたLDと分類KDの間の理論的な関係を提供し、それらが同等の最適化効果を共有するようにします。私たちの蒸留スキームはシンプルで効果的であり、高密度の水平物体検出器と回転物体検出器の両方に簡単に適用できます。 MS COCO、PASCAL VOC、およびDOTAベンチマークに関する広範な実験は、私たちの方法が推論速度を犠牲にすることなく、かなりのAP改善を達成できることを示しています。ソースコードと事前トレーニング済みモデルは、https：//github.com/HikariTJU/LDで公開されています。

Previous knowledge distillation (KD) methods for object detection mostly focus on feature imitation instead of mimicking the classification logits due to its inefficiency in distilling the localization information. In this paper, we investigate whether logit mimicking always lags behind feature imitation. Towards this goal, we first present a novel localization distillation (LD) method which can efficiently transfer the localization knowledge from the teacher to the student. Second, we introduce the concept of valuable localization region that can aid to selectively distill the classification and localization knowledge for a certain region. Combining these two new components, for the first time, we show that logit mimicking can outperform feature imitation and the absence of localization distillation is a critical reason for why logit mimicking underperforms for years. The thorough studies exhibit the great potential of logit mimicking that can significantly alleviate the localization ambiguity, learn robust feature representation, and ease the training difficulty in the early stage. We also provide the theoretical connection between the proposed LD and the classification KD, that they share the equivalent optimization effect. Our distillation scheme is simple as well as effective and can be easily applied to both dense horizontal object detectors and rotated object detectors. Extensive experiments on the MS COCO, PASCAL VOC, and DOTA benchmarks demonstrate that our method can achieve considerable AP improvement without any sacrifice on the inference speed. Our source code and pretrained models are publicly available at https://github.com/HikariTJU/LD.

updated: Tue Apr 12 2022 17:14:34 GMT+0000 (UTC)

published: Tue Apr 12 2022 17:14:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト