Scale-Equivalent Distillation for Semi-Supervised Object Detection

Qiushan Guo; Yao Mu; Jianyu Chen; Tianqi Wang; Yizhou Yu; Ping Luo

半教師あり物体検出のためのスケール等価蒸留

最近の半教師ありオブジェクト検出（SS-OD）の方法は、主に自己トレーニングに基づいています。つまり、教師モデルによって、ラベルのないデータを監視信号としてハード疑似ラベルを生成します。彼らは一定の成功を収めましたが、半教師あり学習の限られたラベル付きデータは、オブジェクト検出の課題を拡大します。これらの方法が経験的な実験結果と遭遇する課題を分析します。大量のFalseNegativeサンプルと劣ったローカリゼーション精度は考慮されていないことがわかります。その上、オブジェクトサイズの大きな変動とクラスの不均衡（つまり、背景とオブジェクトの極端な比率）は、先行技術のパフォーマンスを妨げます。さらに、新しいアプローチであるScale-Equivalent Distillation（SED）を導入することで、これらの課題を克服します。これは、大きなオブジェクトサイズの変動とクラスの不均衡に強い、シンプルでありながら効果的なエンドツーエンドの知識蒸留フレームワークです。 SEDには、以前の作品と比較していくつかの魅力的な利点があります。（1）SEDは、大規模な分散の問題を処理するために一貫性の正則化を課します。（2）SEDは、FalseNegativeサンプルと劣ったローカリゼーション精度からのノイズ問題を軽減します。（3）再重み付け戦略では、ラベルのないデータの潜在的な前景領域を暗黙的にスクリーニングして、クラスの不均衡の影響を減らすことができます。広範な実験により、SEDは、さまざまなデータセットで最近の最先端の方法を一貫して上回っており、かなりのマージンがあることが示されています。たとえば、MS-COCOで5％および10％のラベル付きデータを使用すると、監視対象のデータを10mAP以上上回ります。

Recent Semi-Supervised Object Detection (SS-OD) methods are mainly based on self-training, i.e., generating hard pseudo-labels by a teacher model on unlabeled data as supervisory signals. Although they achieved certain success, the limited labeled data in semi-supervised learning scales up the challenges of object detection. We analyze the challenges these methods meet with the empirical experiment results. We find that the massive False Negative samples and inferior localization precision lack consideration. Besides, the large variance of object sizes and class imbalance (i.e., the extreme ratio between background and object) hinder the performance of prior arts. Further, we overcome these challenges by introducing a novel approach, Scale-Equivalent Distillation (SED), which is a simple yet effective end-to-end knowledge distillation framework robust to large object size variance and class imbalance. SED has several appealing benefits compared to the previous works. (1) SED imposes a consistency regularization to handle the large scale variance problem. (2) SED alleviates the noise problem from the False Negative samples and inferior localization precision. (3) A re-weighting strategy can implicitly screen the potential foreground regions of the unlabeled data to reduce the effect of class imbalance. Extensive experiments show that SED consistently outperforms the recent state-of-the-art methods on different datasets with significant margins. For example, it surpasses the supervised counterpart by more than 10 mAP when using 5% and 10% labeled data on MS-COCO.

updated: Wed Mar 23 2022 07:33:37 GMT+0000 (UTC)

published: Wed Mar 23 2022 07:33:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト