Box-Level Active Detection

Mengyao Lyu; Jundong Zhou; Hui Chen; Yijie Huang; Dongdong Yu; Yaqian Li; Yandong Guo; Yuchen Guo; Liuyu Xiang; Guiguang Ding

ボックスレベルのアクティブ検出

アクティブラーニングは、予算内でアノテーション用の有益なサンプルを選択します。これは、最近オブジェクト検出で効率的であることが証明されています。ただし、広く使用されているアクティブ検出ベンチマークでは、画像レベルの評価が行われます。これは、人間のワークロードの推定では非現実的であり、混雑した画像に偏っています。さらに、既存の方法は依然として画像レベルの注釈を実行しますが、同じ画像内のすべてのターゲットを均等にスコアリングすると、予算の浪費と冗長なラベルが発生します。上記の問題と制限を明らかにしたので、サイクルごとにボックスベースの予算を制御し、有益なターゲットに優先順位を付け、公平な比較と効率的なアプリケーションのために冗長性を回避する、ボックスレベルのアクティブ検出フレームワークを導入します。提案されたボックスレベルの設定の下で、新しいパイプライン、つまり Complementary Pseudo Active Strategy (ComPAS) を考案します。人間の注釈とモデルインテリジェンスの両方を補完的な方法で利用します。効率的な入力側委員会は、有益なオブジェクトのみのラベルをクエリします。その間、十分に学習されたターゲットはモデルによって識別され、疑似ラベルで補正されます。 ComPAS は、統一されたコードベースの 4 つの設定で 10 の競合他社よりも一貫して優れています。ラベル付きデータのみからの監視により、わずか 19% のボックスアノテーションで VOC0712 の 100% 監視されたパフォーマンスを達成します。 COCO データセットでは、2 番目に優れた方法よりも最大 4.3% の mAP 改善が得られます。 ComPAS は、ラベルのないプールでのトレーニングもサポートしており、85% のラベル削減で 90% の COCO 監視パフォーマンスを上回ります。ソースコードは、https://github.com/lyumengyao/blad で公開されています。

Active learning selects informative samples for annotation within budget, which has proven efficient recently on object detection. However, the widely used active detection benchmarks conduct image-level evaluation, which is unrealistic in human workload estimation and biased towards crowded images. Furthermore, existing methods still perform image-level annotation, but equally scoring all targets within the same image incurs waste of budget and redundant labels. Having revealed above problems and limitations, we introduce a box-level active detection framework that controls a box-based budget per cycle, prioritizes informative targets and avoids redundancy for fair comparison and efficient application. Under the proposed box-level setting, we devise a novel pipeline, namely Complementary Pseudo Active Strategy (ComPAS). It exploits both human annotations and the model intelligence in a complementary fashion: an efficient input-end committee queries labels for informative objects only; meantime well-learned targets are identified by the model and compensated with pseudo-labels. ComPAS consistently outperforms 10 competitors under 4 settings in a unified codebase. With supervision from labeled data only, it achieves 100% supervised performance of VOC0712 with merely 19% box annotations. On the COCO dataset, it yields up to 4.3% mAP improvement over the second-best method. ComPAS also supports training with the unlabeled pool, where it surpasses 90% COCO supervised performance with 85% label reduction. Our source code is publicly available at https://github.com/lyumengyao/blad.

updated: Thu Mar 23 2023 08:06:10 GMT+0000 (UTC)

published: Thu Mar 23 2023 08:06:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト