Towards A Category-extended Object Detector without Relabeling or Conflicts

Bowen Zhao; Chen Chen; Wanpeng Xiao; Xi Xiao; Qi Ju; Shutao Xia

再ラベル付けや競合のないカテゴリ拡張オブジェクト検出器に向けて

オブジェクト検出器は通常、事前定義されたカテゴリが固定された完全に注釈が付けられたトレーニングデータに基づいて学習されます。ただし、多くの現実的なアプリケーションではクラスを段階的に増やす必要があることが多いため、関心のあるすべての可能なカテゴリを事前に知ることができるわけではありません。このようなシナリオでは、古いクラスで注釈が付けられた元のトレーニングセットと、新しいクラスでラベル付けされたいくつかの新しいトレーニングデータのみが使用可能です。この論文では、限られたデータセットに基づいて、余分な手作業なしですべてのカテゴリを処理できる強力な統合検出器の学習を目指しています。ラベルのあいまいさを考慮しないバニラジョイントトレーニングは、不完全な注釈のために大きなバイアスとパフォーマンスの低下につながります。このような状況を回避するために、3つの側面に焦点を当てた実用的なフレームワークを提案します。より良いベースモデル、より良いラベルのないグラウンドトゥルースマイニング戦略、および疑似アノテーションを使用したより良い再トレーニング方法です。最初に、使用可能な塩基検出器を得るために、競合のない損失が提案されます。次に、モンテカルロドロップアウトを使用して、ローカリゼーションの信頼度を分類の信頼度と組み合わせて計算し、より正確な境界ボックスをマイニングします。第三に、より強力な検出器を実現するために、再トレーニング中に疑似注釈をより有効に活用するためのいくつかの戦略を検討します。複数のデータセットで実施された広範な実験は、カテゴリ拡張オブジェクト検出器のフレームワークの有効性を示しています。

Object detectors are typically learned based on fully-annotated training data with fixed pre-defined categories. However, not all possible categories of interest can be known beforehand, as classes are often required to be increased progressively in many realistic applications. In such scenario, only the original training set annotated with the old classes and some new training data labeled with the new classes are available. In this paper, we aim at leaning a strong unified detector that can handle all categories based on the limited datasets without extra manual labor. Vanilla joint training without considering label ambiguity leads to heavy biases and poor performance due to the incomplete annotations. To avoid such situation, we propose a practical framework which focuses on three aspects: better base model, better unlabeled ground-truth mining strategy and better retraining method with pseudo annotations. First, a conflict-free loss is proposed to obtain a usable base detector. Second, we employ Monte Carlo Dropout to calculate the localization confidence, combined with the classification confidence, to mine more accurate bounding boxes. Third, we explore several strategies for making better use of pseudo annotations during retraining to achieve more powerful detectors. Extensive experiments conducted on multiple datasets demonstrate the effectiveness of our framework for category-extended object detectors.

updated: Mon Dec 28 2020 06:44:53 GMT+0000 (UTC)

published: Mon Dec 28 2020 06:44:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト